Hadoop

TwitterFacebook
Get flash to fully experience Pearltrees
Use HBase when you need random, realtime read/write access to your Big Data.

HBase

http://hbase.apache.org/
http://wiki.apache.org/hadoop/PoweredBy

PoweredBy Hadoop

This page documents an alphabetical list of institutions that are using Hadoop for educational or production uses. Companies that offer services on or based around Hadoop are listed in Distributions and Commercial Support .
http://www.allthingsdistributed.com/

All Things Distributed

Today is a very exciting day as we release Amazon DynamoDB , a fast, highly reliable and cost-effective NoSQL database service designed for internet scale applications. DynamoDB is the result of 15 years of learning in the areas of large scale non-relational databases and cloud services. Several years ago we published a paper on the details of Amazon’s Dynamo technology , which was one of the first non-relational databases developed at Amazon.

Hadoop

The Apache Hadoop software library is a framework that allows for the distributed processing of large data sets across clusters of computers using a simple programming model. http://hadoop.apache.org/
http://hadoop.apache.org/hdfs/ Hadoop Distributed File System (HDFS™) is the primary storage system used by Hadoop applications.

HDFS

Qu'est-ce que Hadoop ?

http://www.piloter.org/business-intelligence/hadoop.htm H adoop est un projet Open Source géré par Apache Software Fundation basé sur le principe Map Reduce et de Google File System, deux produits Google Corp. Le produit est écrit en langage Java. Hadoop peut être considéré comme un système de traitement de données évolutif pour le stockage et le traitement par lot de très grande quantité de données.
Dhruba Borthakur, a Hadoop Engineer at Facebook, has published part of a paper he co-authored with several of his engineering co-workers on Apache Hadoop.

Why Facebook Uses Apache Hadoop and HBase

http://www.readwriteweb.com/hack/2011/05/why-facebook-uses-apache-hadoo.php
http://fr.wikipedia.org/wiki/MapReduce

MapReduce

Un article de Wikipédia, l'encyclopédie libre. MapReduce est un framework de développement informatique , introduit par Google , dans lequel sont effectués des calculs parallèles , et souvent distribués , de données potentiellement très volumineuses (> 1 terabyte ).