
NoSQL
Get flash to fully experience Pearltrees
Over the last couple years, we see an emerging data storage mechanism for storing large scale of data. These storage solution differs quite significantly with the RDBMS model and is also known as the NOSQL. Some of the key players include ... GoogleBigTable, HBase, Hypertable AmazonDynamo , Voldemort, Cassendra, Riak Redis CouchDB , MongoDB These solutions has a number of characteristics in common Key value store Run on large number of commodity machines Data are partitioned and replicated among these machines Relax the data consistency requirement.
NOSQL Patterns
In my previous post, I talk about the methodology of transforming a sequential algorithm into parallel. After that, we can implement the parallel algorithm, one of the popular framework we can use is the Apache Opensource Hadoop Map/Reduce framework. Functional Programming
Hadoop Map/Reduce Implementation
Paper: A Study of Practical Deduplication
With BigData comes BigStorage costs. One way to store less is simply not to store the same data twice . That's the radically simple and powerful notion behind data deduplication . If you are one of those who got a good laugh out of the idea of eliminating SQL queries as a rather obvious scalability strategy, you'll love this one, but it is a powerful feature and one I don't hear talked about outside the enterprise.hadoop
hbase

