No SQL ?
NoSQL Style (Gangnam Style Parody for Geeks)
UC Berkeley Course Lectures: Analyzing Big Data With Twitter | Analyzing Big Data with Twitter Thank you all for a wonderful semester. Here is a summary, in chronological order, of our recorded lectures. You can also view the entire playlist on youtube. Course Introduction
This is a guest post by Siddharth Anand, a senior member of LinkedIn's Distributed Data Systems team. Preamble Ramble If you’ve been working in the online (e.g. internet) space over the past 3 years, you are no stranger to terms like “the cloud” and “NoSQL”. In 2007, Amazon published a paper on Dynamo. The State of NoSQL in 2012
I looked at neo4j briefly the other day, and quite predictably thought ‘wow, this looks like a serious tinkertoy: it‘s basically a bunch of nodes where you just blob your attributes.‘ Worse than that, to wrap objects around it, you have to have them explicitly incorporate their node class, which is ugly, smelly, violates every law of separation of concerns and logical vs. physical models. On the plus side, as I started to look at it more, I realized that it was the perfect way to implement a backend for a bayesian inference engine (more on that later). Why? The Coming SQL Collapse
Choosing a non-relational database; why we migrated from MySQL to MongoDB « Boxed Ice Blog
Welcome to Apache HBase™ Apache HBase™ is the Hadoop database, a distributed, scalable, big data store. When Would I Use Apache HBase? Use Apache HBase when you need random, realtime read/write access to your Big Data. This project's goal is the hosting of very large tables -- billions of rows X millions of columns -- atop clusters of commodity hardware. Apache HBase is an open-source, distributed, versioned, non-relational database modeled after Google's Bigtable: A Distributed Storage System for Structured Data by Chang et al.
July 1, 2011 For most Riak users, Bitcask is the obvious right storage engine to use. It provides low latency, solid predictability, is robust in the face of crashes, and is friendly from a filesystem backup point of view. However, it has one notable limitation: total RAM use depends linearly (though via a small constant) on the total number of objects stored. Leveling the Field
(Yes it's a long title, since people kept asking me to write about this and that too :) I do when it has a point.) While SQL databases are insanely useful tools, their monopoly in the last decades is coming to an end. And it's just time: I can't even count the things that were forced into relational databases, but never really fitted them. (That being said, relational databases will always be the best for the stuff that has relations.) But, the differences between NoSQL databases are much bigger than ever was between one SQL database and another.
We Recommend These Resources I have been thinking about taking a deep dive into NoSQL databases for a long time but wasn't sure which one should I start with. There are a lot of NoSQL databases in the market, each solving a set of problems. I wanted to learn a NoSQL database that does not have a steep learning curve and is generic enough to solve more than one problem. Introduction to Redis - In Memory Key Value Datastore
s tech blog » Why startups should not choose NoSQL December 27, 2010 The NoSQL hype is omnipresent. And many startups are tempted to go for Cassandra/MongoDB/HBase/Redis/… . Here I’ll argue why they should rather stick to a SQL solution – MySQL or PostgreSQL. In my previous post about Cassandra I detailed why I decided not to use it. Now, a dozen presentations watched and several dozen articles read later, I can detail why I think it is not generally a good idea.
BabuDB is an embedded non-relational database system. Its lean and simple design allows it to persistently store large amounts of key-value pairs without the overhead and complexity of similar approaches such as BerkeleyDB. Key features: Support for large-scale databases that exceed the system's main memory Efficient crash recovery Snapshots and asynchronous dumps Prefix and range lookups Transparent replication with tuneable consistency/performance trade-offs BabuDB has been independently implemented for Java and C++ (Win32/Linux). APIs and database formats of both implementations are not compatible. babudb - Project Hosting on Google Code
Banana DB - self-contained key/value pair database for java Banana DB Banana DB is a self-contained key/value pair database implemented in Java. Features Small, ~100KB .jar file with no dependencies.Top level API similar to working with any Map<K, V> on to of core <byte, byte> index.Thread safe and write locking over multiple JVMs using extended Lucene locks.Optionally transactional.More or less ACID -compliant.Simple annotational API.Pluggable serialization strategies. (java.io.Serializable as default) Upcoming features
non-RDBM distributed databases, map/reduce, key/value and cloud computing
Written on 19 January 2009 Please Note: this was written January 2009 - see the comments for updates and additional information. A lot has changed since I wrote this. Richard Jones | Anti-RDBMS: A list of distributed key-value stores | Richard Jones, Esq.
Troubles with Sharding - What can we learn from the Foursquare Incident? For everything given something seems to be taken. Caching is a great scalability solution, but caching also comes with problems. Sharding is a great scalability solution, but as Foursquare recently revealed in a post-mortem about their 17 hours of downtime, sharding also has problems. MongoDB, the database Foursquare uses, also contributed their post-mortem of what went wrong too. Now that everyone has shared and resharded, what can we learn to help us skip these mistakes and quickly move on to a different set of mistakes? First, like for Facebook, huge props to Foursquare and MongoDB for being upfront and honest about their problems.
A Relational Database Management System NoSQL is a fast, portable, relational database management system without arbitrary limits, (other than memory and processor speed) that runs under, and interacts with, the UNIX1 Operating System. It uses the "Operator-Stream Paradigm" described in "Unix Review", March, 1991, page 24, entitled "A 4GL Language". There are a number of "operators" that each perform a unique function on the data. NoSQL/Home Page
A Lightweight SQL Database for Cloud and Web in Launchpad Log in / Register A Lightweight SQL Database for Cloud Infrastructure and Web Applications Registered 2008-05-12 by Drizzle Developers
Eric Lai published a provoking article on Computerworld magazine titled “No to SQL? Anti-database movement gains steam” where he pointed to many references in which different Internet-based companies chose an alternative approach to the traditional SQL database. The write-up was driven from the the inaugural get-together of the burgeoning NoSQL community who seem to represent a growing Anti-SQL database movement. Quoting Jon Travis from this article: No to SQL? Anti-database movement gains stea
Most Trendy Graph Databases
NoSQL Graph Database Comparison