Big Data

TwitterFacebook
Get flash to fully experience Pearltrees
db4o : API: Java, C#, .Net Langs , Protocol: language , Query Method: QBE (by Example), Soda, Native Queries, LINQ (.NET) , Replication: db4o2db4o & dRS to relationals , Written in: Java , Cuncurrency: ACID serialized , Misc: embedded lib, Links : DZone Refcard #53 » , Book » , Versant : Languages/Protocol: Java, C#, C++, Python . Schema: language class model (easy changable).

NOSQL Databases

http://nosql-database.org/

Home - Apache Hive - Apache Software Foundation

Skip to end of metadata Go to start of metadata The Apache Hive TM data warehouse software facilitates querying and managing large datasets residing in distributed storage. Built on top of Apache Hadoop TM , it provides Hive defines a simple SQL-like query language, called QL, that enables users familiar with SQL to query the data. At the same time, this language also allows programmers who are familiar with the MapReduce framework to be able to plug in their custom mappers and reducers to perform more sophisticated analysis that may not be supported by the built-in capabilities of the language. https://cwiki.apache.org/confluence/display/Hive/Home

The Definition of Enterprise Big Data - Wikibon

http://wikibon.org/wiki/v/Enterprise_Big-data With David Vellante With the inaugural O'Reilly Media Strata conference, the topic of data (aka "Big Data" ) is coming into sharper focus. When O'Reilly initiates coverage of a topic through an event like Strata, you can be sure the content will be well-thought-out, rich, relevant and visionary in nature.
Big data is usually discussed in terms of its applicability to business or scientific research, but it can be valuable for much more. Consider, for instance, the release of Continue reading » At the Strata Jumpstart session on Tuesday, Diego Saenz of Data Driven CEO made the case for three skills that are must haves for CEOs to become " data driven." http://www.readwriteweb.com/cloud/tag/big+data

big data - ReadWriteCloud

Hadoop Blog

http://developer.yahoo.com/blogs/hadoop/ Hadoop Summit 2011 is over. If you saw this tweet ”#hadoopsummit planned for 1,500. upped on demand to 1,600. finally accommodated 1,700. ran out of space, good problem to have. :-),” then you probably got an idea of how exciting and mobbed the conference was this year. With folks dropping by from coast-to-coast, and quite [...] On June 29, Yahoo! will host the 4th annual Hadoop Summit at the Santa Clara Convention Center.
The Apache Hadoop software library is a framework that allows for the distributed processing of large data sets across clusters of computers using a simple programming model. It is designed to scale up from single servers to thousands of machines, each offering local computation and storage. Rather than rely on hardware to deliver high-avaiability, the library itself is designed to detect and handle failures at the application layer, so delivering a highly-availabile service on top of a cluster of computers, each of which may be prone to failures. A wide variety of companies and organizations use Hadoop for both research and production. Users are encouraged to add themselves to the Hadoop PoweredBy wiki page. Described by the judging panel as a "Swiss army knife of the 21st century", Apache Hadoop picked up the innovator of the year award for having the potential to change the face of media innovations. http://hadoop.apache.org/

Welcome to Apache™ Hadoop™!

This blog was originally posted on the Apache Blog: https://blogs.apache.org/sqoop/entry/apache_sqoop_graduates_from_incubator Apache Sqoop is a tool designed for efficiently transferring bulk data between Apache Hadoop and structured datastores such as relational databases. You can use Sqoop to import data from external structured datastores into Hadoop Distributed File System or related systems like Hive and HBase. Conversely, Sqoop can be used to extract data from Hadoop and export it to external structured datastores such as relational databases and enterprise data warehouses. In its monthly meeting in March of 2012, the board of Apache Software Foundation (ASF) resolved to grant a Top-Level Project status to Apache Sqoop, thus graduating it from the Incubator.

Blog | Apache Hadoop for the Enterprise | Cloudera

http://www.cloudera.com/blog/
http://www.mckinsey.com/Insights/MGI/Research/Technology_and_Innovation/Big_data_The_next_frontier_for_innovation The amount of data in our world has been exploding, and analyzing large data sets—so-called big data—will become a key basis of competition, underpinning new waves of productivity growth, innovation, and consumer surplus, according to research by MGI and McKinsey's Business Technology Office. Leaders in every sector will have to grapple with the implications of big data, not just a few data-oriented managers. The increasing volume and detail of information captured by enterprises, the rise of multimedia, social media, and the Internet of Things will fuel exponential growth in data for the foreseeable future. Deep analytical talent: Where are they now? Research by MGI and McKinsey's Business Technology Office examines the state of digital data and documents the significant value that can potentially be unlocked.

Company - Report - Big data: The next frontier for innovation, competition, and productivity - May 2011