Google-Hadoop

TwitterFacebook
Get flash to fully experience Pearltrees
Nutch is coded entirely in the Java programming language , but data is written in language-independent formats. It has a highly modular architecture, allowing developers to create plug-ins for media-type parsing, data retrieval, querying and clustering. In June, 2003, a successful 100-million-page demonstration system was developed. http://en.wikipedia.org/wiki/Nutch

Nutch - Wikipedia, the free encyclopedia

http://en.wikipedia.org/wiki/Apache_Hadoop

Hadoop - Wikipedia, the free encyclopedia

Apache Hadoop is a software framework that supports data-intensive distributed applications under a free license . [ 1 ] It enables applications to work with thousands of computational independent computers and petabytes of data. Hadoop was derived from Google 's MapReduce and Google File System (GFS) papers.
Douglass Read Cutting is an advocate and creator of open-source search technology.

Doug Cutting - Wikipedia, the free encyclopedia

http://en.wikipedia.org/wiki/Doug_Cutting

Welcome to Apache Hadoop!

The Apache Hadoop software library is a framework that allows for the distributed processing of large data sets across clusters of computers using a simple programming model. http://hadoop.apache.org/
The USPTO awarded search giant Google a software method patent that covers the principle of distributed MapReduce, a strategy for parallel processing that is used by the search giant. If Google chooses to aggressively enforce the patent, it could have significant implications for some open source software projects that use the technique, including the Apache Foundation's popular Hadoop software framework. "Map" and "reduce" are functional programming primitives that have been used in software development for decades. http://arstechnica.com/open-source/news/2010/01/googles-mapreduce-patent-what-does-it-mean-for-hadoop.ars

Google's MapReduce patent: what does it mean for Hadoop?

http://gigaom.com/2010/01/19/why-hadoop-users-shouldnt-fear-googles-new-mapreduce-patent/

Why Hadoop Users Shouldn’t Fear Google’s New MapReduce Patent

Updated : Google, nearly six years since it first applied for it, has finally received a patent for its MapReduce parallel programming model . The question now is how this will affect the various products and projects that utilize MapReduce.

Google grants license for Apache Hadoop - The H Open Source: News and Features

Google has granted a license for one of its patents to the Apache Hadoop open source framework for distributed computing. http://www.h-online.com/open/news/item/Google-grants-license-for-Apache-Hadoop-987722.html
http://www.businessweek.com/magazine/content/07_52/b4064000281756.htm Why are search engines so fast?

The Two Flavors of Google