
Google-Hadoop
Nutch is an effort to build an open source web search engine based on Lucene and Java for the search and index component. [ edit ] Features Nutch is coded entirely in the Java programming language , but data is written in language-independent formats.
Nutch
Hadoop
Apache Hadoop is an open-source software framework that supports data-intensive distributed applications , licensed under the Apache v2 license. It supports the running of applications on large clusters of commodity hardware.The USPTO awarded search giant Google a software method patent that covers the principle of distributed MapReduce, a strategy for parallel processing that is used by the search giant. If Google chooses to aggressively enforce the patent, it could have significant implications for some open source software projects that use the technique, including the Apache Foundation's popular Hadoop software framework. "Map" and "reduce" are functional programming primitives that have been used in software development for decades.

