Nutch is an effort to build an open source web search engine based on Lucene and Java for the search and index component. Nutch is coded entirely in the Java programming language, but data is written in language-independent formats.
Apache Hadoop is an open-source software framework that supports data-intensive distributed applications, licensed under the Apache v2 license. It supports the running of applications on large clusters of commodity hardware.
The USPTO awarded search giant Google a software method patent that covers the principle of distributed MapReduce, a strategy for parallel processing that the company uses. If Google chooses to aggressively enforce the patent, it could have significant implications for some open source software projects that use the technique, including the Apache Software Foundation's popular Hadoop software framework. "Map" and "reduce" are functional programming primitives that have been used in software development for decades.
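To make the two primitives concrete, here is a minimal single-process sketch in Python of a word count built from a map step and a reduce step. It only illustrates the functional primitives themselves; it is not Google's patented distributed method or Hadoop's API, and the names `map_phase`, `reduce_phase`, and `documents` are hypothetical, chosen for this example.

```python
from collections import Counter
from functools import reduce

def map_phase(document):
    """Map step: count the words of one document, independently of the others."""
    return Counter(document.lower().split())

def reduce_phase(left, right):
    """Reduce step: merge two partial word counts into one."""
    return left + right

# Hypothetical input; in a distributed system each document could sit on a different machine.
documents = ["the quick brown fox", "the lazy dog", "the fox"]

partial_counts = map(map_phase, documents)
word_counts = reduce(reduce_phase, partial_counts, Counter())
```

Because each call to `map_phase` looks at a single document and `reduce_phase` is associative, the work could in principle be spread across many machines and the partial results merged in any grouping, which is the property that distributed frameworks build on.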
Updated: Nearly six years after it first applied, Google has finally received a patent for its MapReduce parallel programming model. The question now is how this will affect the various products and projects that utilize MapReduce.
Google has granted a license for one of its patents to the Apache Hadoop open source framework for distributed computing.
A battle could be shaping up between the two leading software platforms for cloud computing, one proprietary and the other open-source.
Why are search engines so fast?