Huge Data

One of the key factors behind Lucene's popularity and success is its simplicity. The careful exposure of its indexing and searching API is a sign of well-designed software. Consequently, you don't need in-depth knowledge of how Lucene's indexing and retrieval work in order to start using it.

Meet Lucene - Developer.com

http://www.developer.com/open/article.php/3490471/Meet-Lucene.htm
Connecting your users with the content they need when they need it isn't just optional anymore. With the rise of Google and similarly sophisticated search engines, users expect high-quality search results that help them find what they're looking for quickly and easily. Your manager has equally high expectations for your online shopping site -- a scalable, highly available, easy-to-maintain search solution that doesn't cost a fortune to install. As for you, you just want to advance your career, keep your employer and customers happy, and hold on to your sanity. You can meet all these needs with Apache Solr, an open source, Lucene Java-based search server that is easy to incorporate into your Web applications. http://www.ibm.com/developerworks/java/library/j-solr1/

Search smarter with Apache Solr, Part 1: Essential features and the Solr schema

Delve inside the Lucene indexing mechanism

This article introduces you to the indexing mechanism of Lucene, a popular full-text IR library written in the Java language. First, I'll demonstrate how to index your documents with Lucene, then I'll discuss how to improve the indexing performance. Finally, I'll analyze Lucene's index file structure. http://www.ibm.com/developerworks/library/wa-lucene/
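As a rough illustration of what "indexing" means here, the sketch below builds a toy inverted index (a map from each term to the documents that contain it), which is the general idea behind full-text libraries such as Lucene. It is not Lucene's API or its on-disk format; the `ToyIndex` class, its method names, and the sample documents are all hypothetical and chosen only for this example.

```python
import re
from collections import defaultdict


def tokenize(text):
    """Lowercase the text and split it into alphanumeric terms."""
    return re.findall(r"[a-z0-9]+", text.lower())


class ToyIndex:
    """A toy in-memory inverted index: term -> set of document ids.

    Hypothetical helper for illustration; not Lucene's data structure.
    """

    def __init__(self):
        self.postings = defaultdict(set)
        self.documents = {}

    def add_document(self, doc_id, text):
        """Store the document and record each of its terms in the index."""
        self.documents[doc_id] = text
        for term in tokenize(text):
            self.postings[term].add(doc_id)

    def search(self, query):
        """Return the ids of documents that contain every query term."""
        terms = tokenize(query)
        if not terms:
            return set()
        result = set(self.postings.get(terms[0], set()))
        for term in terms[1:]:
            result &= self.postings.get(term, set())
        return result


index = ToyIndex()
index.add_document(1, "Lucene is a full-text search library")
index.add_document(2, "Solr is a search server built on Lucene")
index.add_document(3, "CouchDB stores JSON documents")

print(sorted(index.search("lucene search")))  # documents 1 and 2 match both terms
```

A real library adds analysis (stemming, stop words), relevance scoring, and persistent index files on top of this basic term-to-document mapping.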
http://blog.thecapacity.org/2009/03/04/how-to-build-couchdb-on-dreamhost/ As you know from many of my entries, I’m a big fan of couchdb, and if you’re interested you should really be following janl, jchris and lethain as they push this technology forward. As you might also guess from my earlier post, I’m working to build and install it on Dreamhost, another thing I support enthusiastically. Unfortunately, being on the outer fringe of technology meant I wasn’t able to get them to install it for me, but that’s completely understandable. The current package release has no Auth support (I believe the repository builds do, but that would have required more software installs), so if I were supporting a multi-user production environment it might make me a little nervous too.

How to build Couchdb on Dreamhost | thecapacity

http://www.elasticsearch.org/

elasticsearch - Open Source, Distributed, RESTful, Search Engine

So, we build a web site or an application and want to add search to it, and then it hits us: getting search working is hard. We want our search solution to be fast, we want a painless setup and a completely free search schema, we want to be able to index data simply using JSON over HTTP, we want our search server to be always available, we want to be able to start with one machine and scale to hundreds, we want real-time search, we want simple multi-tenancy, and we want a solution that is built for the cloud. "This should be easier", we declared, "and cool, bonsai cool". elasticsearch aims to solve all these problems and more.
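To make "index data simply using JSON over HTTP" concrete, here is a minimal sketch that builds such a request with Python's standard library without sending it. The host and port (`localhost:9200`), the index name `articles`, the sample document, and the helper `build_index_request` are assumptions made for illustration. The `/{index}/_doc/{id}` path follows the form used by recent Elasticsearch releases; older releases put a type name in that position, so check the documentation for your version.

```python
import json
import urllib.request


def build_index_request(base_url, index, doc_id, document):
    """Build (but do not send) an HTTP PUT request carrying a JSON document.

    Hypothetical helper; the URL layout is an assumption, see the note above.
    """
    body = json.dumps(document).encode("utf-8")
    url = f"{base_url}/{index}/_doc/{doc_id}"
    return urllib.request.Request(
        url,
        data=body,
        method="PUT",
        headers={"Content-Type": "application/json"},
    )


request = build_index_request(
    "http://localhost:9200",  # assumed address of a local node
    "articles",               # hypothetical index name
    1,
    {"title": "Meet Lucene", "tags": ["search", "lucene"]},
)

print(request.get_method(), request.full_url)
# Sending is left out on purpose; with a server running, it would be:
# urllib.request.urlopen(request)
```

Keeping request construction separate from sending makes the payload easy to inspect before any server is involved.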