background preloader

Huge Data

Facebook Twitter

Meet Lucene. Understanding LuceneUsing the basic indexing APIWorking with the search APIConsidering alternative products One of the key factors behind Lucene's popularity and success is its simplicity. The careful exposure of its indexing and searching API is a sign of the well-designed software. Consequently, you don't need in-depth knowledge about how Lucene's information indexing and retrieval work in order to start using it. Moreover, Lucene's straightforward API requires you to learn how to use only a handful of its classes. In this chapter [From the Manning book Lucene in Action], we show you how to perform basic indexing and searching with Lucene with ready-to-use code examples. 1.1 Evolution of information organization and access In order to make sense of the perceived complexity of the world, humans have invented categorizations, classifications, genuses, species, and other types of hierarchical organizational schemes.

Click here for a larger image. Search functionality is everywhere! Lucene basics. How to build Couchdb on Dreamhost | thecapacity. As you know from many of my entries I’m a big fan of couchdb , and if you’re interested you should really be following janl , jchris and lethain as they push this technology forward. As you might also guess from my earlier post I’m working to build and install it on Dreamhost , another thing I support enthusiastically. Unfortunately, being on the outer fringe of technology meant I wasn’t able to get them to install it for me, but that’s completely understandable.

Given that the current package release has no Auth support (I believe the repository builds do but that would have required more software installs) if I were supporting a multi-user production environment it might make me a little nervous too. However, to in order to continue my interests it’s a major component so I wanted to give it a shot. I don’t have it up and running 100% right now (it appears to have run though I can’t connect) but I wanted to document to build side of things before I forgot So here’s the rundown; . Elasticsearch - - Open Source, Distributed, RESTful, Search Engine.

Home.