
semantic
Get flash to fully experience Pearltrees
microformats
clipboard
POWDER: Use Cases and Requirements
SIREn: Efficient semi-structured Information Retrieval for Lucene Efficient, large scale handling of semi-structured data (including RDF ) is increasingly an important issue to many web and enterprise information reuse scenarios. Querying graph structured data (RDF) is commonly achieved using specific solutions, called triplestores , typically based on DBMS backends. In Sindice we however needed something much more scalable than DBMS and with the desirable features of the typical Web Search engines: top-k query processing, real time updates, full text search, distributed indexes over shards, etc. While Lucene has long offered these capabilities, its native capabilities are not intended for large semi-structured document collections (or documents with very different schemas).

