background preloader

Apache UIMA - Apache UIMA

Apache UIMA - Apache UIMA
Related:  Concept extractionAI

Apache Stanbol - Welcome to Apache Stanbol! Knowledge from Information by Matthias Broecheler LanguageWare Resource Workbench Update: July 20, 2012: Studio 3.0 is out and it is officially bundled with ICA 3.0. If you are a Studio 3.0 user, please use ICA forum instead of LRW forum. LRW is a fixpack that resolves issues in various areas including the Parsing Rules editor, PEAR file export and Japanese/Chinese language support. LRW is still available for download on the Downloads link for IBM OmniFind Enterprise Edition V9.1 Fix Pack users. What is IBM LanguageWare? IBM LanguageWare is a technology which provides a full range of text analysis functions. It is used extensively throughout the IBM product suite and is successfully deployed in solutions which focus on mining facts from large repositories of text. LanguageWare is the ideal solution for extracting the value locked up in unstructured text information and exposing it to business applications. It comprises Java libraries with a large set of features and the linguistic resources that supplement them. How does it work? More information FAQs 1.

mnoGoSearch - Internet search engine software maui-indexer - Maui - Multi-purpose automatic topic indexing Summary Maui automatically identifies main topics in text documents. Depending on the task, topics are tags, keywords, keyphrases, vocabulary terms, descriptors, index terms or titles of Wikipedia articles. Maui performs the following tasks: term assignment with a controlled vocabulary (or thesaurus) subject indexing topic indexing with terms from Wikipedia keyphrase extraction terminology extraction automatic tagging It can also be used for terminology extraction and semi-automatic topic indexing. New:Try out Maui demo! Important: Questions regarding usage, bug reports or support? Also: read more on Download, Installation and Usage pages. Domain and language independence Maui has been successfully tested on computer science, agricultural, medicine, physics, biology, bioinformatics documents, as well as on blog posts and news articles. Examples are provided in Maui's Wiki pages Background Maui has been developed by Olena Medelyan as a part of her PhD project, under supervision of Ian H.

Another Word For It Apache Jena - Apache Jena Welcome to Apache Nutch™ Kea 1. Documents - Kea gets a directory name and processes all documents in this directory that have the extension ".txt". The default language and the encoding is set to English, but this can be changed as long as a corresponding stopword file and a stemmer is provided. 2. 3. 4. TFxIDF is a measure describing the specificity of a term for this document under consideration, compared to all other documents in the corpus. 5. 6. Computational Creativity Triplify — Agile Knowledge Management and Semantic Web (AKSW) More than 20 European Union Datasets Converted to RDF by LATC Project Over the past two years, the LATC project (Linked Open Data Around-The-Clock) has worked on converting more than 20 EU datasets to RDF, make them available as Linked Data and SPARQL, and link them to other datasets. The datasets have gone through internal quality assurance against a publication checklist. Read more about "More than 20 European Union Datasets Converted to RDF by LATC Project" May 4-5: Leipziger Semantic Web Tag 2011 and Local Media Conferenz Like in the past two years, we again organize a Leipzig Semantic Web Day on May 5th at the marvelous Mediencampus Villa Ida. Triplification Challenge Winners Today we announced the winners of this year’s Triplification Challenge, which have been selected from 23 submissions. Semantic Web Journal launched A short while ago the Semantic Web Journal was launched. Semantic Pingback