background preloader

Semantic (web)

Facebook Twitter

Web api

Linked Data. - Semantic Fingerprinting. Our Semantic Fingerprinting method enables the creation of a unique semantic fingerprint for any word, any document, and in the near future even for any entity that can be described with natural language. - Semantic Fingerprinting

The big difference to conventional semantic systems is that the conversion of words into their semantic fingerprints through's Retina is automated. There is no need for costly, time-consuming manual intervention anymore. The invention of's Retina could revolutionize the search and analysis of text-based information, not only because of its transparency and simplicity of use, but also because of its small footprint: huge amounts of text –structured and unstructured- can be processed with moderate computational power. Content extraction with apache tika. Apache Tika - Apache Tika. Semantic tools for big data. Robust Hyperlinks. Proceedings of Digital Documents and Electronic Publishing (DDEP00), Munich, Germany, 13-15 September 2000 In Springer-Verlag Lecture Notes in Computer Science.

Robust Hyperlinks

Copyright © 2000 Springer-Verlag Thomas A. Phelps and Robert Wilensky Division of Computer Science University of California, Berkeley Berkeley, CA, Web Site Abstract. Robust hyperlinks exhibit a number of desirable qualities: They can be computed and exploited automatically, are small and cheap to compute (so that it is practical to make all hyperlinks robust), do not require new server or infrastructure support, can be rolled out reasonably well in the existing URL syntax (so they can retrofit existing links to make them robust), and are easy to understand.

Robust hyperlinks are one example of using the web to bootstrap new features onto itself. Table: Sample Signatures and Query Results. URLsignatureServerRobust ok? 3. 3.1 Encoding Signatures in URLs. ConceptNet 5. UIMA - Standard for unstructured information. UIMA is a component software architecture for the development, discovery, composition, and deployment of multi-modal analytics for the analysis of unstructured information and its integration with search technologies developed by IBM.

UIMA - Standard for unstructured information

The source code for a reference implementation of this framework has been made available on SourceForge, and later on the website of the Apache Software Foundation. Another use of UIMA is in systems that are used in medical contexts to analyze clinical notes, such as the Clinical Text Analysis and Knowledge Extraction System (CTAKES). Structure of UIMA[edit] The UIMA architecture can be thought of in four dimensions: IBM Watson - The Jeopardy Challenge[edit] In February 2011 a computer from IBM Research named Watson won a competition on Jeopardy!

See also[edit] References[edit] Supercalculateur sémantique. "Watson" est un superordinateur de la firme IBM.

Supercalculateur sémantique

Il associe la puissance matérielle (quantitative) à la puissance logicielle (qualitative). Au plan matériel, Watson dispose d'un système d'exploitation GNU-Linux, composé de 10 racks contenant chacun 9 serveurs Power 750 montés en réseau. Chaque serveur possède 32 coeurs qui peuvent gérer un total de 128 tâches en parallèle. "Watson", qui compte donc au total 2 880 coeurs pouvant effectuer 11 520 tâches en parallèle, possède une mémoire vive de 15 000 Go (gigaoctets) et une puissance totale de 80 Tflop (téraflops). À titre comparatif, Deep-Blue, vainqueur en 1966 de Kasparov aux échecs, (n')avait (qu')une puissance totale de 1 Tflop.

Au plan logiciel, les chercheurs de IBM ont développé la suite logicielle "IBM DeepQA" capable d’analyser le langage et les connaissances humaines dans des contexte ambigu et de développer ses connaissance par apprentissage. Ingénieur Développement Hadoop (H/F) at Thales in Sophia Antipolis - Job. Index of /pubs/crosswikis-data.tar.bz2.