background preloader

Extraction

Facebook Twitter

Data Extraction without semantic web. Ok, now imagine you could extract any data from the web or any (un-)structured datasets… you got it ?

Data Extraction without semantic web

Okay now you certainly imagine it’s just a dream, or a very expensive technology. I thought just like you, until i came across this API : with features like these : Entity ExtractionText CategorizationLanguage DetectionConcept TaggingKeyword ExtractionText ExtractionContent Scraping and Structured Data ExtractionMicroformats Parsing/ExtractionRSS / ATOM detection And the basic API is for FREE, and available with a SDK in many languages. I already tried the API in Python and it will clearly be useful to me, now or in the future. My Content Extraction Jobs. RDFa API. Abstract RDFa [RDFA-CORE] enables authors to publish structured information that is both human- and machine-readable.

RDFa API

Concepts that have traditionally been difficult for machines to detect, like people, places, events, music, movies, and recipes, are now easily marked up in Web documents. While publishing this data is vital to the growth of Linked Data, using the information to improve the collective utility of the Web for humankind is the true goal. To accomplish this goal, it must be simple for Web developers to extract and utilize structured information from a Web document. 26 New APIs: Photo Sharing, Mobile Barcodes and Feed Aggregation.