background preloader

Open Calais

Facebook Twitter

From a user perspective it’s pretty simple: You hand the web service unstructured text (like news articles, blog postings, your term paper, etc) and it returns semantic metadata in RDF format. What’s happening in the background is a little more complicated.



Using natural language processing and machine learning techniques, the Calais web service looks inside your text and locates the entities (people, places, products, etc), facts (John Doe works for Acme Corp) and events (Jane Doe was appointed as a Board member of Acme Corp) in the text. Calais then processes the entities, facts and events extracted from the text and returns them to the caller in RDF format.

--- from wikipedia --
Calais is a service by Thomson Reuters that automatically extracts semantic information from web pages in a format that can be used on the semantic web.[1] Calais was launched in January 2008, and is free to use.[2][3]

The Calais Web Service reads unstructured text and returns Resource Description Framework formatted results identifying entities, facts and events within the text.[4] The service appears to be based on technology acquired when Reuters purchased ClearForest in 2007.[5]

Recent uses of the technology have included the automatic tagging of blog articles[6] and the organization of museum collections.[7]

Calais uses natural language processing technologies delivered via a web service interface.

Search · opencalais. Software Provider. Submitted by Anonymous on Mon, 04/14/2008 - 15:44. For a general overview of Calais please take a moment to read the About section. If you’d just like to jump in and learn how Calais is relevant to software providers, read on. Do you build content-driven software? If you do - the Calais team wants to work with you to incorporate Calais functionality in your tools.

Why is it relevant? Calais can add rich semantic metadata to the content managed or processed by your tools. Software Providers is a big category - and we can't suggest specific applications for the software world as a whole. Start the Experiment Calais is open for commercial and non-commercial use. Developer. Submitted by Anonymous on Mon, 04/14/2008 - 15:50. For a general overview of Calais please take a moment to read the About section. If you’d just like to jump in and learn how Calais is relevant to developers, read on.

Developers want… Just the facts, please. I know what I want to do – just tell me how it works and how to get my hands on it. Here’s the fast track. What it is Calais is a big initiative with a lot of components. Entities are things like people, places, companies, geographies. What you do with it is up to you. Get a key The Calais API needs a key. Read the documentation The API is extensively documented. Get some tools The Calais team and members of the Calais community have produced everything from Java code samples to working applications to libraries for PHP, Ruby and others. Communicate, ask questions, brag This site has a Showcase to share your creations and Forums to ask questions, answer questions and generally talk. Content Manager. Submitted by Anonymous on Mon, 04/14/2008 - 15:49. For a general overview of Calais please take a moment to read the About section.

If you’d just like to jump in and learn how Calais is relevant to developers, read on. Content and Collection Management This is a big area that covers everything from corporate knowledge management to librarians to collections at museums. Given the range of needs for this group as a whole, we’re simply going to try and point you in some useful directions. What Calais Does Different areas of this site can provide you with much deeper detail, so let’s keep it simple for the time being. Calais enhances your content with rich semantic metadata. Metadata in and of itself is not too interesting. Tag Your Complete Historical Archives in Hours Perhaps you’re a complete convert to the value of tagging your content and are doing a great job with new material – but what about the tens of thousands to millions of pieces of historical content you have already produced?

Calais Viewer. OPAC2.0 – OpenCalais meets our museum collection / auto-tagging and semantic parsing of collection data. Today we went live with another one of the new experimental features of our collection database – auto-generation of tags based on semantic parsing. Throughout the Museum’s collection database you will now find, in the right hand column of the more recently acquired objects (see a quick sample list ), a new cluster of content titled “Auto-generated tags”. We have been experimenting with Reuters’ OpenCalais web service since it launched in January. Now we have made a basic implementation of it applied to records in our collection database, initially as a way of generating extra structured metadata for our objects.

We can extract proper names, places (by continent, country, region, state and city), company names, technologies and specialist terms, from object records all without requiring cataloguers to catalogue in this way. Here’s a brief description of what OpenCalais is in a nutshell from their FAQ - Obviously the type of content that we are asking OpenCalais to parse is complex.

Calais for CMS

Intro & Docs. REST. Submitted by Anonymous on Wed, 02/25/2009 - 13:19. Calais’s latest release of an improved REST interface is the simplest and fastest way to submit your documents. Here’s how you can invoke request with this API. Web service URL for improved REST API is located at should create an HTTP POST request.Document content should be passed as the body of the HTTP request.Submitted content should be UTF-8 encoded.Your Calais license, different processing and user options are specified as HTTP headers (key-value pairs) of the request.

Sample Java code is here. Web Service API.