
opendata
Get flash to fully experience Pearltrees
An API for European Union law
The API can help you conduct research, create data visualizations or you can even build applications upon it. This is an application programming interface (API) that opens up core EU legislative data for further use. The interface uses JSON, meaning that you have easy to use machine-readable access to meta data on European Union legislation.DocumentCloud
Dis, papa, c’est quoi l’open data ? » OWNI, News, Augmented
Solvent - SIMILE
When requesting and parsing data from a source with unknown properties and random behavior (in other words, scraping), I expect all kinds of bizarrities to occur.
Welcome | ScraperWiki
The Big Clean
Truly global Big Clean – screen scraping event is planned for May or June 2011. In agile manner we decided to start the preparation by actually organising a Big Clean in two cities, namely Prague (Czech) and Jyväskylä (Finland) on March 19 th . The events in these cities will happen at the same time, so that the participants may help each others over IRC and be part of the bigger movement.Overview of Python Tools for Working with Linked Data
We've covered Linked Data - a W3C specification for publishing structured data - frequently at ReadWriteWeb. We've covered its importance , its growth and various projects and tools taking advantage of it. But what about tools to actually get your hands dirty and work with it yourself? RDF is one way of using Linked Data. Michele Pasin , a researcher and Web developer at the Centre for Computing in the Humanities has created a list of resources for Python developers working with RD - including Python libraries, tutorials and Python friendly RDF triplestores.Pattern: A Bundle of Data Mining Modules for Python
Pattern is a collection of open source (BSD license) web mining modules for Python from the Computational Linguistics and Psycholinguistics Research Center .The demand for text mining tools, services like Instapaper and Readability , and Web scraping have increased the importance of extracting article text from HTML pages.

