opendata

TwitterFacebook
Get flash to fully experience Pearltrees
http://api.epdb.eu/

An API for European Union law

The API can help you conduct research, create data visualizations or you can even build applications upon it. This is an application programming interface (API) that opens up core EU legislative data for further use. The interface uses JSON, meaning that you have easy to use machine-readable access to meta data on European Union legislation.

DocumentCloud

http://www.documentcloud.org/home DocumentCloud runs every document you upload through OpenCalais , giving you access to extensive information about the people, places and organizations mentioned in each.
http://owni.fr/2011/04/06/dis-papa-cest-quoi-lopen-data/ Nombreux sont ceux qui estiment que le mouvement "open data" aura, à l'instar de l’apparition de l’alphabet, de l'internet ou encore de l'explosion des réseaux sociaux, des répercussions majeures dans nos sociétés. Connu pour ses logiciels non libres , Microsoft a eu la très bonne idée de demander à Regards sur le numérique ( RSLN , animé par Spintank ), son “ laboratoire d’idées, de réflexions et d’expérimentations en ligne “, de se pencher sur la notion d’ open data , et donc le partage de données publiques dans des formats ouverts, afin de libérer les données récoltées, ou produites, par les autorités publiques, et de les rendre, si possible gratuitement, à la société, ses citoyens, associations, entreprises privées et administrations publiques.

Dis, papa, c’est quoi l’open data ? » OWNI, News, Augmented

http://simile.mit.edu/wiki/Solvent Piggy Bank needs web pages to embed information in a format that it can understand.

Solvent - SIMILE

When requesting and parsing data from a source with unknown properties and random behavior (in other words, scraping), I expect all kinds of bizarrities to occur. https://scraperwiki.com/

Welcome | ScraperWiki

The Big Clean

Truly global Big Clean – screen scraping event is planned for May or June 2011. In agile manner we decided to start the preparation by actually organising a Big Clean in two cities, namely Prague (Czech) and Jyväskylä (Finland) on March 19 th . The events in these cities will happen at the same time, so that the participants may help each others over IRC and be part of the bigger movement. http://bigclean.org/
http://www.readwriteweb.com/hack/2011/03/overview-of-python-tools-for-w.php

Overview of Python Tools for Working with Linked Data

We've covered Linked Data - a W3C specification for publishing structured data - frequently at ReadWriteWeb. We've covered its importance , its growth and various projects and tools taking advantage of it. But what about tools to actually get your hands dirty and work with it yourself? RDF is one way of using Linked Data. Michele Pasin , a researcher and Web developer at the Centre for Computing in the Humanities has created a list of resources for Python developers working with RD - including Python libraries, tutorials and Python friendly RDF triplestores.

Pattern: A Bundle of Data Mining Modules for Python

Pattern is a collection of open source (BSD license) web mining modules for Python from the Computational Linguistics and Psycholinguistics Research Center . http://www.readwriteweb.com/hack/2011/02/pattern-a-web-mining-module-fo.php
The demand for text mining tools, services like Instapaper and Readability , and Web scraping have increased the importance of extracting article text from HTML pages.

Overview of Text Extraction Algorithms

http://www.readwriteweb.com/hack/2011/03/text-extraction.php