background preloader

Data Publica, le portail français des données publiques et de l'open data

Data Publica, le portail français des données publiques et de l'open data

ocropus - The OCRopus(tm) open source document analysis and OCR system Vidéos Buzz, Jeux Flash et Images sur Koreus.com ScanR - use camera phones for OCR scanR is a free service that lets you transform camera phone pictures into PDF documents. You can take a picture of a document, send it to scanR by email and in less than a minute you'll get a PDF file. If you save the file as text in Acrobat Reader, you'll have the text contained in the document. You can also use it for whiteboard images. scanR requires 1 megapixel cameras for whiteboard scanning and 2 megapixel cameras for document scanning. I tested this online OCR service with Sony Ericsson K750i and the results were pretty good. Related:Use Gmail to break PDF DRM

Vidéos Buzz, Drôles, Insolites sur Spi0n.com Open-Source OCR Software, Sponsored by Google Google sponsors the development of an open-source OCR software at the IUPR research group. "OCRopus is a state-of-the-art document analysis and OCR system, featuring pluggable layout analysis, pluggable character recognition, statistical natural language modeling, and multi-lingual capabilities." "The goal of the project is to advance the state of the art in optical character recognition and related technologies, and to deliver a high quality OCR system suitable for document conversions, electronic libraries, vision impaired users, historical document analysis, and general desktop use," explains Thomas Breuel, who leads the project. The software is partly based on Tesseract, the best open source OCR engine available for now. While the project is expected to be released at the end of next year and will be used for Google's book scanning project, the team has some interesting applications in mind:

L’enfant, cette créature prodigieuse Le docteur Maugrin se redressa nerveusement dans son fauteuil, éclaircissant sa voix de quelques toussotements avant de soigneusement joindre ses mains pour prendre la posture qui était la sienne lorsqu’il avait des nouvelles importantes à délivrer à ses patients. Face à lui, le jeune couple sentant qu’il se passait quelque chose sembla rapetisser quelque peu dans les chaises mises à sa disposition, attendant que tombe la sentence de l’homme de science. Maugrin leur jeta tour à tour un regard grave, puis prit la parole. Le médecin se leva en faisant craquer son siège de cuir, ses patients le suivant du regard alors qu’il se dirigeait vers la fenêtre pour prendre un air philosophe. Maugrin pivota sur ses talons, les yeux plissés. Si un jour vous vous voyez exactement cela, c’est qu’ils s’apprêtent probablement à vous bouffer le foie Sujet de bien des débats, celui-ci reste profondément mystérieux : qui est-il ? Définition Histoire Biologie Comportement Vous le reconnaissez ? Structure sociale

Top 5 Free OCR Software Tools To Convert Images Into Text Quite frankly, I wish I knew about this simple way to use freely available OCR software back in my school days. Of course, we didn’t have camera mobile phones or inexpensive Digicams, but wouldn’t it have saved hours of copying notes! Ah, modern technology is wonderful; take a scanned image (or take a snap using a mobile camera/Digicam) and presto – OCR software extracts all the information from the image into easily editable text format. Optical character recognition (OCR) is a system of converting scanned printed/handwritten image files into its machine readable text format. Maybe you have already come across our previous How to Extract Text from Images (OCR) post and used JOCR, a a free OCR software tool. We will be looking at 5 free pieces of OCR software and to start off let’s see the overlooked two that are already installed on our systems. OCR Using Microsoft OneNote 2007 For the occasional basic OCR stuff, MS OneNote’s optical character recognition feature is a timesaver. SimpleOCR

Tesserac ou l'OCR Open source Bonjour Les programmes de reconnaissance de caractères OCR sous licence libre ne sont pas légion Mais ceux qui donnent de bons résultats sont encore plus rares (*) J'en ai pourtant trouvé un (qui intéresse Google également) Les sources sont disponibles Et j'ai pu l'installer sur mon ubuntu Edgy en suivant les indications de cette page NB La premiere fois que j'ai lancer la commande make il me manquait un paquet pour compiler correctement les sources à la lecture des logs j'ai compris qu'il manquait c++ , je l'ai donc installé et l'installation s'est poursuivie jusqu'à son terme. Les résultats sont bons pour une mise en page simple mais l'accentuation est perdue lors de la reconnaissance. Mes questions : 1) Quelqu'un aurait t'il déjà fait un script pournautilus Actionpermettant de lancer automatiquement la conversion d'un fichier image (clic droit > convertir en texte)

OS mapping data: a new landscape unfolds | Technology The Free Our Data campaign has scored a major victory, with the announcement by the government that it intends to make Ordnance Survey maps free for use online by any organisation – including commercial ones – at resolutions more detailed than commercial 1:25,000 Landranger maps from April next year. The announcement of the opening of a consultation on the plan by Gordon Brown at Downing Street on Tuesday, as part of a seminar on making public data public – set in the wider context of public service reform, under the "Smarter Government" umbrella – indicates that the ideas underpinning the campaign have now been taken on board at the highest levels of government. "Mid-range" maps, with resolutions from 1:10,000 upwards, will be made available for re-use, under the plans announced by the prime minister, along with information on postcode areas and electoral and council boundaries. The issue appears to have gone to the top of government to be resolved.

Related: