background preloader

Outils & Doc

Facebook Twitter

France - Résultats élections régionales 2015. Le Monde pousse sa data éditoriale. « Les données du Monde » sont en ligne depuis quelques jours à peine. Mélange de data et d’éditorial dédiés aux communes de France, elles ambitionnent de faire basculer Le Monde dans un data journalisme ambitieux et assumé. Robots-journalistes compris. Luc Bronner, directeur de la rédaction du Monde, s’explique. Interview. CB News : Vous avez discrètement mis en ligne il y a quelques jours « Les données du Monde ». Qu’est-ce que c’est ? Luc Bronner : C’est un sujet sur lequel nous travaillons depuis un an. CB News : concrètement ? Luc Bronner : Chacune de ces pages comporte un moteur de recherche, des informations sur la population, la démographie, sur les revenus, sur le prix de l’immobilier, le taux de chômage dans les zones d’emplois, le maire de la commune, les membres du conseil municipal, le nombre de naissance, de décès, les résultats électoraux de la commune depuis 2002… À cela, nous rajoutons un fil d’actualité du Monde lié à la région de la localité recherchée.

Hei-Da | Data Journalism | Digital AgencyHei-Da | Data Journalism | Digital Agency. TextWrangler. TextWrangler TextWrangler is an all-purpose text and code editor for Mac OS X, based on the same award-winning technology as BBEdit, our leading professional HTML and text editor. We will be eventually retiring TextWrangler from our product line, and so we encourage anyone interested in TextWrangler to download and use BBEdit instead. We’ve put together a handy chart comparing BBEdit and TextWrangler, to help you out.

Should I upgrade to BBEdit? BBEdit is TextWrangler’s elder sibling. It’s a text editing power tool, with a rich set of HTML markup tools and other web development aids; an advanced “Projects” feature for helping you keep track of related files; integrated support for source-code control using Git, Subversion or Perforce; and many more features to help software developers, web developers, and anyone else with text editing needs to work more smoothly and productively. A better free alternative How do I get BBEdit? Download BBEdit here. How do I get TextWrangler? Introducing Autotune | Vox Product Blog. Written by Kavya Sukumar and Ryan Mark, July 8, 2015 Today we're announcing a new project we've been working on at Vox Media: Autotune. We built this application to address the problem of reusability in our work. This project is open source and available to everyone. As any news hacker knows, one of the most challenging requests we get is for "more of those things.

" We'll make a neat chart, visualization or map, which sees some success: our readers or reporters like it or maybe it helps tell a better story. You better believe other folks will come around asking for "one of those charts like on that one story. " One of the most difficult messages to communicate to our non-developer colleagues is how tricky "reusability" is. It may sound as if this is a problem, a lack of foresight or a rookie mistake, but it is not. The goal of Autotune is to shorten the gap between building a one-off website or interactive graphic and building a reusable tool for generating many things. What is Autotune? Home - Journalist's Resource Journalist's Resource: Research for Reporting, from Harvard Shorenstein Center. The best rapper alive, as decided by computers. Finally, data science has begun tackling rap. It makes sense, because rap is a pretty good subject for algorithms to latch onto: lyrics are a dense data set, analysts have a lot of words to work with, and songs are heavy on allusions and references that make for fascinating connections.

In 2014, Matt Daniels ranked rappers by the breadth of their vocabularies (Aesop Rock and GZA took first and second place, respectively). Now, Eric Malmi, a Finnish doctoral candidate, has looked at something more integral to rap: rhymes. Assonance is a great way to judge rhyming in rap Malmi analyzed popular rap lyrics using something called assonant rhymes. Without getting too deep into the phonetic weeds, an assonant rhyme is one where the vowels rhyme, but the consonants may or may not.

It's also a better measure than looking at just vocabulary or rhymes at the end of a line. My name is Joe, I walk down the street,And now I'm going to look at my feet. Who is the best rapper? Rappers, ranked and graphed. Algorithm That Counts Rap Rhymes and Scouts Mad Lines | Mining for Meaning. “Men lie, women lie, numbers don’t” – Jay Z Among the many things rappers like to boast about, some are relatively easy to quantify, like money, whereas rhyming skills are something that have been very difficult to measure – up till now.

In this post, I’ll present Raplyzer, a computer program which automatically detects rhymes from rap lyrics and which is used to rank popular rappers based on their average Rhyme factor. I’ll also present another program called BattleBot, which is a search engine for rhyming rap lines based on the algorithm used in Raplyzer. Rap Rhyming 101 In rap lyrics, assonance, where words don’t have necessarily the same ending but they share a vowel sound, is the most typical form of rhyming nowadays [1]. “This is a job – I get paid to sling some raps,What you made last year was less than my income tax” [2] As one author puts it: “Multis are hallmarks of all the dopest flows, and all the best rappers use them” [2]. Automatic Rhyme Detection Final Words Acknowledgements: Automated Insights - High Quality Automated Content Services. New Articles on

Emergent. Fact-checking U.S. politics. Semantria: Text Analytics and Sentiment Analysis for Everyone. Wolfram|Alpha: Computational Knowledge Engine. ClikView. Publish your data online. Recueillir des données sur le Web. Recueillir des données sur le Web Vous avez tout essayé, et vous n’êtes toujours pas parvenu à mettre la main sur les données que vous voulez. Vous avez trouvé les données sur le web, mais hélas – aucune option de téléchargement n’est disponible et le copier-coller montre ses limites. N’ayez crainte, il y a toujours un moyen d’extraire les données. Vous pouvez par exemple tenter les actions suivantes. Obtenir les données par l’intermédiaire d’une API web, telles que les interfaces fournies par les bases de données en ligne et de nombreuses applications web modernes (comme Twitter, Facebook et bien d’autres).

En plus de ces excellentes options techniques, n’oublions pas les options simples : parfois, cela vaut la peine de passer un peu de temps à chercher un fichier contenant des données déjà exploitables ou d’appeler l’institution qui détient les données que vous voulez. Que sont des données lisibles par machine ? Le webscraping : pour quoi faire ? Data + Design. DocumentCloud. Visualization and Data Mining Software. OpenRefine (ex-Google Refine) How to Scrape Google Search Results with Google Sheets. This tutorial explains how you can easily scrape Google Search results and save the listings in a Google Spreadsheet. It can be useful for monitoring the organic search rankings of your website in Google for particular search keywords vis-a-vis other competing websites. Or you can exporting search results in a spreadsheet for deeper analysis.

There are powerful command-line tools, curl and wget for example, that you can use to download Google search result pages. The HTML pages can then be parsed using Python’s Beautiful Soup library or the Simple HTML DOM parser of PHP but these methods are too technical and involve coding. If you ever need to extract results data from Google search, there’s a free tool from Google itself that is perfect for the job. The idea is simple. Features Free Premium Maxiumum number of Google search results fetched per query Details fetched from Google Search Results Web page title, URL and website favicon Perform time limited searches No Yes PDF Manual None Included Email. Kimono : Turn websites into structured APIs from your browser in seconds. This Simple Data-Scraping Tool Could Change How Apps Are Made | Wired Design. The number of web pages on the internet is somewhere north of two billion, perhaps as many as double that.

It’s a huge amount of raw information. By comparison, there are only roughly 10,000 web APIs–the virtual pipelines that let developers access, process, and repackage that data. In other words, to do anything new with the vast majority of the stuff on the web, you need to scrape it yourself. Even for the people who know how to do that, it’s tedious. Ryan Rowe and Pratap Ranade want to change that. For the last five months, Rowe and Ranade have been building out Kimono, a web app that lets you slurp data from any website and turn it instantly into an API. Excitement’s already bubbling around the potential. Eliminating the Bottleneck The idea for Kimono was born out of Rowe’s time as a developer at the design consultancy Frog, where he continually ran into the same frustrating problem. It’s about letting artists, historians, sociologists cull and combine content.

Go Back to Top. Announcing Portia, the open source visual web scraper! | Scrapinghub Blog. DataMiner. Temboo. Magic | Web Data Platform & Free Web Scraping Tool. @joelmatriche » Le blog de jo. Lorsque des données sont correctement formatées sous forme de tableau dans une page web, il est facile de les importer directement dans une feuille Excel. Et d'automatiser les mises à jour. Exemples : l'importation dynamique de cotations boursières dans une feuille de calcul et la surveillance, depuis Excel, des changements éventuellement apportés à une page web. Exemple 1 : la surveillance d'une page web. Imaginons que journaliste, je veuille consulter dès leur parution les comptes-rendus des commissions qui ont lieu à la Chambre belge des Représentants.

La première étape est bien sûr d'ouvrir cette page et d'en copier l'adresse. Une boîte de dialogue apparaît, je colle l'adresse de la page web à scrapper dans la barre d'adresse de ce mini-navigateur et je clique sur "Ok". Je fais défiler la page jusqu'à rencontrer le tableau que je veux importer, il est signalé par une petite flèche noire sur fond jaune - ce qui signifie que l'importation est techniquement possible. Post Tags: Publier ses datas en responsive. ScraperWiki.