Data Science Toolkit

Data Science Toolkit Steal this server! Grab this entire site as a free, self-contained, ready-to-run VM Independence - Never worry about the provider going offline, or charging once you're hooked. Security - Run on your intranet, so customer information stays within the firewall. Scalability - No API limits. Run a cluster of as many instances as you need.
by Allen B. Downey, published by O'Reilly Media. Order Think Stats from Amazon.com Download this book in PDF. Read this book online. Think Stats: Probability and Statistics for Programmers Think Stats: Probability and Statistics for Programmers
Data Publica

Data Publica

Data Publica lauréat du Concours Avec C-Radar, l'offre Big Data au service des données entreprises Data Publica lance C-Radar Les ventes prédictives appliquées au B2B débarquent en France Découvrir C-Radar GRANDS GROUPES PME ILS TRAVAILLENT AVEC DATA PUBLICA START UP Découvrez data publica en une animation Les tableaux de bord des territoires : Créez votre tableau de bord personnalisé à partir de données internes et d’open data
Indicateurs de développement humain Rapport sur le développement humain 2013, Programme des Nations Unies pour le développement Les données utilisées pour calculer l'Indice de développement humain (IDH) et autres indices composites présentés dans le Rapport sur le développement humain ... Public Data Explorer

Public Data Explorer

Freebase
Prompted in part by a presentation I have to give tomorrow as an OU eLearning community session (I hope some folks turn up – the 90 minute session on Mashing Up the PLE – RSS edition is the only reason I’m going in…), and in part by Scott Leslie’s compelling programme for a similar duration Mashing Up your own PLE session (scene scetting here: Hunting the Wily “PLE”), I started having a tinker with using Google spreadsheets as for data table screenscraping. So here’s a quick summary of (part of) what I found I could do. The Google spreadsheet function =importHTML(“”,”table”,N) will scrape a table from an HTML web page into a Google spreadsheet. The URL of the target web page, and the target table element both need to be in double quotes. The number N identifies the N’th table in the page (counting starts at 0) as the target table for data scraping. Data Scraping Wikipedia with Google Spreadsheets Data Scraping Wikipedia with Google Spreadsheets
Many Eyes

Many Eyes

Try out the newest version of IBM Many Eyes! New site design and layout Find visualization by category and industry New visualization expertise and thought leadership section
Last night OpenHeatMap creator Pete Warden announced that the tool now allowed you to visualise UK data . I’ve been gleefully playing with the heat-mapping tool today and thought I’d share some pointers on visualising data on a map. This is not a tutorial for OpenHeatMap – Pete’s done a great job of that himself (video below) – but rather an outline of the steps to get some map-ready data in the first place. [youtube:http://www.youtube.com/watch?v=yFy80kMcaRo] 1. Playing with heat-mapping UK data on OpenHeatMap Playing with heat-mapping UK data on OpenHeatMap
Scraping for Journalism: A Guide for Collecting Data Photo by Dan Nguyen/ProPublica Our Dollars for Docs news application lets readers search pharmaceutical company payments to doctors. We’ve written a series of how-to guides explaining how we collected the data. Most of the techniques are within the ability of the moderately experienced programmer. The most difficult-to-scrape site was actually a previous Adobe Flash incarnation of Eli Lilly’s disclosure site.

Scraping for Journalism: A Guide for Collecting Data

Definition: Tools that help users discern patterns in data--dynamic graphs, charts, maps, plots, etc. Tools: Chartle: Create simple interactive charts. Data Visualization

Data Visualization

Tutorials

Tutorials How to map connections with great circles There are various ways to visualize connections, but one of the most intuitive and straightforward ways is to actually connect entities or objects with lines. And when it comes to geographic connections, great circles are a nice way to do this.
How to Make Bubble Charts A bubble chart can also just be straight up proportionally sized bubbles, but here we're going to cover how to create the variety that is like a scatterplot with a third, bubbly dimension. The advantage of this chart type is that it lets you compare three variables at once. One is on the x-axis, one is on the y-axis, and the third is represented by area size of bubbles. Have a look at the final chart to see what we're making. Step 0. Download R