background preloader

Data

Facebook Twitter

[ProPublica] Tools & Data. Import.io | Free Structured Web Data Scraping Tool.

DataViz

Anonymous Scraping with Visual Web Ripper. It’s very common to use proxy servers for web data extraction. If you want to stay undetected when you scrape a website, you have to change your IP address periodically. Otherwise it is very easy to detect unusual activity by observing a large number of requests from a single IP address. Visual Web Ripper has a built-in support of proxy servers called Private Proxy Switch. How to get it? To access this functionality you need to sign up entering your serial key (you can enter your trial key as well) or voucher (if you have one). You can use either Serial Key authentication or IP authentication, but the latter is available on paid plans only. How much is it? There are five plans for this service (you can change your plan when necessary from the control panel): Free – 500 Mb/monthLight – $9 + 5 Gb/month (subscription)Standard – $39 + 25 Gb/month (subscription)Heavy – $79 + 75 Gb/month (subscription)Year – $49 + 20 Gb/year (one time payment) How does it work?

Alternatives. Streamdrill - real-time big data. [Tester conclusions] The R Project for Statistical Computing. [Tester conclusions] RStudio - Home. [Data Cleaning] OpenRefine. Data Wrangler.

UPDATE: The Stanford/Berkeley Wrangler research project is complete, and the software is no longer actively supported. Instead, we have started a commercial venture, Trifacta. For the most recent version of the tool, see the free Trifacta Wrangler. Why wrangle? Too much time is spent manipulating data just to get analysis and visualization tools to read it. Wrangler is designed to accelerate this process: spend less time fighting with your data and more time learning from it. Wrangler allows interactive transformation of messy, real-world data into the data tables analysis tools expect. Export data for use in Excel, R, Tableau, Protovis, ... [Extracteur de texte] Free online OCR. [Extract from PDF] Tabula. [Convertisseur de fichiers] Zamzar - convert document, eBook, image, audio and video - free online file conversion.

[Convertisseur de fichiers] 7 tools to convert between different data formats.