
Data Wrangler

UPDATE: The Stanford/Berkeley Wrangler research project is complete, and the software is no longer actively supported. Instead, we have started a commercial venture, Trifacta. For the most recent version of the tool, see the free Trifacta Wrangler. Why wrangle? Too much time is spent manipulating data just to get analysis and visualization tools to read it. Wrangler is designed to accelerate this process: spend less time fighting with your data and more time learning from it.


CS109 Data Science Learning from data in order to gain useful predictions and insights. This course introduces methods for five key facets of an investigation: data wrangling, cleaning, and sampling to get a suitable data set; data management to be able to access big data quickly and reliably; exploratory data analysis to generate hypotheses and intuition; prediction based on statistical methods such as regression and classification; and communication of results through visualization, stories, and interpretable summaries. We will be using Python for all programming assignments and projects. All lectures will be posted here and should be available 24 hours after meeting time. The course is also listed as AC209, STAT121, and E-109.

51 Tools Chart selection
1. ChartChooser from Juice Analytics
2. Charts suggestions from Andrew Abela
3. Graph Selection Matrix from Perceptual Edge
4.

Protovis Protovis composes custom views of data with simple marks such as bars and dots. Unlike low-level graphics libraries that quickly become tedious for visualization, Protovis defines marks through dynamic properties that encode data, allowing inheritance, scales and layouts to simplify construction. Protovis is free and open-source, provided under the BSD License. It uses JavaScript and SVG for web-native visualizations; no plugin required (though you will need a modern web browser)! Although programming experience is helpful, Protovis is mostly declarative and designed to be learned by example.

ChickBot - the fully functional low-cost robot by Nick McCloud

Product Overview - Big Data Analytics - Datameer Integrate, prepare, analyze and visualize any data. Datameer simplifies the big data analytics environment into a single application on top of the powerful Hadoop platform. The only end-to-end big data analytics application for Hadoop designed to make big data simple for everyone, Datameer combines self-service data integration, analytics and visualization functionality that provides the fastest time to insights. Data integration: liberate your data. Data is the raw material of insight, and the more data you have, the deeper and broader the possible insights. This means not just traditional transaction data but all types of data, so that you can get a complete view of your customers, better understand business processes and improve business performance. Self-service data analytics: insights without boundaries. Datameer provides complete analytics, from simple joins and transforms to complex predictive analytics.

Introduction to open-source GIS tools for journalists Originally published by Matt Wynn on 19 September 2011. This excerpt is republished with permission. For years, it’s been only the most committed of journo-nerds who could invest both the time and money to put geography to use. To say that’s changing is an understatement.

VisIt About VisIt: VisIt is an open-source, interactive, scalable visualization, animation and analysis tool. From Unix, Windows or Mac workstations, users can interactively visualize and analyze data ranging in scale from small (<10^1 core) desktop-sized projects to large (>10^5 core) leadership-class computing facility simulation campaigns.

(Unofficial) API Guide for mDrawBot mScara - All This is a guide to the drawing API used by the MakeBlock mDrawBot kit mScara robot. It is reverse-engineered from their scara.ino and mDraw v1.1 Python code from GitHub (Aug. 25, 2015 commit). The Makeblock mDrawBot kit is now available in many Barnes & Noble stores at the same price as mail order (even cheaper if you are a B&N member!). This document's purpose is to provide a guide for creating alternatives to mDraw, the program that is normally used with mScara, such as Linux command-line tools or interconnections to other CNC-type plotter programs.
