
Online XML to CSV converter This free online tool converts XML to CSV (comma-separated values) format. It uses code from the open source project XmlToCsv, which is available from CodePlex. Note that converting a large XML file to CSV can take a considerable amount of time, and that the maximum file size allowed is 4 MB. For larger files, please download the free desktop XML to CSV conversion software.
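As a rough illustration of what such a conversion involves, here is a minimal Python sketch using only the standard library. It is not the XmlToCsv project's code or method: the function name `xml_to_csv`, the `record_tag` parameter, and the sample data are all assumptions made for this example, and it only handles flat records whose direct children become columns.

```python
import csv
import io
import xml.etree.ElementTree as ET


def xml_to_csv(xml_text: str, record_tag: str) -> str:
    """Flatten each <record_tag> element's direct children into one CSV row.

    Hypothetical helper for illustration; nested elements and attributes
    are ignored.
    """
    root = ET.fromstring(xml_text)
    records = list(root.iter(record_tag))

    # Collect column names in first-seen order across all records.
    columns = []
    for record in records:
        for child in record:
            if child.tag not in columns:
                columns.append(child.tag)

    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=columns, lineterminator="\n")
    writer.writeheader()
    for record in records:
        # Columns missing from a record are left empty by DictWriter.
        writer.writerow({child.tag: (child.text or "").strip() for child in record})
    return buffer.getvalue()


# Made-up sample data, not taken from the tool's documentation.
sample = """<books>
  <book><title>Dune</title><year>1965</year></book>
  <book><title>Emma</title><year>1815</year><note>reissue, 2003</note></book>
</books>"""

print(xml_to_csv(sample, "book"))
```

Values that contain commas are quoted by the `csv` module, so the output stays valid CSV. Real-world XML with nesting or attributes would need a mapping decision that this sketch does not attempt.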

How to turn everyone in your newsroom into a graphics editor Today Quartz is open-sourcing the code behind Chartbuilder, the application we use to make most of our charts. Along with the underlying charting library — called Gneisschart — the tool has given everyone in our worldwide newsroom 24-hour access to simple charts at graphics-desk quality. It has helped all of our reporters and editors become more responsible for their own content and less dependent on others with specialized graphics skills. The rise of Chartbuilder The project started as a tool for myself.

Requirements Once you understand the requirements, continue to the installation documentation. UNIX vs Windows If you are a Windows user, read how Circos differs on UNIX and Windows.

Weka 3 - Data Mining with Open Source Machine Learning Software in Java Weka is a collection of machine learning algorithms for data mining tasks. The algorithms can either be applied directly to a dataset or called from your own Java code. Weka contains tools for data pre-processing, classification, regression, clustering, association rules, and visualization. It is also well-suited for developing new machine learning schemes.

Off the Staff - C82: Works of Nicholas Rougeux Seeing music I can't read music but I can parse it. The talent of reading music has always escaped me, which is a little ironic considering I grew up in a musical family. However, I've always enjoyed how sheet music looks, so I took a shot at visualizing the notes from musical scores, and the result is this series of posters.

Create An Intimate Map Of Your Life, Using Just Your Email Inbox We rarely think twice when sending an email, or adding a few CCs just for good measure. But these small interactions add up, and when deconstructed en masse, will reveal more about you than you might ever expect. Immersion is an interactive network data visualization created at MIT Media Lab’s Macro Connections group by Deepak Jagdish, Daniel Smilkov and Cesar Hidalgo. All you do is give the site access to your Gmail account.

Visualizing Linguistic Variation with LATtice The transformation of literary texts into “data” – frequency counts, probability distributions, vectors – can often seem reductive to scholars trained to read closely, with an eye on the subtleties and slipperiness of language. But digital analysis, in its massive scale and its sheer inhuman capacity of repetitive computation, can register complex patterns and nuances that might be beyond even the most perceptive and industrious human reader. To detect and interpret these patterns, to tease them out from the quagmire of numbers without sacrificing the range and the richness of the data that a text analysis tool might accumulate, can be a challenging task. A program like DocuScope can easily take an entire corpus of texts and sort every word and phrase into groups of rhetorical features. It produces a set of numbers for each text in the corpus, representing the relative frequency counts for 101 “language action types” or LATs.
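To make the idea of per-text relative frequency counts concrete, here is a small Python sketch. It is only an illustration, not how DocuScope works internally: DocuScope also matches phrases and uses its own dictionary, which is not reproduced here. The word-to-category mapping `CATEGORY_OF`, its category labels, and the function name `relative_frequencies` are all made up for this example.

```python
from collections import Counter

# Hypothetical mini-dictionary mapping words to illustrative category labels.
# The real DocuScope dictionary and its 101 LATs are not reproduced here.
CATEGORY_OF = {
    "i": "FirstPerson",
    "we": "FirstPerson",
    "must": "Insistence",
    "not": "Denial",
    "never": "Denial",
}


def relative_frequencies(text: str, category_of: dict) -> dict:
    """Return each category's share of all tokens in the text."""
    tokens = text.lower().split()
    if not tokens:
        return {}
    # Count only tokens that the dictionary assigns to a category.
    counts = Counter(category_of[t] for t in tokens if t in category_of)
    return {category: n / len(tokens) for category, n in counts.items()}


freqs = relative_frequencies("I must not fear we must go", CATEGORY_OF)
for category, share in sorted(freqs.items()):
    print(f"{category}: {share:.3f}")
```

Running the function over every text in a corpus would give one vector of numbers per text, which is the kind of table a visualization tool then has to make readable.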

Tangle: a JavaScript library for reactive documents Tangle is a JavaScript library for creating reactive documents. Your readers can interactively explore possibilities, play with parameters, and see the document update immediately. Tangle is super-simple and easy to learn.

Apache Zeppelin 0.7.0-SNAPSHOT Documentation: Data Ingestion, Data Discovery, Data Analytics, Data Visualization & Collaboration. Multiple Language Backend: The Apache Zeppelin interpreter concept allows any language or data-processing backend to be plugged into Zeppelin. Currently Apache Zeppelin supports many interpreters, such as Apache Spark, Python, JDBC, Markdown and Shell. Adding a new language backend is really simple.

Cohesion (computer science) In computer programming, cohesion is a measure of how strongly related the responsibilities of a single module are. As applied to object-oriented programming, if the methods that serve the given class tend to be similar in many aspects, then the class is said to have high cohesion. In a highly cohesive system, code readability and the likelihood of reuse are increased, while complexity is kept manageable. Cohesion is increased if:

Essential Resources: Tools for collecting and handling data This is part of a series of posts to share with readers a useful collection of some of the most important, effective and practical data visualisation-related resources. This post presents a collection of useful tools, resources and references for gathering, cleaning and preparing your data for analysis and design. Please note, I may not have personally used all the packages or tools presented but have seen sufficient evidence of their value from other sources. Whilst some inclusions may be contentious from a quality/best-practice perspective, they may still provide some good features and provide value to a certain audience out there. Finally, to avoid re-inventing the wheel, descriptive text may have been reproduced from the native websites if they provide the most articulate descriptions. Your feedback is most welcome to help curate this collection, keep it up to date and preserve its claim to be an essential list of resources!

Interlinear » SIL FieldWorks The interlinear tool has multiple views of your texts. The Baseline tab allows you to enter and edit a text.
