background preloader

First steps in data visualisation using d3.js, by Mike Dewar

First steps in data visualisation using d3.js, by Mike Dewar
This happens to be one of those rare instances where the benefit of hindsight does not make me regret something said flippantly on a panel. I deeply believe that in order to truly change the world we cannot simply "throw analytics at the problem." To that end, the medical and health industries are perhaps the most primed to be disrupted by data and analytics. To be successful, however, a deep respect for both the methodological and clinical contexts of the data are required. It is incredibly exciting to be at an organization that is both working within the current framework of health care and data to create new insight for people, but also pushing the envelope with respect to individuals' relationships with their own health. The challenges are technical, sociological, and political; but the potential for innovation that exists in this space comes along very rarely. I feel lucky to have an opportunity to move into the health data space now. Sensor data Strength of team

http://drewconway.com/zia/

Related:  d3.jsData analysis

D3 for Mere Mortals By Luke Francl (look@recursion.org), August 2011 d3.js is a data visualization library by Mike Bostock, who is also the primary creator of Protovis, which D3 is designed to replace. D3 has a steep learning curve, especially if (like me) you are not used to the pixel-precision of graphics programming. To build a visualization with D3, you need to understand JavaScript objects, functions, and the method-chaining paradigm of jQuery; the basics of SVG and CSS; D3's API; and the principles for designing effective infographics.

The key word in “Data Science” is not Data, it is Science One of my colleagues was just at a conference where they saw a presentation about using data to solve a problem where data had previously not been abundant. The speaker claimed the data were "big data" and a question from the audience was: "Well, that isn't really big data is it, it is only X Gigabytes". While that exact question would elicit groans from most people who work with data, I think it highlights one of the key problems with the thinking around data science. Most people hyping data science have focused on the first word: data. They care about volume and velocity and whatever other buzzwords describe data that is too big for you to analyze in Excel. This hype about the size (relative or absolute) of the data being collected fed into the second category of hype - hype about tools.

Facing Afghan mistrust, al-Qaeda fighters take limited role in insurgency On Aug. 14, a U.S. airstrike in the northern Afghan province of Kunduz killed a Taliban commander known as Abu Baqir. In a country where insurgents are killed daily, this attack was notable for one unusual detail: Abu Baqir, the military said afterward, was also a member of al-Qaeda. Although U.S. officials have often said that al-Qaeda is a marginal player on the Afghan battlefield, an analysis of 76,000 classified U.S. military reports posted by the Web site WikiLeaks underscores the extent to which Osama bin Laden and his network have become an afterthought in the war.

How to be a data journalist Data journalism is huge. I don't mean 'huge' as in fashionable - although it has become that in recent months - but 'huge' as in 'incomprehensibly enormous'. It represents the convergence of a number of fields which are significant in their own right - from investigative research and statistics to design and programming. The idea of combining those skills to tell important stories is powerful - but also intimidating. Who can do all that? The reality is that almost no one is doing all of that, but there are enough different parts of the puzzle for people to easily get involved in, and go from there. Wave - Welcome to Apache Wave (incubating) Wave is a distributed, near-real-time, rich collaboration platform that allows users to work together in new and exciting ways. Wave allows for flexible modes of communication, blending chat, email and collaborative document editing in to one seamless environment. Wave provides a lively and responsive environment that promotes more fluid and dynamic collaboration between users.

D3.js is Not a Graphing Library, Let's Design a Line Graph Working with graphing libraries can be tedious. Designing them can be downright frustrating. Each one of them slightly different, but most of them share two common flaws: a design-by-configuration and template design approach. A bar graph can be just a few bars with labels and tick marks...until it isn't. Want to change the background color? New option.

Statistics and the Science Club One of my favorite movies is Woody Allen’s Annie Hall. If you’re my age and you haven’t seen it, I usually tell people it’s like When Harry Met Sally, except really good. The movie opens with Woody Allen’s character Alvy Singer explaining that he would “never want to belong to any club that would have someone like me for a member”, a quotation he attributes to Groucho Marx (or Freud). How this was released Julian Assange on the Afghanistan war logs: 'They show the true nature of this war' Link to video: Julian Assange on the Afghanistan war logs: 'They show the true nature of this war' US authorities have known for weeks that they have suffered a haemorrhage of secret information on a scale which makes even the leaking of the Pentagon Papers during the Vietnam war look limited by comparison. The Afghan war logs, from which the Guardian reports today, consist of 92,201 internal records of actions by the US military in Afghanistan between January 2004 and December 2009 – threat reports from intelligence agencies, plans and accounts of coalition operations, descriptions of enemy attacks and roadside bombs, records of meetings with local politicians, most of them classified secret. The Guardian's source for these is Wikileaks, the website which specialises in publishing untraceable material from whistleblowers, which is simultaneously publishing raw material from the logs.

rgraphics R Graphicsby Paul Murrell The SECOND EDITION of this book is now available, with its own web page. A book on the core graphics facilities of the R language and environment for statistical computing and graphics (Chapman & Hall/CRC, August 2005). A link to the publisher's web page for the book. A list of Errata. The Habit Change Cheatsheet: 29 Ways to Successfully Ingrain a Behavior We are what we repeatedly do. Excellence, then, is not an act, but a habit. – Aristotle Our daily lives are often a series of habits played out through the day, a trammeled existence fettered by the slow accretion of our previous actions. By Leo Babauta But habits can be changed, as difficult as that may seem sometimes.

JUNG in Neo4j – Part 2 A few weeks ago I showed you how to visualize a graph using the chord flare visualization and how to visualize a network using a force directed graph visualization from D3.js. On Twitter Claire Willett from Riparian Data asked: Visualizing a 200-node network w/ #Cypher + #D3js –cool method, but maybe a node quilt would work better? ow.ly/92wDE @maxdemarzi— (@RiparianData) February 13, 2012 Cooperation between Referees and Authors Increases Peer Review Accuracy Peer review is fundamentally a cooperative process between scientists in a community who agree to review each other's work in an unbiased fashion. Peer review is the foundation for decisions concerning publication in journals, awarding of grants, and academic promotion. Here we perform a laboratory study of open and closed peer review based on an online game. We show that when reviewer behavior was made public under open review, reviewers were rewarded for refereeing and formed significantly more cooperative interactions (13% increase in cooperation, P = 0.018). We also show that referees and authors who participated in cooperative interactions had an 11% higher reviewing accuracy rate (P = 0.016).

Related:  Afghan War Diaryr-projectNetworks Theory