background preloader

Glossary of Statistical Terms

Glossary of Statistical Terms
Related:  DataVis Tutorials - PrinciplesData

How to create a visualization Over the last few years I’ve created a few popular visualizations, a lot of duds, and I’ve learned a few lessons along the way. For my latest analysis of where Facebook users go on vacation, I decided to document the steps I follow to build my visualizations . It’s a very rough guide, these are just stages I’ve learned to follow by trial and error, but following these guidelines is a good way to start if you’re looking to create your first visualization. Play with your data I was lucky enough to spend a few hours with Andreas Weigend recently, head of the Stanford Social Data lab. He has nine rules of data, and the first is “Start with the problem, not the data.” In my case, we have a Cassandra cluster with information on more than 350 million photos shared on Facebook. Click to enlarge. I was chatting with my colleague Chris Raynor about this, and he asked me if we could tell where all the visitors to those places were coming from. Pick a question Sketch out your presentation Related:

Worldometers Visualization series: Insight from Cleveland and Tufte on plotting numeric data by groups | Solomon Messing After my post on making dotplots with concise code using plyr and ggplot, I got an email from my dad who practices immigration law and runs a website with a variety of immigration resources and tools. He pointed out that the post was written for folks who already know that they want to make dot plots, and who already know about bootstrapped standard errors. That’s not many people. In an attempt to appeal to a broader audience, I’m starting a series in which I’ll outline the key principles I use when developing a visualization. In this post, I’ll articulate these principles, which combine some of Tuft’s aesthetic guidelines with Cleveland’s scientific approach to visualization, which is based on the psychological processes involved in making sense of visualizations, and has been rigorously tested via randomized controlled experiments. Based on these principles, I’ll argue that dotplots and scatterplots are better than other types of plots (especially pie charts) in most situations.

Complete Sun and Moon Data for One Day Use these forms to obtain rise, set, and transit times for the Sun and Moon; civil twilight beginning and end times; and, lunar phase information. First, specify the date and location in one of the two forms below. Then, click the "Get data" button at the end of the form. Use Form A for cities or towns in the U.S. or its territories. Use Form B for all other locations. Both forms are immediately below. Be sure to read the Notes section located after the two forms, especially if you wish to use these data for legal purposes. Form A - U.S. Form B - Locations Worldwide Notes Legal Use of the Calculated Data Please see Astronomical Data Used for Litigation if you are interested in using the data produced by this service for legal purposes. Definitions For more information on the terms used, see the Astronomical Almanac On-line Glossary, Rise, Set, and Twilight Definitions or Phases of the Moon and Percent of the Moon Illuminated in the Astronomical Information Center. Time Formats Time Zones

Report Builder 3.0 – Chart Types, Visualizations, and Properties Report Builder 3.0 – Chart Types, Visualizations, and Properties by Jes Borland on March 8, 2012 in category Database Programming Microsoft SQL Server SSRS. Article views: 5,238 Instapaper This is part four of a series about Report Builder 3.0. Charts and other visualizations can be a very powerful and effective way to make your data tell a story. There are many types of charts and other visualizations, and many settings for them. Types of Charts There are eight categories of charts in Report Builder: Column Line Shape Bar Area Range Scatter Polar Column charts will show the data as a series of vertical bars. Stacked Column Line charts show a series of data as a set of points, connected by a line. Line Shapes included are Pie, Doughnut, Funnel, and Pyramid. 3D Funnel Bar charts show the data as horizontal bars. 3D Stacked Horizontal Cylinder Area charts show your data as a series of points, with the space below filled in. Stacked Area Range (Image courtesy of Microsoft TechNet) Data Bar Sparkline

GIS, CAD, GPS, mobile industry maps,news,jobs,data free software Creating your own data visualisations | Louise Brown At the NCVO Annual Conference on Monday I went to the workshop on visualising data. Now, I love data and I love visualisations so I was pretty much like a pig in the stinky stuff. The workshop was run by Ed and Matt from Reason Digital. Visualisations are essentially a way to show (often complex) information using graphics. Whether you have to write reports full of facts and figures or you need to demonstrate the impact of your work to the public then creating a graphical representation of the information might be an effective way of doing it. I have to say that I don’t agree with visualisations for visualisation’s sake. In general there are two types of data visulations. The second type are visualisations that use graphics to illustrate a figure, such as on this poster about Mashable users: In the workshop it was also suggested that an infographic (such as the first example) is telling a specific story and leading you to a conclusion. Have you tried creating visualisations? Like this:

Poodwaddle POODWADDLE WORLD CLOCKThe World Stats Counter (V 7.0) This minute 250 babies will be born, 100 people will die, 20 violent crimes will be reported, and the US debt will climb $1 million. The World Clock tells more than time. The World Clock is too large for a single page. You can rotate the map by dragging left or right or by clicking the left and right buttons on the menu. The menu provides several filter options. We show two types of stats:1.

A Case Study In How Infographics Can Bend The Truth We’ve made the point time and time again that charts and graphs, though they feel official and true, can lie. Rarely do you get to see that at work, but the good folks at Hyperakt have sent us a prime case study in infographic deception. The subject, of course, is politics--and in particular, the raging debate over whether the rich should be made to pay more taxes. "Using the same data, very different stories can be told depending on different agendas," says Deroy Peraza, one of the founders of Hyperakt. A story from the Wall Street Journal's far-right op-ed page gets us started, with a chart showing how much taxable income is made by Americans ranging from the rich to poor: Looking at that, the conclusion seems glaringly obvious: The rich don’t make so much money! The chart most certainly does not demonstrate the Journal’s point. And look closer: The left side of the chart deals with people who make between $0 and $50K. Whoa whoa whoa! Top image: S.

Reference, Facts, News - Refdesk 7 Key Tips for Big Data Dan Kimball is the Chief Marketing Officer of Kontagent, and an advisory board member for AppNation. Follow Dan on Twitter at @dmkimball05. One of my company's clients, the CEO of a top social and mobile game publisher, recently commented: “When you’re learning how to ski, you learn pretty quickly that whatever direction you’re looking is the direction your skis ultimately will take you down the mountain. The same holds true for data: You really need to make sure the metrics you are looking at from the start are the ones that are most important to your business’ success. That seems like a simple concept, but it’s also incredibly insightful, particularly when it comes to big data. The business world has been abuzz about the ability to process petabytes of data at incredible speeds and ultimately leverage it for increased efficiency, engagment and profitability. So, what are some of the lessons learned that can enable organizations to effectively use big data to improve their businesses?

Numbeo - Crowdsourcing of everydaylife information Numbeo is the world’s largest database of user contributed data about cities and countries worldwide. Numbeo provides current and timely information on world living conditions including cost of living, housing indicators, health care, traffic, crime and pollution. Numbeo is a collection of Web pages containing numerical and other itemizable data about cities and countries, designed to enable anyone to contribute or modify content. Numbeo uses the wisdom of the crowd to obtain the most reliable information possible. Numbeo then provides you with a statistical analysis of the data collected. 4,066,631 prices in 6,991 cities entered by 447,697 contributors