Statistics vs Data Science vs BI. As someone who trained as a statistician, I've always struggled with that title. I love the rigor and insight that Statistics brings to data analysis, but let's face it: Statistics — the name — has always had a bit of a branding problem. Telling someone I was a statistician was more likely to conjure up images of me counting runs at a baseball (or cricket) game than pursuing serious science.
And the image of what Statistics ideally is about — collaborative, interactive, applied, fun — was too often subsumed by the stereotype image — isolated, actuarial, ivory tower, report driven. (And hey, even actuaries can be fun sometimes.) That's why I'm a fan of the term "data scientist" — it embodies everything that Statistics always should be, without the baggage and tradition of the term "statistician". On the other hand, I have no qualms about making a competitive comparison between Data Science and Business Intelligence: Kalido: Data Scientist: Your Must-Have Business Investment NOW. 2013 is all about statistics!
The Community for Information Management, Business Intelligence & Analytics. Hans Rosling: Religions and babies. The Birthday Problem. May 16, 2012 Many of you have seen The Birthday Problem: Given a group of n people, what is the probability that someone shares a birthday? Here, we are only concerned with birth day and month (not year). The solution assumes that a person is equally born on any of the 365 days in the year, thus ignoring leap years.
Let P(n) = the probability that someone shares a birthday in a group of n people and let Q(n) = the probability that everyone has unique birthdays. There are 365^n ways for n people to be born on any of the 365 days.Then P(n) = 1 – Q(n) = 1 – (365*364*…*(365-n+1))/365^n. P(n) P(60) = 0.9941 –> in a room with 60 people, you are almost certain to have at least two people that share a birthday! The key assumption is that all birth dates are equally likely. This will, of course, change our answer above. On a side note, the image below suggests that babies are induced on December 27-30 for a tax break. How likely are people to be born on different birth dates? Like this: Like Loading... R news & tutorials from the web. English Fluency Is Greater in Philippines than in India or UK - The Daily Stat - April 23. Which Social Network Should You Use -- and When? [INFOGRAPHIC]
Video Boosts Brand Engagement, Site Visits. Many marketers have moved past a direct-response-centric model for online display advertising, recognizing that despite low clickthrough rates, banner ads also have a branding effect. And research suggests that adding rich media or video to those banner ads can improve both types of response—increasing the likelihood users will click the ads as well as boosting the lingering brand awareness that results from viewing. Ad solution provider MediaMind found that web users in North America who were exposed to a campaign that included rich media display ads were nearly three times as likely as those who saw only standard banners to end up at a marketer’s website—either by clicking on the ad directly or by navigating to the site at a later date.
Those exposed to banners that included online video were about 5.6 times as likely to visit a marketer's site as those exposed to standard banners. The branding effect was smaller, but still evident. Market research & statistics: Internet marketing, advertising & demographics. Finding the beauty in numbers... - experimentalist. Blog year 2010 in review. Exploring MARS: a Tutorial on Multivariate Adaptive Regression Splines. Duncan Williamson: box and whisker diagrams: getting Microsoft Excel to plot them for you boxplot.html.
Box and Whisker Diagrams: The Microsoft Excel 2003 Solution Introduction See also pages two and three in this series for more up to date versions of this page: look at the menu on the left. The updated versions are also explained in fulll in Chapter Seven of my Excel Book: The Excel Project ... Kindle Version Paperback Version This page takes us a stage further down the road of analysing the dispersion or variability of data sets. We used the standard deviation as a measure that allowed us to compare data sets and say that if one data set had a standard deviation of, say, 3 and the other data set had a standard deviation of, say, 1.2, then we could conclude that the first data set was more disperse than the second data set.
The problem with the standard deviation, however, is that many people find it an abstract idea and because of that they find it difficult both to calculate and interpret. Let x1, x2, … xn be a set of n measurements … arranged in increasing (or decreasing) order. Your Turn.