Bayesian statistics: a comprehensive course This playlist provides a complete introduction to the field of Bayesian statistics. It assumes very little prior knowledge and, in particular, aims to explain concepts with as little maths as possible. The course covers the following topics: probability distributions, marginal and conditional probability, the Bayesian formula, the difference between Bayesian and Frequentist statistics, likelihood, how to specify a prior, the probability of data given model choice, an introduction to the probability distributions commonly used in Bayesian data analysis, conjugate priors, credible intervals, highest density posterior intervals, objective Bayesian data analysis, the Jeffreys prior, reference priors, Zellner's g-priors, forecasting in Bayesian systems, Markov Chain Monte Carlo, grid approximations, Metropolis-Hastings sampling, Gibbs sampling, hypothesis testing (classical test analogues and pure Bayesian methods), hierarchical models, hyperpriors, and linear regression.

LeaRning Path on R - Step by Step Guide to Learn Data Science on R One of the common problems people face in learning R is the lack of a structured path. They don’t know where to start, how to proceed, or which track to choose. Although there is an abundance of good free resources on the Internet, this can be overwhelming as well as confusing. To create this R learning path, Analytics Vidhya and DataCamp sat together and selected a comprehensive set of resources to help you learn R from scratch. This learning path is a great introduction for anyone new to data science or R, and if you are a more experienced R user you will be updated on some of the latest advancements.

Statistics Using Technology I hope you find this book useful in teaching statistics. When writing this book, I tried to follow the GAISE Standards (2014, January 05), which are: Emphasize statistical literacy and develop statistical understanding. Use real data. Stress conceptual understanding, rather than mere knowledge of procedure. Foster active learning in the classroom. Use technology for developing concepts and analyzing data. To this end, I ask students to interpret the results of their calculations.

HyperStat Online: An Introductory Statistics Textbook and Discussion of whether most published research is false Other Sources: Stat Primer by Bud Gerstman of San Jose State University; Statistical forecasting notes by Robert Nau of Duke University (related: RegressIt Excel add-in by Robert Nau); CADDIS Volume 4: Data Analysis (EPA); The little handbook of statistical practice by Gerard E.

Collaborative Statistics Have you heard others say, "You're taking statistics? That's the hardest course I ever took!" They say that, because they probably spent the entire course confused and struggling. They were probably lectured to and never had the chance to experience the subject.

Weka 3 - Data Mining with Open Source Machine Learning Software in Java Weka is a collection of machine learning algorithms for data mining tasks. The algorithms can either be applied directly to a dataset or called from your own Java code. Weka contains tools for data pre-processing, classification, regression, clustering, association rules, and visualization. It is also well-suited for developing new machine learning schemes. Found only on the islands of New Zealand, the Weka is a flightless bird with an inquisitive nature.

A geometric interpretation of the covariance matrix Introduction In this article, we provide an intuitive, geometric interpretation of the covariance matrix, by exploring the relation between linear transformations and the resulting data covariance. Most textbooks explain the shape of data based on the concept of covariance matrices. Instead, we take a backwards approach and explain the concept of covariance matrices based on the shape of data. In a previous article, we discussed the concept of variance, and provided a derivation and proof of the well known formula to estimate the sample variance. Figure 1 was used in this article to show that the standard deviation, as the square root of the variance, provides a measure of how much the data is spread across the feature space.
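The relation this excerpt mentions between linear transformations and the resulting data covariance can be checked numerically. The sketch below is not taken from the linked article: the matrix `A`, the sample size, and the helper names are illustrative assumptions. It relies on the standard identity that if data with covariance Σ is mapped through a linear transformation A, the transformed data has covariance A Σ Aᵀ, and it also confirms that the standard deviation is the square root of the variance.

```python
import math
import random
import statistics


def covariance_matrix(points):
    """Sample covariance matrix (2x2, n-1 denominator) of (x, y) points."""
    n = len(points)
    mean_x = sum(x for x, _ in points) / n
    mean_y = sum(y for _, y in points) / n
    sxx = sum((x - mean_x) ** 2 for x, _ in points) / (n - 1)
    syy = sum((y - mean_y) ** 2 for _, y in points) / (n - 1)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in points) / (n - 1)
    return [[sxx, sxy], [sxy, syy]]


def matmul(a, b):
    """Product of two 2x2 matrices stored as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]


def transpose(a):
    """Transpose of a 2x2 matrix."""
    return [[a[j][i] for j in range(2)] for i in range(2)]


def apply_linear(points, a):
    """Apply the linear map `a` to every (x, y) point."""
    return [(a[0][0] * x + a[0][1] * y, a[1][0] * x + a[1][1] * y)
            for x, y in points]


# Roughly uncorrelated, unit-variance starting data (seeded for repeatability).
rng = random.Random(0)
white = [(rng.gauss(0, 1), rng.gauss(0, 1)) for _ in range(5000)]

# Hypothetical transformation: scale x by 2, then mix x into y (a shear).
A = [[2.0, 0.0], [1.0, 0.5]]
data = apply_linear(white, A)

cov_white = covariance_matrix(white)
cov_data = covariance_matrix(data)

# Covariance predicted from the transformation alone: A * cov_white * A^T.
cov_predicted = matmul(matmul(A, cov_white), transpose(A))

# Standard deviation of the transformed x-coordinate, two ways.
std_x = statistics.stdev(x for x, _ in data)
std_x_from_cov = math.sqrt(cov_data[0][0])

print("measured: ", cov_data)
print("predicted:", cov_predicted)
```

The measured and predicted matrices agree up to floating-point error, because the sample covariance is itself linear in the data. The off-diagonal entry is nonzero only because the shear in `A` mixes the x-coordinate into y, which is the geometric point: the shape of the data encodes the transformation that produced it.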

The Season for Sharing Data: Working with the newly released Census 2010-2014 ACS 5 year data in R On December 3, 2015 the U.S. Census Bureau released the 2010-2014 5-year ACS (American Community Survey) data. You can read all about it on the Census website. This fantastic five-year statistical database provides aggregate social and economic characteristics about American individuals and families down to the block group level. A number of online tools provide access to the ACS 2010-2014 data through graphical user interfaces (GUIs). These include the Census American FactFinder tool and Social Explorer.

