background preloader

Big Data, Data Mining, Predictive Analytics, Statistics, StatSoft Electronic Textbook

Big Data, Data Mining, Predictive Analytics, Statistics, StatSoft Electronic Textbook
This free ebook has been provided as a public service since 1995. Statistics: Methods and Applications textbook offers training in the understanding and application of statistics and data mining. It covers a wide variety of applications, including laboratory research (biomedical, agricultural, etc.), business statistics, credit scoring, forecasting, social science statistics and survey research, data mining, engineering and quality control applications, and many others. The Textbook begins with an overview of the relevant elementary (pivotal) concepts and continues with a more in depth exploration of specific areas of statistics, organized by "modules", representing classes of analytic techniques. You have filtered out all documents. Related:  StatsEstadística

Free Statistics Book How to use R R is a powerful, free and open source, cross-platform, statistical and graphing software package;programming language;software environment for statistical computing. Downloading R[edit] Visit the R Project home page. Tutorials[edit] Books that are Helpful When Learning R[edit] See also[edit] External links[edit] Books[edit] Census at School - United States Census at School is an international classroom project that engages students in grades 4–12 in statistical problemsolving. Students complete a brief online survey, analyze their class census results, and compare their class with random samples of students in the United States and other countries. More Census at School New Zealand now hosts the random sampler for the international Census at School data, the New Zealand data, and also for the cleaned USA data. Their online random sampler allows students and teachers to take random samples up to size 1000 from the international, New Zealand, or U.S. database and either download the data or start up the free, online iNZight Lite software with the data already loaded and ready for analysis. The international database includes data from Australia, Canada, New Zealand, the United Kingdom, and the United States. The American Statistical Association and Population Association of America are seeking champions to expand U.S.

HyperStat Online: An Introductory Statistics Textbook and Online Tutorial for Help in Statistics Courses Recommend HyperStat to your friends on Facebook Click here for more cartoons by Ben Shabad. Other Sources NIST/SEMATECH e-Handbook of Statistical Methods Stat Primer by Bud Gerstman of San Jose State University Statistical forecasting notes by Robert Nau of Duke University related: RegressIt Excel add-in by Robert Nau CADDIS Volume 4: Data Analysis (EPA) The little handbook of statistical practice by Gerard E. Stat Trek Tutorial Statistics at square 1 by T. Concepts and applications of inferential statistics by Richard Lowry of Vassar College CAST by W. SticiGui by P. SurfStat by Keith Dear of the University of Newcastle. Introductory statistics: Concepts, models, and applications by David W. Multivariate statistics: Concepts, models, and applications by David W. Electronic textbook by StatSoft A new view of statistics by Will Hopkins of the University of Otago The knowledge base: An online research methods textbook by William M.

What are Basic Statistics Descriptive Statistics "True" Mean and Confidence Interval. Probably the most often used descriptive statistic is the mean. The mean is a particularly informative measure of the "central tendency" of the variable if it is reported along with its confidence intervals. Shape of the Distribution, Normality. More precise information can be obtained by performing one of the tests of normality to determine the probability that the sample came from a normally distributed population of observations (e.g., the so-called Kolmogorov-Smirnov test, or the Shapiro-Wilks' W test. The graph allows you to evaluate the normality of the empirical distribution because it also shows the normal curve superimposed over the histogram. Correlations Purpose (What is Correlation?) The most widely-used type of correlation coefficient is Pearson r, also called linear or product- moment correlation. Simple Linear Correlation (Pearson r). How to Interpret the Values of Correlations. Significance of Correlations.

Mathematicians mapped out every “Game of Thrones” relationship to find the main character — Quartz Fans of the Game of Thrones books and TV series have long quarreled over who the true hero of the story is. Daenerys? Tyrion? But several main characters remain. Andrew J. The books and HBO fantasy series, with their massive cast of characters, various shifting allegiances, and intricate relationship dynamics, were a perfect fit to be studied mathematically. “This is a fanciful application of network science,” Beveridge told Quartz. The pair started by connecting characters every time they “interacted” in the third book of the series, A Storm of Swords. The resulting network structure (above) broke the characters into extremely accurate communities that show the geographical, familial, and even adversarial ties between them. “We didn’t tell it what the communities were, the network actually tells you what the communities are,” Beveridge said. Then the mathematicians ranked the characters by several different measures.

R by example Basics Reading files Graphs Probability and statistics Regression Time-series analysis All these examples in one tarfile. Outright non-working code is unlikely, though occasionally my fingers fumble or code-rot occurs. Other useful materials Suggestions for learning R The R project is at : In particular, see the `other docs' there. Over and above the strong set of functions that you get in `off the shelf' R, there is a concept like CPAN (of the perl world) or CTAN (of the tex world), where there is a large, well-organised collection of 3rd party software, written by people all over the world. The dynamism of R and of the surrounding 3rd party packages has thrown up the need for a newsletter, R News. library(help=boot) library(boot) ? But you will learn a lot more by reading the article Resampling Methods in R: The boot package by Angelo J. Ajay Shah, 2005

This is Statistics | Statistics Jobs Around the World All your Bayes are belong to us! This week's post contains solutions to My Favorite Bayes's Theorem Problems, and one new problem. If you missed last week's post, go back and read the problems before you read the solutions! If you don't understand the title of this post, brush up on your memes. 1) The first one is a warm-up problem. Suppose there are two full bowls of cookies. First the hypotheses: A: the cookie came from Bowl #1 B: the cookie came from Bowl #2 And the priors: P(A) = P(B) = 1/2 The evidence: E: the cookie is plain And the likelihoods: P(E|A) = prob of a plain cookie from Bowl #1 = 3/4 P(E|B) = prob of a plain cookie from Bowl #2 = 1/2 Plug in Bayes's theorem and get P(A|E) = 3/5 You might notice that when the priors are equal they drop out of the BT equation, so you can often skip a step. 2) This one is also an urn problem, but a little trickier. The blue M&M was introduced in 1995. A friend of mine has two bags of M&Ms, and he tells me that one is from 1994 and one from 1996. Again, P(A) = P(B) = 1/2.

Course Catalog "Web forums are excellent." S. Clark, GlaxoSmithKline "Our company is coming to the end of our first year working with statistics.com as its primary provider of statistical training and I wanted to thank you for the excellent classes and overall service that you have provided. The classes have been consistently on target and provided a foundation that have allowed my team to take their new skills and immediately implement them in research efforts. My research team is significantly more skilled and efficient than they were at the beginning of the year. J. "Considering all of the material that needed to be covered, I thought the course was well written and thought provoking." P.

Related: