background preloader

Data Science

Facebook Twitter

50 New Data Science Resources. Top 50 Data Science Resources: The Best Blogs, Forums, Videos and Tutorials to Learn All about Data Science.

50 New Data Science Resources

Published by Ngdata.com. This is a short version (summary). The field of data science is constantly evolving and ever-advancing, with new technologies placing more valuable insights in the hands of modern enterprises. How to choose a statistical model. 19 Excellent Free & Open Data Sources for #DataScienceData-Mania by Lillian Pierson. Painless-conjugate-gradient.pdf.

Metronic - Responsive Admin Dashboard Template. R-intro.pdf. NSPpart.pdf. New Windows into Law Using Big Data Analytics March 19, 2015. Quant Education. Book. Customised Social Intelligence Engine. Welcome · Advanced R. Robinson-icebreaker.pdf. R news and tutorials contributed by (573) R bloggers. The R language, for programmers. IntroductionAssignment and underscoreVariable name gotchasVectorsSequencesTypesBoolean operatorsListsMatricesMissing values and NaNsCommentsFunctionsScopeMisc.Other resources Introduction I have written software professionally in perhaps a dozen programming languages, and the hardest language for me to learn has been R.

The R language, for programmers

The language is actually fairly simple, but it is unconventional. These notes are intended to make the language easier to learn for someone used to more commonly used languages such as C++, Java, Perl, etc. R is more than a programming language. This document is a work in progress. Assignment and underscore The assignment operator in R is <- as in e <- m*c^2. It is also possible, though uncommon, to reverse the arrow and put the receiving variable on the right, as in m*c^2 -> e. R course notes. Statistics, R, Graphics and Fun. R by example. Basics Reading files Graphs.

R by example

Statistics with R. Warning Here are the notes I took while discovering and using the statistical environment R.

Statistics with R

However, I do not claim any competence in the domains I tackle: I hope you will find those notes useful, but keep you eyes open -- errors and bad advice are still lurking in those pages... Should you want it, I have prepared a quick-and-dirty PDF version of this document. The old, French version is still available, in HTML or as a single file. You may also want all the code in this document. 1. This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 2.5 License. Quick-R: Home Page. Resources to help you learn and use R. R Tutorial — R Tutorial. 321a Boyd Graduate Studies University of Georgia Athens, Georgia 30602.

R Tutorial — R Tutorial

Big Data for Small Businesses. Rvest: easy web scraping with R. Rvest is new package that makes it easy to scrape (or harvest) data from html web pages, inspired by libraries like beautiful soup.

rvest: easy web scraping with R

It is designed to work with magrittr so that you can express complex operations as elegant pipelines composed of simple, easily understood pieces. Install it with: install.packages("rvest") Random variables and probability distributions. Probability and statistics. Learning To Be A Data Scientist. DataTau. The Analytics Edge. Rvest: easy web scraping with R. Web-Scraping: the Basics. Quantifying Memory: Web Scraping: working with APIs. APIs present researchers with a diverse set of data sources through a standardised access mechanism: send a pasted together HTTP request, receive JSON or XML in return.

Quantifying Memory: Web Scraping: working with APIs

Today we tap into a range of APIs to get comfortable sending queries and processing responses. These are the slides from the final class in Web Scraping through R: Web scraping for the humanities and social sciences This week we explore how to use APIs in R, focusing on the Google Maps API. We then attempt to transfer this approach to query the Yandex Maps API. Finally, the practice section includes examples of working with the YouTube V2 API, a few 'social' APIs such as LinkedIn and Twitter, as well as APIs less off the beaten track (Cricket scores, anyone?). I enjoyed teaching this course and hope to repeat and improve on it next year. Download the .Rpres file to use in Rstudio here. Going deeper with dplyr: New features in 0.3 and 0.4 (video tutorial) Introduction to R. 2x1. Download Companion Content for Microsoft Official Courseware. Notebook. E-Learning for Deloitte. Customer Lifetime Value.

Beginner's guide to R: Get your data into R. Beginner's guide to R: Get your data into R. Learn how to crunch big data with R. A few years ago I was the CTO and co-founder of a startup in the medical practice management software space.

Learn how to crunch big data with R

One of the problems we were trying to solve was how medical office visit schedules can optimize everyone’s time. Too often, office visits are scheduled to optimize the physician’s time, and patients have to wait way too long in overcrowded waiting rooms in the company of people coughing contagious diseases out their lungs. One of my co-founders, a hospital medical director, had a multivariate linear model that could predict the required length for an office visit based on the reason for the visit, whether the patient needs a translator, the average historical visit lengths of both doctor and patient, and other possibly relevant factors. One of the subsystems I needed to build was a monthly regression task to update all of the coefficients in the model based on historical data. 24 beautifully-designed web dashboards that data geeks will love. We live in a world of big (and little) data, and many people have to make sense of numbers as part of their job.

24 beautifully-designed web dashboards that data geeks will love

The trouble is that there can be a lot of friction involved when mining the data. This is where dashboards come into play: a well-designed dashboard can save huge amounts of time, helping people to quickly identify the numbers that matter, in order to make insightful observations or to compile reports. Dashboard design is a tricky business. The challenge is to communicate the key numbers in a straightforward way, while allowing users to drill down into the specifics.

It is about avoiding clutter, about catering for personalisation, and about the prioritisation of the right metrics. Data dashboards for businesses. E-Learning for Deloitte. Active Data Mining, Data Science blogs. Here are 85 or so active (recently updated) data mining, data science, and machine learning blogs.

Active Data Mining, Data Science blogs

By Qian Wang. Here are some of the most interesting and regularly-updated blogs on Data Mining, Data Science, and related topics, given in alphabetical order. If we missed some, please add in comments below. Using Excel to Scrape Twitter. April 23rd, 2012 by Tim Resnik We work with several publishing clients that manage a multitude of Twitter accounts.

We frequently run across the problem of spending too much time compiling a large cross section of key Twitter stats for multiple accounts. For example, we have a client that has over 20 Twitter accounts for unique brands unique brands. Analytics, Data Mining, and Data Science.