background preloader

Analytics

Facebook Twitter

DataCenter Access - Subscription Center. Salary surveys for data scientists and related job titles.

Accountability Tools

Mining of Massive Datasets. The book has now been published by Cambridge University Press. The publisher is offering a 20% discount to anyone who buys the hardcopy Here. By agreement with the publisher, you can still download it free from this page. Cambridge Press does, however, retain copyright on the work, and we expect that you will obtain their permission and acknowledge our authorship if you republish parts or all of it. We are sorry to have to mention this point, but we have evidence that other items we have published on the Web have been appropriated and republished under other names. --- Jure Leskovec, Anand Rajaraman (@anand_raj), and Jeff Ullman Download Version 2.1 The following is the second edition of the book, which we expect to be published soon. There is a revised Chapter 2 that treats map-reduce programming in a manner closer to how it is used in practice, rather than how it was described in the original paper.

Version 2.1 adds Section 10.5 on finding overlapping communities in social graphs. CodeSkulptor.

Visualization

Unstructured data is worth the effort when you've got the right tools. It’s dawning on companies that data analysis can yield insights and inform business decisions. As data-driven benefits grow, so do our demands about what more data can tell us and what other types we can mine. During her PhD studies, Alyona Medelyan (@zelandiya) developed Maui, an open source tool that performs as well as professional librarians in identifying main topics in documents.

Medelyan now leads the research and development of API-based products at Pingar. Pingar senior software researcher Anna Divoli (@annadivoli) studied sentence extraction for semi-automatic annotation of biological databases. “Big data is important in many diverse areas, such as science, social media, and enterprise,” observes Divoli. How did you get started in big data? Anna Divoli: I began working with big data as it relates to science during my PhD. Alyona Medelyan: Like Anna, I mainly focus on unstructured data and how it can be managed using clever algorithms. What projects are you working on now?

Data-bases

Tehnical Analysis Resources. Analytics Professionals. Marketing Data Generic. Attribution. Social Analytics. Non-marketing Analytics.