background preloader

Data mining

Facebook Twitter

Data Mining Map. 推荐系统的循序进阶读物(从入门到精通) - 张子柯的博文. Publications by Googlers. Data mining without prejudice. The information age is also the age of information overload.

Data mining without prejudice

Companies, governments, researchers and private citizens are accumulating digital data at an unprecedented rate, and amid all those quintillions of bytes could be the answers to questions of vital human interest: What environmental conditions contribute most to disease outbreaks? What sociopolitical factors contribute most to educational success? What player statistics best predict a baseball team’s win-loss record?

There are a host of mathematical tools for finding possible relationships among data, but most of them require some prior knowledge about what those relationships might be. The problem becomes much harder if you start with a blank slate, and harder still if the datasets are large. MINE: Maximal Information-based Nonparametric Exploration. 使用 Ruby 和 Twitter 进行数据挖掘. 2008 年 10 月,与其他许多人一样,出于好奇,我创建了一个 Twitter 帐户。

使用 Ruby 和 Twitter 进行数据挖掘

与大多数人一样,我与朋友建立连接,随意进行一些搜索,以便更好地理解这项服务。 使用 140 个字符进行通信似乎并不是使 Twitter 广受欢迎一条创意。 一个不相关的事件帮助我理解了 Twitter 的真实价值。 2009 年 7 月初,我的 Web 托管提供者突然无法使用。