
Data Sets


EDM WhitePaper 17062015. DBpedia. List of datasets for machine learning research - Wikipedia. These datasets are used for machine learning research and have been cited in peer-reviewed academic journals and other publications.

List of datasets for machine learning research - Wikipedia

Datasets are an integral part of the field of machine learning. Major advances in this field can result from advances in learning algorithms (such as deep learning), computer hardware, and, less intuitively, the availability of high-quality training datasets.[1] High-quality labeled training datasets for supervised and semi-supervised machine learning algorithms are usually difficult and expensive to produce because of the large amount of time needed to label the data.

Although they do not need to be labeled, high-quality datasets for unsupervised learning can also be difficult and costly to produce.[2][3][4][5] This list aggregates, from multiple data repositories, high-quality datasets that have been shown to be of value to the machine learning research community, providing greater coverage of the topic than is otherwise available.

Reviews. Information is Beautiful. FlowingData. Our World In Data. 170 Amazing Twitter Statistics and Facts (July 2016) Discover Relevant Business Information. 1st level connections of LinkedIn users 2015. The Open Database Of The Corporate World.