Infochimps Data Marketplace + Commons: Download Sell or Share Databases, statistics, datasets for free

Cool Infographics - Cool Posters A collection of great infographic posters from around the world. Click the image to be taken to the poster site to view details and order yourself a copy. Purchasing posters through these links will help support Cool Infographics on most of the posters below. Thank you!

Slides, Tools and Other Resources From the School of Data Journalism 2013 The School of Data Journalism, Europe's biggest data journalism event, brings together around 20 panelists and instructors from Reuters, New York Times, Spiegel, Guardian, Walter Cronkite School of Journalism, Knight-Mozilla OpenNews and others, in a mix of discussions and hands-on sessions focusing on everything from cross-border data-driven investigative journalism, to emergency reporting and using spreadsheets, social media data, data visualisation and mapping for journalism. In this post we will be listing links shared during this training event. The list will be updated as the sessions progress. If you have links shared during the sessions that we missed, post them in the comments section and we will update the list.

Pattern Pattern is a web mining module for the Python programming language. It has tools for data mining (Google, Twitter and Wikipedia API, a web crawler, an HTML DOM parser), natural language processing (part-of-speech taggers, n-gram search, sentiment analysis, WordNet), machine learning (vector space model, clustering, SVM), network analysis and <canvas> visualization. The module is free, well-documented and bundled with 50+ examples and 350+ unit tests. Pattern is written for Python 2.5+ (no support for Python 3 yet).

Big data: The next frontier for innovation, competition, and productivity Big data will become a key basis of competition, underpinning new waves of productivity growth, innovation, and consumer surplus, as long as the right policies and enablers are in place. The amount of data in our world has been exploding, and analyzing large data sets (so-called big data) will become a key basis of competition, underpinning new waves of productivity growth, innovation, and consumer surplus, according to research by MGI and McKinsey's Business Technology Office. Leaders in every sector will have to grapple with the implications of big data, not just a few data-oriented managers. The increasing volume and detail of information captured by enterprises, together with the rise of multimedia, social media, and the Internet of Things, will fuel exponential growth in data for the foreseeable future.

Research Publication: Sawzall Interpreting the Data: Parallel Analysis with Sawzall, by Rob Pike, Sean Dorward, Robert Griesemer and Sean Quinlan. Abstract: Very large data sets often have a flat but regular structure and span multiple disks and machines. Examples include telephone call records, network logs, and web document repositories.

Royal Society journal archive made permanently free to access (26 October 2011) Around 60,000 historical scientific papers are accessible via a fully searchable online archive, with papers published more than 70 years ago now becoming freely available. The Royal Society is the world’s oldest scientific publisher, with the first edition of Philosophical Transactions of the Royal Society appearing in 1665. Henry Oldenburg, Secretary of the Royal Society and first Editor of the publication, ensured that it was “licensed by the council of the society, being first reviewed by some of the members of the same”, thus making it the first ever peer-reviewed journal.

FAQ: The Geography of Hate Dear Readers, Thanks to everyone (well, almost everyone) for their comments and constructive critiques on our Geography of Hate map. In light of all of the different directions these comments have come from, we wanted to respond to some of the more common questions and misunderstandings all at once. Before commenting or emailing about the map, please keep the following in mind... 1. First, read our original post. Second, read through this FAQ. Third, read the "Details about this map" section included in the interactive map itself.

Essential Resources: Mapping applications, frameworks and libraries This is part of a series of posts to share with readers a useful collection of some of the most important, effective and practical data visualisation resources. This post presents the many different options for visualising spatial data. Please note, I may not have personally used all the packages or tools presented but have seen sufficient evidence of their value from other sources. Whilst some inclusions may be contentious from a quality/best-practice perspective, they may still offer some good features and provide value to a certain audience out there. Finally, to avoid re-inventing the wheel, descriptive text may have been reproduced from the native websites if they provide the most articulate descriptions.

How to seed a two-sided marketplace with the Yelp Model... and why Craigslist hates it! I’ve talked about the chicken and egg problem in seeding two-sided marketplaces at length earlier. Producers won’t show up without Consumers and vice versa. One of the models that I’d proposed, which works especially well for startups like ShopKick, is to build the value proposition in such a manner that your producers bring in the consumers.
