Twitter Census: Publishing the First of Many Datasets | blog.inf As useful as the Twitter API is, developers, designers, and researchers have long clamored for more than the trickle of data that service currently allows. We agree — some of the sexiest uses of data require processing not just all that is now, but the vast historical record. Twitter doesn’t provide the only use case for this , but until now its historical bulk data has been hard to find. Today we are publishing a few items collected from our large scrape of Twitter’s API.
When we first launched public data on Google.com, we wanted to make statistics easier to find and to encourage debate based on facts rather than intuition. The day after we launched, a friend who worked at the World Bank called me, her voice filled with enthusiasm, "Did you know that the World Bank also just released an API for their data?" Excited, I checked it out , and found an amazing treasure trove of statistics for most economies in the world. After some hard work and analysis, today we're happy to announce that 17 World Development Indicators (list below*) are now conveniently available to you in Google search. With today's update, you can quickly access more data with a broad range of queries. Search should be intuitive, so we've done the work to think through queries where public data will be most relevant to you. World Bank public data, now in search
Public Datasets « Elastic Web Mining | Bixolabs Bixo Labs has merged with Scale Unlimited and is now providing complete consulting and training services for a wide range of big data problems, including web crawling, data mining and search. Why Did Bixo Labs Merge With Scale Unlimited? During client engagements, we repeatedly saw the need for mentoring and training to ensure a smooth hand-off of projects to internal team members. In addition, Bixo Labs has already been teaching Hadoop classes under contract with Scale Unlimited. Given our increased focus on mentoring and training, it made sense for Bixo Labs to acquire and merge with Scale Unlimited, to provide complete consulting solutions that include support for bringing our clients’ internal staff up-to-speed on the open source technologies we use, such as Hadoop, Solr and Cascading.