Can Kaggle make data science a spectator sport? — Data. The Data Scientist Will Be Replaced By Tools. Why The Search For The Mystical Data Scientist Should Not Be A Feat Of Magic. The data scientist is a mystical spirit.
A wizard, whose skills are fired in the deep unknown of a developer’s lair. Their secrets are worth the gold of a million empires.They possess the keys to eternity.They have pet dragons. Not! It’s time to take away the staff and stop thinking of data scientists as lord wizards of middle earth lore. Charlotte Prepares Students To Meet Demand For Data Scientists. What Everyone Needs to Learn from the Data Journalism Handbook. IBM VP Anjul Bhambhri on the Era of the Data Scientist. Just a few short years ago, the problem of database size scaling to colossal capacities that exceeded the scope of entire network storage units, seemed insurmountable.
Today, it's practically under control, with a wealth of open source technology emerging not from database engineers but rather from Internet architects. Hadoop has transformed the very nature of transformation, becoming one of the most readily adopted technologies in the history of the data center. But is it mature? And will businesses have access to the right people with the skill sets necessary to master this new aspect of information management? After having spent five years as a senior engineer at Sybase, another six years as a development director at Informix, and over three years managing DB2 development for IBM, Anjul Bhambhri is arguably one of the most skilled plain data architects in the business.
From that standpoint, what has been happening in the open source community has been fabulous. Do you need a data scientist? How can big data and smart analytics tools ignite growth for your company?
Find out at DataBeat, May 19-20 in San Francisco, from top data scientists, analysts, investors, and entrepreneurs. Register now and save $200! Some of the world’s biggest tech companies from Google to Facebook are data-driven, but few startup founders have any idea what a data scientist does, never mind whether they should hire one. Here is VentureBeat’s guide to data science for startups.
EMC Greenplum's Steven Hillion on What Is a Data Scientist? Amazon's John Rauser on "What Is a Data Scientist?" Does Science Need More Compelling Stories to Foster Public Trust? Image courtesy of iStockphoto/SchulteProductions The touching stories that advocacy groups are so good at telling—the 49-year old mother whose breast cancer was detected by an early mammogram before it had spread; the 60-year-old neighbor who had a prostate tumor removed thanks to a routine PSA test—should inspire scientists to use anecdotes of their own, argue two doctors from the University of Pennsylvania.
In the scientific realm, anecdotal evidence—the individual patient, the single result—tends to be shunned in favor of large, dense data sets and impersonal statistical analyses. Growing Your Own Data Scientists. CIOs and CTOs must learn to address a challenge, involving the divide between the people who know about the vast amount of new sources of data emanating from machines and other devices (“big data”) and the questions in the enterprise whose answers can be monetized.
LinkedIn's Daniel Tunkelang On "What Is a Data Scientist?" How to be a data journalist. Data journalism is huge.
I don't mean 'huge' as in fashionable - although it has become that in recent months - but 'huge' as in 'incomprehensibly enormous'. It represents the convergence of a number of fields which are significant in their own right - from investigative research and statistics to design and programming. The idea of combining those skills to tell important stories is powerful - but also intimidating. Who can do all that? The reality is that almost no one is doing all of that, but there are enough different parts of the puzzle for people to easily get involved in, and go from there. 1. 'Finding data' can involve anything from having expert knowledge and contacts to being able to use computer assisted reporting skills or, for some, specific technical skills such as MySQL or Python to gather the data for you. 2. 3. 4. Tools such as ManyEyes for visualisation, and Yahoo! How to begin? So where does a budding data journalist start?
Play around. And you know what? Big Data Technology Evaluation Checklist. Anyone who’s been following the rapid-fire technology developments in the world that is becoming known as “big data” sees a new capability, product, or company founded literally every week.
The ambition of all of these players, established and newcomer, is tremendous, because the potential value to business is enormous. Each new arrival is aimed at addressing the pain that enterprises are experiencing around unrelenting growth in the velocity, volume, and variety of the data their operations generate. What’s being lost, however, in some of this frothy marketing activity, is that it’s still early for big data technologies. There are vexing problems slowing the growth and the practical implementation of big data technologies.
For the technologies to succeed at scale, there are several fundamental capabilities they should contain, including stream processing, parallelization, indexing, data evaluation environments and visualization. Some general questions to begin the evaluation process: