Keep an Eye on the emerging Open-Source Analytics Stack | Smart Data Collective. This post is contributed by Revolution Analytics CEO Norman Nie, and cross-posted from the Future of Open Source Forum. A lot of attention has been focused recently on Big Data, and rightly so: Big Data is a Big Deal. (See this LinuxInsider article, Big Data, Big Open Source Tools, for a compehensive overview of Big Data issues.) But what, exactly, is Big Data? Ed Dumbill of O’Reilly defines Big Data as “data that becomes large enough that it cannot be processed using conventional methods”.
That’s a good definition: Big Data is primarily about your capability of making use of the data, not just its size in terabytes or petabytes. In that same article, Dumbill coins a term to represent the emerging software infrastructure — typically open source, distributed, and running on commodity hardware — for handling Big Data: “A stack for big data systems has emerged, comprising layers of Storage, MapReduce and Query (SMAQ).”
To be fair, the answer is yes — sort of. Open source wins, hands down. Chris Harrison's Visualization Projects. Information aesthetics - Information Visualization & Visual Communication. Visualisation Magazine | collates the most creative and innovative visualisations of information. VizWorld.com - Visualization, Computer Graphics, and Animation.