Apache Hadoop Distribution

https://www.mapr.com/home

Related: NOSQL, big data

MVC, MOVE - Or Simply A State Machine One problem with a state machine approach compared to MVC is that it isn't as familiar. Do you use a Moore or a Mealy machine? A combination of the two?

Learn Data Science by nborwankar Who Nitin Borwankar - primary developer (Sponsored by Pivotal Inc. and Alpine Data Labs). What

Install DSS on Windows with Docker - Dataiku - Collaborative Data Science Platform Dataiku DSS only runs on Linux servers for production usage. Support of Mac OS X is available for experimentation, evaluation purpose and playing with Kaggle datasets. Dataiku does not provide any version of DSS for Microsoft Windows. However, you have the option to virtualize a Linux system on Windows and install DSS on it.

Couchbase Server Manual 2.0 - Chapter 9. Views and Indexes Couchbase Server is a NoSQL document database for interactive web applications. It has a flexible data model, is easily scalable, provides consistent high performance and is ‘always-on,’ meaning it can serve application data 24 hours, 7 days a week. Couchbase Server provides the following benefits: Flexible Data Model With Couchbase Server, you use JSON documents to represent application objects and the relationships between objects. This document model is flexible enough so that you can change application objects without having to migrate the database schema, or plan for significant application downtime. Even the same type of object in your application can have different data structures.

6 dataset lists curated by data scientists November 21, 2013 Scott Haylon

Install Kitematic on Windows 10, 8, and 7 all Editions? GUI for Docker In this post, I will show you how to install Kitematic on Windows 10, 8, and 7 all editions. Kitematic is a Docker GUI that makes managing Docker containers a breeze. Previously, I have explained what Docker is and how it can make life easier for home server owners. In addition, we also showed you how to install Docker on Windows 10 64-bit Pro/Ent and other Editions, as well as on Ubuntu. Yesterday, we saw how to install SickRage in Docker from the command line.

Couchbase Developer's Guide 2.0 - Chapter 2. Modeling Documents Couchbase Server is a NoSQL document database for interactive web applications. It has a flexible data model, is easily scalable, provides consistent high performance and is “always-on,” meaning it can serve application data 24 hours, 7 days a week. Couchbase Server provides the following benefits: Flexible Data Model With Couchbase Server, you use JSON documents to represent application objects and the relationships between objects. This document model is flexible enough so that you can change application objects without having to migrate the database schema, or plan for significant application downtime. Even the same type of object in your application can have different data structures.

Unpivoting Data with Excel, Open Refine and Python "How can I unpivot or transpose my tabular data so that there's only one record per row?" I see this question a lot and I thought it was worth a quick Friday blog post. Data often aren’t quite in the format that you want.

Couchbase Server Manual 2.0 - Chapter 9. Views and Indexes - 9.5. Writing Views Couchbase Server is a NoSQL document database for interactive web applications. It has a flexible data model, is easily scalable, provides consistent high performance and is ‘always-on,’ meaning it can serve application data 24 hours, 7 days a week. Couchbase Server provides the following benefits: Flexible Data Model With Couchbase Server, you use JSON documents to represent application objects and the relationships between objects. This document model is flexible enough so that you can change application objects without having to migrate the database schema, or plan for significant application downtime.

8 cool tools for data analysis, visualization and presentation Reporters wrangle all sorts of data, from analyzing property tax valuations to mapping fatal accidents -- and, here at Computerworld, for stories about IT salaries and H-1B visas. In fact, tools used by data-crunching journalists are generally useful for a wide range of other, non-journalistic tasks -- and that includes software that's been specifically designed for newsroom use. And, given the generally thrifty culture of your average newsroom, these tools often have the added appeal of little or no cost. I came back from last year's National Institute for Computer-Assisted Reporting (NICAR) conference with 22 free tools for data visualization and analysis -- most of which are still popular and worth a look. At this year's conference, I learned about other free (or at least inexpensive) tools for data analysis and presentation. CSVKit

The NoSQL “Family Tree” A few weeks back, one of our marketing teammates caught me explaining the NoSQL product landscape to some new employees, and they thought it would make a pretty infographic. I use this diagram a lot to help customers and business partners understand some important NoSQL basics: NoSQL arose from "Big Data" (before it was called "Big Data")

Big Data, Data Mining, Predictive Analytics, Statistics, StatSoft Electronic Textbook This free ebook has been provided as a public service since 1995. The Statistics: Methods and Applications textbook offers training in the understanding and application of statistics and data mining. It covers a wide variety of applications, including laboratory research (biomedical, agricultural, etc.), business statistics, credit scoring, forecasting, social science statistics and survey research, data mining, engineering and quality control applications, and many others. The Textbook begins with an overview of the relevant elementary (pivotal) concepts and continues with a more in-depth exploration of specific areas of statistics, organized by "modules", representing classes of analytic techniques. A glossary of statistical terms and a list of references for further study are included.

A Carefully Selected List of Recommended Tools on Datavisualization.ch When I meet with people and talk about our work, I get asked a lot what technology we use to create interactive and dynamic data visualizations. At Interactive Things, we have a set of preferred libraries, applications and services that we use regularly in our work. We will select the most fitting tool for the job depending on the requirements of the project. Sometimes a really simple tool is all you need to create something meaningful. On other occasions, a more multifaceted repertoire is needed.

MapR uses some concepts that differ from those of its competitors, notably support for a native Unix file system instead of HDFS (with non-open-source components) for better performance and ease of use. Native Unix commands can be used instead of Hadoop commands. In addition, MapR differentiates itself from its competitors with high-availability features such as snapshots, mirroring, and stateful failover. The company is also spearheading the Apache Drill project, an open-source re-envisioning of Google’s Dremel that provides SQL-like queries on Hadoop data for real-time processing.

by sergeykucherov Jul 15
