background preloader

OSS Big Data

Facebook Twitter

GEOFABRIK // Home. eNovance | Cloud & Managed Services Provider. Box plot. In descriptive statistics, box plot or boxplot is a convenient way of graphically depicting groups of numerical data through their quartiles. Box plots may also have lines extending vertically from the boxes whiskers indicating variability outside the upper and lower quartiles, hence the terms box-and-whisker plot and box-and-whisker diagram. Outliers may be plotted as individual points. This is also called a "box and whisker plot". Box plots are non-parametric: they display variation in samples of a statistical population without making any assumptions of the underlying probability distribution statistical distribution. The spacings between the different parts of the box indicate the degree of statistical dispersion (spread) and skewness in the data, and show outliers. In addition to the points themselves, they allow one to visually estimate various L-estimators, notably the interquartile range, midhinge,[[range (statistics)|range, mid-range, and trimean.

Types of boxplots[edit] if and. MapR. The C10K problem. [Help save the best Linux news source on the web -- subscribe to Linux Weekly News!] It's time for web servers to handle ten thousand clients simultaneously, don't you think? After all, the web is a big place now. And computers are big, too. You can buy a 1000MHz machine with 2 gigabytes of RAM and an 1000Mbit/sec Ethernet card for $1200 or so. In 1999 one of the busiest ftp sites, cdrom.com, actually handled 10000 clients simultaneously through a Gigabit Ethernet pipe. And the thin client model of computing appears to be coming back in style -- this time with the server out on the Internet, serving thousands of clients.

With that in mind, here are a few notes on how to configure operating systems and write code to support thousands of clients. Contents Related Sites See Nick Black's execellent Fast UNIX Servers page for a circa-2009 look at the situation. Book to Read First I/O frameworks I/O Strategies Designers of networking software have many options. 1. 2. 3. 4. LinuxThreads NPTL links: ... Business analytics and business intelligence leaders - Pentaho. KNIME | Konstanz Information Miner. Fast Analytics and Rapid-fire Business Intelligence from Tableau Software | Tableau Software. High-Performance Analytics - L'architecture Big Data de SAS.

Hadoop, MapReduce, NoSQL, Appliances… tous ces termes techniques fleurissent pour décrire le phénomène Big Data, à l'origine du Big Analytics chez SAS. Si le Peta-octet n'est pas encore l'unité de base des applications décisionnelles, on peut estimer que les données disponibles pour le monde analytique vont augmenter et se diversifier.

La capacité à valoriser et utiliser ces informations dans un laps de temps réduit est l'enjeu majeur des trois prochaines années. Le Big Data et l'Analytique : la réponse à des enjeux métiers Nous vous proposons en téléchargement gratuit un livre blanc qui énonce les utilisations et les bénéfices métiers dans différents secteurs d'activité. Ce sont autant d’exemples apportant un éclairage pertinent sur la gestion, le stockage, l'analyse et l'exploitation d'importants volumes de données réalisés avec SAS dans le contexte du Big Data. SAS® High-Performance Analytics Server : une offre dédiée Exploration visuelle des données avec SAS® Visual Analytics. Spark Cluster Computing Framework. Hadoop | Hadoop Download | Cloudera Hadoop | Cloudera.

Hortonworks | Future of big data using Apache Hadoop.