background preloader

Big Data, Business Analytics & BI

Big Data, Business Analytics & BI

Gapminder: Unveiling the beauty of statistics for a fact based world view. Periscope Welcome to Scriptella ETL Project QR-Code | Convirtiendo la tinta en bits Web pioneer warns of data Dark Age Vint Cerf, a "father of the internet", says he is worried that all the images and documents we have been saving on computers will eventually be lost. Currently a Google vice-president, he believes this could occur as hardware and software become obsolete. He fears that future generations will have little or no record of the 21st Century as we enter what he describes as a "digital Dark Age". Mr Cerf made his comments at a large science conference in San Jose. He arrived at the annual meeting of the American Association for the Advancement of Science stylishly dressed in a three-piece suit. I felt obliged to thank him for the internet, and he bowed graciously. His focus now is to resolve a new problem that threatens to eradicate our history. Our life, our memories, our most cherished family photographs increasingly exist as bits of information - on our hard drives or in "the cloud". "I worry a great deal about that," Mr Cerf told me. "Plainly not," Vint Cerf laughed.

Apache Solr und ElasticSearch im Streitgespräch | heise Developer Lange Zeit war Apache Solr unter den quelloffenen Suchtechniken gesetzt. Seit einiger Zeit hat das Projekt jedoch Konkurrenz durch ElasticSearch bekommen. Fürsprecher der jeweiligen Suchplattform diskutieren in einem fiktiven Interview ihre Vor- und Nachteile. Doch lassen wir zuerst den Moderator des Gesprächs zu Wort kommen. Moderator: Wenn ich mir den Open-Source-Markt im Bereich Suche und Suchtechnologie ansehe, scheinen ihn derzeit Apache Solr und ElasticSearch zu dominieren. Beide Produkte haben in den letzten 15 Monaten erhebliche Fortschritte gemacht. Beide nutzen Apache Lucene als Indexstruktur. These 1: Easy to use – nicht mit Solr! Moderator: Lassen sich Solr und ElasticSearch überhaupt miteinander vergleichen? ElasticMAN: Klar gibt es Unterscheide zwischen ElasticSearch und Solr! SolrMAN: Das gilt im Grunde auch für Solr und entspricht nur der halben Wahrheit. ElasticMAN: Bei ElasticSearch brauchst du dich erst gar nicht mit so was herumärgern, denn da ist alles schon drin.

Cumplo.cl conecta a quienes necesitan dinero con otras que están dispuesto a prestarlo. Chicago builds ETL toolkit for open data -- GCN Chicago builds ETL toolkit for open data By Stephanie KanowitzJan 16, 2015 Data officials in Chicago are churning out open datasets faster than ever by using technology rather than developers to get the job done. About a year ago, the city government embedded Pentaho Data Integration (PDI), a graphical extract-transform-load (ETL) tool with pre-built and custom components to process big data, into its OpenData ETL Utility Kit. The kit provides several utilities and a framework to help governments extract data from a database and upload it to an open data portal using automated ETL processes. Before working with PDI, city workers updated datasets manually, said Jon Levy, open data program manager at the Chicago Department of Innovation and Technology. That also meant Java developers were spending time on updates rather than writing applications that could help city workers and residents, added Tom Schenk, the city’s chief data officer. Chicago’s toolkit is free to download. About the Author

Abfragen und Schemafreiheit | heise Developer Abfragen von Statusinformation Läuft ElasticSearch, lassen sich JSON-Dokumente mit einem HTTP-POST- oder HTTP-PUT-Befehl zur Indizierung übergeben. Um beispielweise eine deutsche Post-Adresse als Dokumententyp man im Index addresses zu indizieren, reicht der unten zu sehende cURL-Befehl. Jede Ressource in ElasticSearch lässt sich über eine REST-konforme URL angesprechen. So setzt sich die URL für ein in ElasticSearch abgelegtes Dokument dabei nach folgendem Schema zusammen: ElasticSearch quittiert den Aufruf mit dem im Beispiel aufgeführten Dokument, aus dem sich der Erfolg der Operation, der Name des verwendeten Index und Dokumententyps sowie ID und Version des Dokuments entnehmen lässt. Da das Dokument mit POST und ohne Angabe einer ID übergeben wurde, erzeugt ElasticSearch selbst eine. Da jedes Dokument einen eindeutigen Bezeichner hat, lässt sich ElasticSearch je nach Anwendungsfall auch als Key-Value-Store nutzen.

Tienda On Line de Moda y Ropa | DAFITI CHILE Cassandra vs MongoDB vs CouchDB vs Redis vs Riak vs HBase vs Couchbase vs Hypertable vs ElasticSearch vs Accumulo vs VoltDB vs Scalaris comparison :: Software architect Kristof Kovacs While SQL databases are insanely useful tools, their monopoly in the last decades is coming to an end. And it's just time: I can't even count the things that were forced into relational databases, but never really fitted them. (That being said, relational databases will always be the best for the stuff that has relations.) But, the differences between NoSQL databases are much bigger than ever was between one SQL database and another. In this light, here is a comparison of Open Source NOSQL databases: The most popular ones # Redis # Best used: For rapidly changing data with a foreseeable database size (should fit mostly in memory). For example: To store real-time stock prices. Cassandra # Written in: JavaMain point: Store huge datasets in "almost" SQLLicense: ApacheProtocol: CQL3 & ThriftCQL3 is very similar to SQL, but with some limitations that come from the scalability (most notably: no JOINs, no aggregate functions.)CQL3 is now the official interface. MongoDB # ElasticSearch # CouchDB #

Quote from home page:
"Pentaho brings together IT and business users to easily access, integrate, visualize and explore all data that impacts results. Pentaho Business Analytics provides a complete solution, is fast to deploy, easy to use, and extremely cost-effective. The suite includes data access, integration, visualization, exploration and mining." by tjstoner64 Aug 11

Related: