background preloader

Databases

Facebook Twitter

The MySQL 5 series introduced quite a few changes.

Triggers and stored procedures were two of the big ticket items. One of the lesser known additions, at least from the amount of writing on the subject, is the introduction of views. Modern Telecom Architectures Built with Hadoop. This is the third in our series on modern data architectures across industry verticals.

Modern Telecom Architectures Built with Hadoop

Others in the series are: Many of the world’s largest telecommunications companies use Hortonworks Data Platform (HDP) to manage their data. Through partnership with these companies, we have learned how our customers use HDP to improve customer satisfaction, make better infrastructure investments and develop new products. Hortonworks partner Teradata recently gave some use case examples in this video about how Verizon Wireless uses Teradata in combination with Hortonworks Data Platform to keep their customer churn below 1%.

Rob Smith, Verizon Wireless’ Executive Director for IT, describes how his team uses their discovery platform to improve customer interactions, by: Identifying better ways to communicate with customers about their paymentsAnalyzing social media to better understanding customer sentiments about Verizon policy changesTailoring marketing communications to each customer’s individual needs. Hadoop MapReducers in .NET - Making C# and VB First Class Citizens in Hadoo. We Recommend These Resources Hadoop was written in Java.

Hadoop MapReducers in .NET - Making C# and VB First Class Citizens in Hadoo

So it makes sense that Java will always be Hadoop's best friend. Hadoop streaming is great tool that allows interoperability with other languages, but this still doesn't make other languages a 'first-class citizen' when it comes to working with Hadoop. Microsoft thought it could unlock the big data world for the .NET developers with a project called Dryad, but no one ever used it. So they put their backing behind Hadoop as well. Map-reduce explained. Outside of functional programming context, map-reduce refers to a technique for processing data.

Map-reduce explained

Thanks to properties of map and reduce operations, computations which can be expressed using them can be highly parallelised, which allows for faster processing of high volumes of data. If you've ever wondered how tools such as Apache Hadoop work, you're at the right page. How Infochimps wants to become Heroku for Hadoop. What it really means when someone says 'Hadoop' The NoSQL movement. In a conversation last year, Justin Sheehy, CTO of Basho, described NoSQL as a movement, rather than a technology.

The NoSQL movement

This description immediately felt right; I’ve never been comfortable talking about NoSQL, which when taken literally, extends from the minimalist Berkeley DB (commercialized as Sleepycat, now owned by Oracle) to the big iron HBase, with detours into software as fundamentally different as Neo4J (a graph database) and FluidDB (which defies description). But what does it mean to say that NoSQL is a movement rather than a technology? We certainly don’t see picketers outside Oracle’s headquarters. Justin said succinctly that NoSQL is a movement for choice in database architecture. There is no single overarching technical theme; a single technology would belie the principles of the movement. Think of the last 15 years of software development. Since the ’80s, the dominant back end of business systems has been a relational database, whether Oracle, SQL Server or DB2.

MapReduce Patterns, Algorithms and Use Cases. In this article I digested a number of MapReduce patterns and algorithms to give a systematic view of the different techniques that can be found on the web or scientific articles.

MapReduce Patterns, Algorithms and Use Cases

Several practical case studies are also provided. All descriptions and code snippets use the standard Hadoop’s MapReduce model with Mappers, Reduces, Combiners, Partitioners, and sorting. This framework is depicted in the figure below. MapReduce Framework. NoSQL Data Modeling Techniques. NoSQL databases are often compared by various non-functional criteria, such as scalability, performance, and consistency.

NoSQL Data Modeling Techniques

This aspect of NoSQL is well-studied both in practice and theory because specific non-functional properties are often the main justification for NoSQL usage and fundamental results on distributed systems like the CAP theorem apply well to NoSQL systems. Dempsy – a New Real-time Framework for Processing BigData. Real time processing of BigData seems to be one of the hottest topics today.

Dempsy – a New Real-time Framework for Processing BigData

Nokia has just released a new open-source project - Dempsy. Dempsy is comparable to Storm, Esper, Streambase, HStreaming and Apache S4. The code is released under the Apache 2 license Dempsy is meant to solve the problem of processing large amounts of "near real time" stream data with the lowest lag possible; problems where latency is more important that "guaranteed delivery.

" This class of problems includes use cases such as: Real time monitoring of large distributed systems Processing complete rich streams of social networking data Real time analytics on log information generated from widely distributed systems Statistical analytics on real-time vehicle traffic information on a global basis. NoSQL has been one of the most talked about topics over the past couple of months.

This tutorial will introduce you to CouchDB, a NoSQL implementation and teach you how to get started with the platform. What is NoSQL? NoSQL is schema free -- you don't need to decide the structure up front. Guide to MongoDB for startups. Introduction to MongoDB for Java, PHP and Python Developers. Introduction to NoSQL Architecture with MongoDB for Java, PHP and Python Developers This article covers using MongoDB as a way to get started with NoSQL.

Introduction to MongoDB for Java, PHP and Python Developers

It presents an introduction to considering NoSQL, why MongoDB is a good NoSQL implementation to get started with, MongoDB shortcommings and tradeoffs, key MongoDB developer concepts, MongoDB architectural concepts (sharding, replica sets), using the console to learn MongoDB, and getting started with MongoDB for Python, Java and PHP developers. The article uses MongoDB, but many concepts introduced are common in other NoSQL solutions. SQLite Programming. In any language, SQLite has some very straightforward interfaces.

This is a description of the Javascript variant, but the core SQLite implementation in C is not very different. Other languages just build on the C interface. Here are the basic functions: “openDatabase” creates a new SQLite database file or opens an existing one. It returns a handle to that open database. “executeSql” executes the SQLite statement. These are the basic interfaces. Chances are, you know what a database trigger is, at least in conceptual terms. Chances are even greater that you know that MySQL supports triggers and has supported them for quite some time.