
HANA and Hadoop


Big Data with SAP Sybase IQ and Hadoop.

Streaming Data to Hadoop and HANA. For those who are interested in Hadoop and HANA, I've recently created a prototype that attempts to leverage some of the key strengths of the two companion solutions to deal with Big Data challenges. The term ‘Big Data’ is commonly used these days, but is perhaps best reflected by the high volumes of data generated every minute by social media, Web logs and remote sensing/POS equipment. Depending on the source, it probably doesn't make sense to stream all of this data into HANA; instead it may be better to store in HANA only the subset most relevant for analytic reporting. [As an example, Twitter alone may generate hundreds of thousands of tweets a minute (high volume, low value).]

The following diagram illustrates an example of how 'Big Data' might flow to HANA via Hadoop. The key point is that I use Hadoop Flume to establish a connection to Twitter (via the Twitter4J API) and then store the details of each tweet in HBase, while simultaneously sending a subset of the fields to HANA via server-side JavaScript.
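The split described above (full tweet details kept on the Hadoop side, a few fields forwarded to HANA) can be sketched roughly as follows. This is only an illustration, not the prototype's code: the field names in `HANA_FIELDS` and the sample tweet are assumptions, and the actual writes to HBase and HANA are left out.

```python
import json
from typing import Tuple

# Hypothetical field names; a real tweet payload carries many more fields.
HANA_FIELDS = ("id", "created_at", "user", "text")


def split_tweet(raw_json: str) -> Tuple[dict, dict]:
    """Return (full_record, hana_subset) for one tweet.

    full_record stands for what would be stored in HBase;
    hana_subset holds only the fields assumed relevant for reporting.
    """
    tweet = json.loads(raw_json)
    subset = {key: tweet[key] for key in HANA_FIELDS if key in tweet}
    return tweet, subset


# Made-up sample record, used only to exercise the function.
sample = json.dumps({
    "id": 1,
    "created_at": "2013-01-01T00:00:00Z",
    "user": "example_user",
    "text": "hello big data",
    "retweet_count": 3,
    "lang": "en",
})

full, subset = split_tweet(sample)
print(sorted(full))    # every field is kept for the Hadoop side
print(sorted(subset))  # only the reporting fields go to HANA
```

Keeping the field list in one place makes it easy to change what is forwarded to HANA without touching the code that stores the full record.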

SAP: HANA to work with Hadoop for Big Data in future.

SAP + Hortonworks = Instant Access + Infinite Scale with HANA + Hadoop. SAP® provides a best-in-class portfolio of databases, information management solutions, analytic tools, and analytic applications. The strategic relationship between Hortonworks and SAP enables SAP to resell Hortonworks Data Platform (HDP) and provide enterprise support for its global customer base. This means SAP customers can incorporate enterprise Hadoop as a complement within a data architecture that includes SAP HANA and SAP BusinessObjects, enabling a broad range of new analytic applications.

SAP HANA + Hadoop = Instant access + Infinite scale. By using SAP HANA and Hadoop together, customers get the power of instant access with SAP HANA and infinite scale with Hadoop.

This gives SAP users a broad range of options for storing and analyzing new types of data, and the ability to create applications, not previously possible, that can uncover new business opportunities from vast amounts of data. Integrated technologies ease adoption. Integrated support.

Hadoop | SAP. In today’s enterprise, Apache Hadoop has found a place alongside established databases and data warehouses. The Apache Hadoop software library is an open-source framework that allows for the distributed processing of large data sets across clusters of computers. With Apache Hadoop, you can:

- Store massive amounts of modern data (social media, click-stream data, Web logs, sensor data, etc.) cost-effectively across commodity servers with the Hadoop Distributed File System (HDFS)
- Distribute data processing across clusters of coordinated nodes using MapReduce, a batch processing paradigm, to scale up or down without system interruption
- Manage resources using YARN, the Hadoop operating system, to handle non-MapReduce workloads more effectively

SAP is committed to providing choice among Hadoop distributions.
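As a rough illustration of the MapReduce paradigm mentioned above, the following single-process sketch counts words in a few made-up lines. It only mimics the map, shuffle and reduce steps in plain Python; it does not use Hadoop, and the function names are chosen here for illustration.

```python
from collections import defaultdict
from itertools import chain


def map_phase(line):
    """Map step: emit a (word, 1) pair for every word in one input line."""
    return [(word.lower(), 1) for word in line.split()]


def shuffle(pairs):
    """Shuffle step: group all emitted values by their key."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(key, values):
    """Reduce step: combine the values of one key into a single count."""
    return key, sum(values)


# Made-up input; on a cluster each line could be mapped on a different node.
lines = ["big data on hadoop", "hadoop and hana", "big data in hana"]

mapped = chain.from_iterable(map_phase(line) for line in lines)
counts = dict(reduce_phase(key, values) for key, values in shuffle(mapped).items())
print(counts)
```

Because each line is mapped independently and each key is reduced independently, the same structure can be spread across many nodes, which is what lets the batch job scale with the cluster.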

SAP HANA platform for Big Data + Hadoop. By using the SAP HANA platform and Hadoop together, you combine instant results with infinite storage for real-time insight into Big Data.

SAP HANA and Hadoop in the Cloud. Like most media organizations around the world, The Globe and Mail, headquartered in Toronto, has struggled to make a profitable transition from physical newspapers to online journalism.

But now a combination of Hadoop and SAP HANA in the cloud is helping make critical decisions about how and when to charge readers for online access to articles. In print for 167 years, The Globe is Canada’s largest newspaper, with over 300 journalists covering national, international, business, technology, arts, entertainment and lifestyle news for around 3.5 million readers a week across the country.

Over the last decade, the company has invested in comprehensive data gathering and analysis systems, starting with SAP ERP in 2002 and a full enterprise data warehouse using SAP BW in 2007. In early 2012, data analysis became an urgent business priority because of the company’s paywall project. But that didn’t solve all the analysts’ problems. Yang explains: “The result is a whole lot of numbers.