Enterprise Data Warehouse (EDW) is an organization’s central data repository that is built to support business decisions. EDW contains data related to areas that the company wants to analyze. For a manufacturer, it might be customer, product or bill of material data. EDW is built by extracting data from a number of operational systems. As the data is fed into EDW it is converted, reformatted and summarized to present a single corporate view. Data is added into the data warehouse over time in the form of snapshots and normally an enterprise data warehouse contains data spanning 5 to 10 years. A Hadoop data warehouse architecture enables deeper analytics and advanced reporting from these diverse sets of data.
Problems with a typical EDW
The Enterprise Data Warehouse has become a standard component of the corporate data architectures. However, the complexity and volume of data has posed some interesting challenges to the efficiency of existing EDW solutions.
Realizing the transformative potential of Big Data depends on the corporations’ ability to manage complexity while leveraging data sources of all types such as social, web, IoT and more. The integration of new data sources into the existing EDW system will empower corporations more and deeper analytics and insights. More importantly, EDW optimization using Hadoop provides a highly cost-efficient environment with optimal performance, scalability and flexibility.
Hortonworks Data Platform
Powerful open Hadoop data warehouse architecture with capabilities for data governance and integration, data management, data access, security and operations—designed for deep integration with your existing data center technology. Learn More
EDW offload to Hadoop - High-performance ETL software to access and easily onboard traditional enterprise data to HDP. Learn More
Expert guidance and support to quickly prove the value of your new architecture and maximize the value of the full tested and validated Hortonworks data architecture optimization solution. Learn More
EDW optimization with Apache Hadoop ®
Data can be loaded in HDP without having a data model in place
Data model can be applied based on the questions being asked of data (schema-on-read
HDP is designed to answer questions as they occur to the user
100% of the data is available at granular level for analysis
HDP can store and analyze both structured and unstructured data
Data can be analyzed in different ways to support diverse use cases
HDP (Hortonworks Data Platform) is 100% open - there is no licensing fee for software
HDP runs on commodity hardware
New data can be landed in HDP and used in days or even hours
Use-Cases on EDW Optimization
Fast BI on Hadoop
Proprietary EDW systems were adopted for fast BI and deep slice-and-dice analytics, but EDW prices are unsustainably high and these systems have not adapted to modern big data challenges like unstructured data and large-scale analytics.
Hortonworks makes fast BI on Hadoop a reality, with the combination of a fast in-memory SQL engine to create data marts with an OLAP cubing engine that lets you query huge datasets in seconds. This gives you the choice of querying pre-aggregated data for maximum performance or in full-fidelity form when the nest grains of detail are needed, allowing access from any major BI tool that supports ODBC, JDBC or MDX.
A typical EDW spends between 45 to 65 percent of its CPU cycles on ETL processing.These lower-value ETL jobs compete for resources with more business-critical workloads and can cause SLA misses. Hadoop can EDW offload these ETL jobs with minimal porting effort and at substantially lower cost, saving money and freeing up capacity on your EDW for higher-value analytical workloads. Hortonworks makes it easy by providing high-performance ETL tools, a powerful SQL engine and integration with all major BI vendors.
Increasing data volumes and cost pressures force many companies to archive old data to tape where it can’t be analyzed or must be retrieved at great expense.
A Hadoop data warehouse architecture offers cost per terabyte on par with tape backup solutions. Because of the appealing cost, you can store years of data rather than months. All of your enterprise data remains available for retrieval, query and deep analytics with the same tools you use on existing EDW systems.
How Customers are Optimizing their EDW for Fast, Secure, Cost Effective Actionable Insights
Businesses are striving to get the most value out of their data and turn it into actionable insights. The shift towards becoming a data-centric organization requires a modern data architecture with the ability to access all critical enterprise data at the right time. This is easier said than done. Most organizations find themselves challenged by…
Accelerating Big Data Insights with Dell EMC Ready Bundles for Hortonworks
Hadoop’s data analytics capabilities offer tremendous potential for deriving new and differentiated business insights. But, many organizations get bogged down with the DIY infrastructure decisions and fail to keep up with the evolving needs of their business. Dell EMC and Hortonworks can help organizations get past this challenge with proven and certified architectures which allow…
Exploring the Heated-and Completely Unnecessary- Data Lake Debate
When it comes to the data lakes and data warehouses, there’s no shortage of controversy: Is one better than the other? The real answer is, there’s no need for heated debate—a data lake actually complements the data warehouse. Integrating a data lake with your EDW is really just an evolution of architecture that can provide…
Using Big Data & Hive 2 with LLAP At Geisinger Health System
Big Wins in a Short Time with HDP & Hive 2 with LLAP Geisinger Health System is well known in the healthcare community as a pioneer in data and analytics. They were one of the first adopters of Electronic Health Record (EHR) in 1996 and went with Epic. In addition, they used an Enterprise Data…
Enterprise Data Warehouse Optimization: 7 Keys to Success
You have a legacy system that no longer meet the demands of your current data needs, and replacing it isn’t an option. But don’t panic: Modernizing your traditional enterprise data warehouse is easier than you may think. Join us on August 1st at 11am PDT to hear from David Loshin, President of Knowledge Integrity,…
LLAP wins the fastest execution among the SQL engines! Comcast is one of the nation's leading providers of communications, entertainment and cable products and services. Headquartered in Philadelphia, PA, they employ over 100,000 employees nationwide whose goal is to deliver the highest level of service and improve the customer experience. Comcast decided to run what…
You have a legacy system that no longer meet the demands of your current data needs, and replacing it isn’t an option. But don’t panic: Modernizing your traditional enterprise data warehouse is easier than you may think. Traditional data warehouses are built on a costly model: with lengthy deployment cycles, time to value can delay…
Forrester Lists Hortonworks as a Leader in Big Data Warehousing
The Enterprise Data Warehouse (EDW) has had a great run for the past several decades. But as is the norm in technology, newcomers are ready to stake their claim in this business critical environment, as illustrated in Forrester’s newly released The Forrester Wave™: Big Data Warehouse, Q2 2017 report. Hortonworks delivers a viable open source…
Announcing the availability of Dell EMC Ready Bundle for Hortonworks Hadoop
Last week at Dataworks Summit, Dell EMC released the Dell EMC Ready Bundle for Hortonworks Hadoop. Dell EMC and Hortonworks brings together industry leading solutions for enterprise-ready open data platforms and modern data applications, helping our customers Modernize, Automate and Transform how they deliver IT services. The goal of these solutions is to help businesses…
Which Big Data solution will help you make a big difference?
See the Forrester Report that rates the leaders. How do enterprise architecture professionals make the right choice for their enterprise data warehouse strategy? A recent Forrester Wave Report looked at Big Data warehouse vendors to see how they measured up in various categories, including deployment, support, roadmap and partners. Hortonworks was ranked as a leader…
Verizon Centralizes Data into a Data Lake in Real Time for Analytics
Verizon Global Technology Services (GTS) was challenged by a multi-tier, labor-intensive process when trying to migrate data from disparate sources into a data lake to create financial reports and business insights. Join experts from Verizon GTS, Attunity and Hortonworks on June 8th at 11:00 a.m. PT/2:00 p.m. ET to learn more about how Verizon: Easily…
The data-driven mindset. Becoming a data-driven enterprise starts with a particular mindset—a culture of approaching decisions objectively, getting away from anecdotal evidence and “gut” decisions, and trusting that your data will provide the right answers. Read the newest eBook from Hortonworks and TDWI “EDW Modernization: Becoming a Data-Driven Enterprise” and learn what it means to…
Hive / Druid integration means Druid is BI-ready from your tool of choice This is Part 3 of a Three-Part series (Part 1, Part 2) of doing ultra fast OLAP Analytics with Apache Hive and Druid. Connect Tableau to Druid Previously we talked about how the Hive/Druid integration delivers screaming-fast analytics, but there is another, even…
This could be the most valuable actionable intelligence you ever see. Sometimes it’s good to get back to the basics. The day-to-day queries, data ingestion and analysis, allocation of storage all consume substantial financial resources. But perhaps the most insidious resource they devour is an organization’s ability to stop and see the big picture. How…
Apache, Hadoop, Falcon, Atlas, Tez, Sqoop, Flume, Kafka, Pig, Hive, HBase, Accumulo, Storm, Solr, Spark, Ranger, Knox, Ambari, ZooKeeper, Oozie, Phoenix, NiFi, HAWQ, Zeppelin, Atlas, Slider, Mahout, MapReduce, HDFS, YARN, Metron and the Hadoop elephant and Apache project logos are either registered trademarks or trademarks of the Apache Software Foundation in the United States or other countries.