Enterprise Data Warehouse (EDW) is an organization’s central data repository that is built to support business decisions. EDW contains data related to areas that the company wants to analyze. For a manufacturer, it might be customer, product or bill of material data. EDW is built by extracting data from a number of operational systems. As the data is fed into EDW it is converted, reformatted and summarized to present a single corporate view. Data is added into the data warehouse over time in the form of snapshots and normally an enterprise data warehouse contains data spanning 5 to 10 years. A Hadoop data warehouse architecture enables deeper analytics and advanced reporting from these diverse sets of data.
Problems with a typical EDW
The Enterprise Data Warehouse has become a standard component of the corporate data architectures. However, the complexity and volume of data has posed some interesting challenges to the efficiency of existing EDW solutions.
Realizing the transformative potential of Big Data depends on the corporations’ ability to manage complexity while leveraging data sources of all types such as social, web, IoT and more. The integration of new data sources into the existing EDW system will empower corporations more and deeper analytics and insights. More importantly, EDW optimization using Hadoop provides a highly cost-efficient environment with optimal performance, scalability and flexibility.
Hortonworks Data Platform
Powerful open Hadoop data warehouse architecture with capabilities for data governance and integration, data management, data access, security and operations—designed for deep integration with your existing data center technology. Learn More
EDW offload to Hadoop - High-performance ETL software to access and easily onboard traditional enterprise data to HDP. Learn More
Expert guidance and support to quickly prove the value of your new architecture and maximize the value of the full tested and validated Hortonworks data architecture optimization solution. Learn More
EDW optimization with Apache Hadoop ®
Data can be loaded in HDP without having a data model in place
Data model can be applied based on the questions being asked of data (schema-on-read
HDP is designed to answer questions as they occur to the user
100% of the data is available at granular level for analysis
HDP can store and analyze both structured and unstructured data
Data can be analyzed in different ways to support diverse use cases
HDP (Hortonworks Data Platform) is 100% open - there is no licensing fee for software
HDP runs on commodity hardware
New data can be landed in HDP and used in days or even hours
Use-Cases on EDW Optimization
Fast BI on Hadoop
Proprietary EDW systems were adopted for fast BI and deep slice-and-dice analytics, but EDW prices are unsustainably high and these systems have not adapted to modern big data challenges like unstructured data and large-scale analytics.
Hortonworks makes fast BI on Hadoop a reality, with the combination of a fast in-memory SQL engine to create data marts with an OLAP cubing engine that lets you query huge datasets in seconds. This gives you the choice of querying pre-aggregated data for maximum performance or in full-fidelity form when the nest grains of detail are needed, allowing access from any major BI tool that supports ODBC, JDBC or MDX.
A typical EDW spends between 45 to 65 percent of its CPU cycles on ETL processing.These lower-value ETL jobs compete for resources with more business-critical workloads and can cause SLA misses. Hadoop can EDW offload these ETL jobs with minimal porting effort and at substantially lower cost, saving money and freeing up capacity on your EDW for higher-value analytical workloads. Hortonworks makes it easy by providing high-performance ETL tools, a powerful SQL engine and integration with all major BI vendors.
Increasing data volumes and cost pressures force many companies to archive old data to tape where it can’t be analyzed or must be retrieved at great expense.
A Hadoop data warehouse architecture offers cost per terabyte on par with tape backup solutions. Because of the appealing cost, you can store years of data rather than months. All of your enterprise data remains available for retrieval, query and deep analytics with the same tools you use on existing EDW systems.
You have a legacy system that no longer meet the demands of your current data needs, and replacing it isn’t an option. But don’t panic: Modernizing your traditional enterprise data warehouse is easier than you may think. Traditional data warehouses are built on a costly model: with lengthy deployment cycles, time to value can delay…
This could be the most valuable actionable intelligence you ever see. Sometimes it’s good to get back to the basics. The day-to-day queries, data ingestion and analysis, allocation of storage all consume substantial financial resources. But perhaps the most insidious resource they devour is an organization’s ability to stop and see the big picture. How…
How Customers are Optimizing their EDW for Fast, Secure, Cost Effective Actionable Insights
Businesses are striving to get the most value out of their data and turn it into actionable insights. The shift towards becoming a data-centric organization requires a modern data architecture with the ability to access all critical enterprise data at the right time. This is easier said than done. Most organizations find themselves challenged by…
Integrating Apache Hadoop with the Enterprise Data Warehouse New data architectures, new data outcomes It’s not just big data anymore--it’s gargantuan. And tiny. To keep pace with rapid data growth and an array of new data sources, future-leaning organizations are investing in data architecture optimization--augmenting their Enterprise Data Warehouse (EDW) environments with Hadoop. Why? Hadoop…
Why a Connected Data Strategy is critical to the future of your data The advent of big data revolutionized analytics and data science and created the concept of new data platforms, allowing enterprises to store, access and analyze vast amounts of historical data. The world of big data was born. But existing data platforms need…
Supercharging your Enterprise Data Warehouse Optimization with Dell EMC Ready Bundle for Hortonworks
Hadoop and its Data Analytics capabilities offer tremendous potential for deriving business insights from ever increasing stores of data, but many organizations struggle with lack of in-house skills and time to work on Hadoop design and architecture. Dell EMC and Hortonworks can help customers get past this challenge with proven and certified architectures. Dell EMC…
Enterprise Data Warehouse Optimization: 7 Keys to Success
You have a legacy system that no longer meet the demands of your current data needs, and replacing it isn’t an option. But don’t panic: Modernizing your traditional enterprise data warehouse is easier than you may think. Join us on August 1st at 11am PDT to hear from David Loshin, President of Knowledge Integrity,…
LLAP wins the fastest execution among the SQL engines! Comcast is one of the nation's leading providers of communications, entertainment and cable products and services. Headquartered in Philadelphia, PA, they employ over 100,000 employees nationwide whose goal is to deliver the highest level of service and improve the customer experience. Comcast decided to run what…
Forrester Lists Hortonworks as a Leader in Big Data Warehousing
The Enterprise Data Warehouse (EDW) has had a great run for the past several decades. But as is the norm in technology, newcomers are ready to stake their claim in this business critical environment, as illustrated in Forrester’s newly released The Forrester Wave™: Big Data Warehouse, Q2 2017 report. Hortonworks delivers a viable open source…
Announcing the availability of Dell EMC Ready Bundle for Hortonworks Hadoop
Last week at Dataworks Summit, Dell EMC released the Dell EMC Ready Bundle for Hortonworks Hadoop. Dell EMC and Hortonworks brings together industry leading solutions for enterprise-ready open data platforms and modern data applications, helping our customers Modernize, Automate and Transform how they deliver IT services. The goal of these solutions is to help businesses…
Verizon Centralizes Data into a Data Lake in Real Time for Analytics
Verizon Global Technology Services (GTS) was challenged by a multi-tier, labor-intensive process when trying to migrate data from disparate sources into a data lake to create financial reports and business insights. Join experts from Verizon GTS, Attunity and Hortonworks on June 8th at 11:00 a.m. PT/2:00 p.m. ET to learn more about how Verizon: Easily…
Hive / Druid integration means Druid is BI-ready from your tool of choice This is Part 3 of a Three-Part series of doing ultra fast OLAP Analytics with Apache Hive and Druid. Connect Tableau to Druid Previously we talked about how the Hive/Druid integration delivers screaming-fast analytics, but there is another, even more powerful benefit to…
Ultra-fast OLAP Analytics with Apache Hive and Druid – Part 1 of 3
This is part 1 of a three-part series of doing Ultra Fast OLAP Analytics with Apache Hive and Druid. Unlock Sub-Second SQL Analytics over Terabytes of Data with Hive and Druid Modern corporations are increasingly looking for near real time analytics and insights to make actionable decisions. To fuel this, this blog introduces Ultra fast…
Accelerate Business Outcomes With A Connected Data Strategy
Join experts from Hortonworks and our guest from Forrester to learn how a next-gen connected data architecture can help accelerate time to value for your big data initiatives. Data today is a foundation for most businesses to drive better customer experience, better products and improved operational efficiencies. Growing adoption of cloud applications and platforms, in…
Apache, Hadoop, Falcon, Atlas, Tez, Sqoop, Flume, Kafka, Pig, Hive, HBase, Accumulo, Storm, Solr, Spark, Ranger, Knox, Ambari, ZooKeeper, Oozie, Phoenix, NiFi, HAWQ, Zeppelin, Atlas, Slider, Mahout, MapReduce, HDFS, YARN, Metron and the Hadoop elephant and Apache project logos are either registered trademarks or trademarks of the Apache Software Foundation in the United States or other countries.