Get fresh updates from Hortonworks by email

Once a month, receive latest insights, trends, analytics information and knowledge of Big Data.

cta

Get Started

cloud

Ready to Get Started?

Download sandbox

How can we help you?

closeClose button
November 17, 2017 | Piet Loubser | Hortonworks Case Study

Building a global data lake for International Banking

November 13, 2017 | Matt Spillar | Hortonworks Case Study

How Nissan is Harnessing Big Data to Provide Value to Customers

November 10, 2017 | Syed Mahmood | Announcements

Certification of IBM Data Science Experience (DSX) on HDP is a Win-Win for Customers

Viewing posts by: Carter Shanklin« Back to all

X
FILTERS
ALL
TECHNICAL
BUSINESS

All Topics















All Channels











CLEAR FILTERS

One of the most exciting new features of HDP 2.6 from Hortonworks was the general availability of Apache Hive with LLAP. If you missed DataWorks Summit you’ll want to look at some of the great LLAP experiences our users shared, including Geisinger who found that Hive LLAP outperforms their traditional EDW for most of their […]

Thank you for reading part 1 of a 2 part series for how to update Hive Tables the easy way.  This is part 2 of the series. Managing Slowly Changing Dimensions In Part 1, we showed how easy it is update data in Hive using SQL MERGE, UPDATE and DELETE. Let’s take things up a notch […]

This is part 1 of a 2 part series for how to update Hive Tables the easy way Historically, keeping data up-to-date in Apache Hive required custom application development that is complex, non-performant and difficult to maintain. HDP 2.6 radically simplifies data maintenance with the introduction of SQL MERGE in Hive, complementing existing INSERT, UPDATE […]

Hive / Druid integration means Druid is BI-ready from your tool of choice This is Part 3 of a Three-Part series (Part 1, Part 2) of doing ultra fast OLAP Analytics with Apache Hive and Druid. Connect Tableau to Druid Previously we talked about how the Hive/Druid integration delivers screaming-fast analytics, but there is another, even […]

THIS IS PART 2 OF A THREE-PART SERIES (Part 1, Part 3) OF DOING ULTRA FAST OLAP ANALYTICS WITH APACHE HIVE AND DRUID Modern corporations are increasingly looking for near real time analytics and insights to make actionable decisions.   To help organizations understand more about the benefits of Apache Hive and Druid, we will focus […]

This is part 1 of a three-part (Part 2, Part 3) series of doing Ultra Fast OLAP Analytics with Apache Hive and Druid. Unlock Sub-Second SQL Analytics over Terabytes of Data with Hive and Druid Modern corporations are increasingly looking for near real time analytics and insights to make actionable decisions. To fuel this, this […]

Hive View 2.0 is New in Apache Ambari 2.5 Ambari’s Hive View gives analysts and DBAs a convenient web interface to Apache Hive which allows SQL analytics, data management and performance diagnostics. Ambari 2.5 introduces Hive View 2.0 with a brand new user experience plus a slew of great new tools to help DBAs run […]

HDP 2.6 takes a huge step forward toward true data management by introducing SQL-standard ACID Merge to Apache Hive. As scalable as Apache Hadoop is, many workloads don’t work well in the Hadoop environment because they need frequent or unpredictable updates. Updates using hand-written Apache Hive or Apache Spark jobs are extremely complex.  Not only […]

Now Generally Available in HDP 2.6 Hive LLAP (Low Latency Analytical Processing) is Hive’s new architecture that delivers MPP performance at Hadoop scale through a combination of optimized in-memory caching and persistent query executors that scale elastically within YARN clusters. Hive LLAP — MPP Performance at Hadoop Scale   Since Hive LLAP was introduced as […]

The 100% open source and community driven innovation of Apache Hive 2.0 and LLAP (Long Last and Process) truly brings agile analytics to the next level. It enables customers to perform sub-second interactive queries without the need for additional SQL-based analytical tools, enabling rapid analytical iterations and providing significant time-to-value. TRY HIVE LLAP TODAY Read about […]

Apache Hive(™) is the most complete SQL on Hadoop system, supporting comprehensive SQL, a sophisticated cost-based optimizer, ACID transactions and fine-grained dynamic security. Though Hive has proven itself on multi-petabyte datasets spanning thousands of nodes many interesting use cases demand more interactive performance on smaller datasets, requiring a shift to in-memory. Hive 2 marks the […]

The need to address Business Continuity and Disaster Recovery (BCDR) concerns is well known to anyone who runs production systems. This blog introduces HBase’s new backup and restore capabilities, which give HBase the ability to perform full and incremental backups across clusters and into the cloud. When combined with real-time replication, this new incremental backup […]

The most significant new feature in Apache Hive 2, to be included in the upcoming HDP 2.5 release is a technical preview of LLAP (Live Long and Process). LLAP enables as fast as sub-second SQL analytics on Hadoop by intelligently caching data in memory with persistent servers that instantly process SQL queries. Since LLAP is […]

Are you heading to HBaseCon this year on May 24? This year HBaseCon just had too much great content to fit it all into one day, and thanks to the kind sponsorship of Salesforce we’re happy to announce that PhoenixCon, the first ever Apache Phoenix user conference will be held on the next day, May […]

Apache Ambari 2.0 User Views introduce two functional tools to help you understand and optimize your cluster resources to get the best performance in a multitenant Hadoop environment. Tez View: Understand and Optimize Jobs in your Cluster The Tez View gives you visibility into all the jobs on your cluster, allowing you to quickly identify […]