
Viewing posts by: Alan Gates


Time is running out to secure your spot at DataWorks Summit/Hadoop Summit. With over 170 sessions featuring top organizations using open source technologies to leverage their data and drive predictive analytics, distributed deep-learning, and artificial intelligence initiatives, you don’t want to miss the industry’s premier event. Join us June 13 – 15 in San Jose and save […]

Comcast. Verizon. Netflix. Walgreens. PayPal. Ford Motor Company. Wells Fargo. LinkedIn. Uber. These and other organizations like them will be at this year’s DataWorks Summit/Hadoop Summit. Open source experts will present over 170 sessions. They’ll share how they use open source technologies to leverage their data and drive predictive analytics, distributed deep-learning, and artificial intelligence initiatives. There’s still time for you to […]

There were a lot of great activities and sessions at the recent Apache: Big Data North America in Vancouver, B.C. I enjoyed the technical level of the sessions and meeting others who contribute to projects in the Apache Software Foundation (ASF). The sessions I went to had a high level of interesting technical content, with […]

In August 2009, the Facebook Data Infrastructure Team published a white paper that outlined a warehousing solution over Hadoop. They called it Hive. Since that time, this project has not only emerged as the de facto standard for SQL in Hadoop, but with the help of the Stinger initiative it has progressed from a batch […]

In April of this year, Hortonworks, along with the broad Hadoop community, delivered the final phase of the Stinger Initiative on schedule, completing the work to bring interactive SQL queries to Apache Hive. The original directive of Stinger was about advancing SQL capabilities at petabyte scale in pure open source. And over 13 months, 145 […]

The release of Hive 0.11 is exciting and represents a big step toward delivering Project Stinger and SQL-in-Hadoop. There is still some work to be done, however. We look forward to the delivery of Hadoop 2 with YARN and the Apache Tez project as huge boosts to Hive performance, but this is not […]

Written with Vinod Kumar Vavilapalli and Gopal Vijayaraghavan. A few weeks back we blogged about the Stinger Initiative and made a promise to work within the open community to make Apache Hive 100 times faster for SQL interaction with Hadoop. We have a broad set of scenarios queued up for testing but are so excited about […]

UPDATE: Since this article was posted, the Stinger initiative has continued to drive toward the goal of 100x faster Hive. You can read the latest information at https://hortonworks.com/stinger. Introduced by Facebook in 2007, Apache Hive and its HiveQL interface have become the de facto SQL interface for Hadoop. Today, companies of all types and sizes […]

In case you didn’t see the news, I wanted to share the announcement that HCatalog 0.4.0 is now available. For those of you who are new to the project, HCatalog provides a metadata and table management system that simplifies data sharing between Apache Hadoop and other enterprise data systems. You can learn more about the project […]