December 11, 2018 | Bikas Saha | Hortonworks Engineering

Data Science & Engineering Platform: Data Lineage and Provenance for Apache Spark

December 11, 2018 | Paul Codding

What’s so great about Apache Ambari 2.7?

December 10, 2018 | Dinesh Chandrasekhar

What’s new in Hortonworks DataFlow 3.3?




This is the third in a series of data engineering blogs that we plan to publish. The first blog outlined the data science and data engineering capabilities of Hortonworks Data Platform. Motivation: Apache Spark is becoming the de facto processing framework for all kinds of complex processing including ETL, LOB business data processing and machine learning. […]

With Apache Ambari, our mission is to create and foster a 100% open source operations platform that allows teams to quickly deploy, secure, monitor and manage HDP, HDF, and our Hortonworks partner ecosystem products.  Whether you’re a customer with 5 nodes or 5,000, Apache Ambari gives you the enterprise feature set and tools needed to […]

We are excited to announce the General Availability of Hortonworks DataFlow (HDF) 3.3. Throughout 2018, the HDF releases have been focusing on making the platform more robust for taking on advanced streaming architectures. We brought about several innovations and key enhancements on the operational side as well as on the development side of the enterprise. […]

To make Big Data cloud-native, Hortonworks has unveiled the Open Hybrid Architecture Initiative. In our previous blogs, we have talked about the vision, key tenets/concepts, real-world use case, and the new storage environment of O3. We often get asked by our customers, partners, and analysts: “Hortonworks has been in the middle of the data revolution for […]

It’s been an incredible year for Hortonworks, and it’s only been possible because of our customers! We’ve seen our customers accomplish remarkable achievements across every industry. Throughout the year, we’ve featured a number of these stories, including the challenges that were being faced and the results that were achieved. Here is a look back at the […]

With Hortonworks DataFlow (HDF) 3.3 now supporting Kafka Streams, we are truly excited about the possibilities of the applications that you can benefit from when combined with the rest of our platform. In this post, we will demonstrate how Kafka Streams can be integrated with Schema Registry, Atlas and Ranger to build a set of microservices […]

In an earlier blog post, Democratizing Analytics within Kafka With 3 Powerful New Access Patterns in HDP and HDF, we discussed different access patterns that provide application developers and BI analysts with powerful new tools to implement diverse use cases where Kafka is a key component of their application architectures. In this blog, we will discuss in detail […]

With millions of consumers searching for the ideal hotel room around the world, was finding it difficult to cope with the vast amounts of data coming from its search engine marketing that could be used to enhance the customer journey. provides a service for reserving hotels, B&Bs and other types of commercial lodgings, […]

At Hortonworks, we are excited to launch the Data Hero Call for Nominations, which is now open for DataWorks Summit Barcelona, Spain and Washington, DC. We want to hear from you, our customers, and learn how you drive leadership in the industry and across the marketplace. Nominations for Barcelona are open through February 1, 2019 and for […]

There are three common abilities across the cloud providers that I want to focus on and to see how they work together and build on each other to help you maximize agility and data insights in the cloud. They are: cloud storage, running workloads on demand, and elastic resource management. In addition, we’ll talk about […]

Introducing our Storage Environment O3: Building on the last three blogs (vision, key tenets/concepts, real-world use case) in the Open Hybrid Architecture series, we now want to take a deeper dive into our Storage Environment, especially O3 (the molecular formula for Ozone). First, we want to look back at the annals of history of Hadoop. […]

Check out our recently published customer case study! This story gives a look at how TechnipFMC is enabling data analysts and data scientists to gain insights that come from the edge and quickly produce results. TechnipFMC is a global leader in oil and gas projects, technologies, systems, and services to provide their clients with deep expertise […]

One Size Does Not Fit All

This is the second in a series of data science blogs that we plan to publish. The first blog outlined the data science and data engineering capabilities of Hortonworks Data Platform. In this blog we highlight how the latest release of Apache Hive 3 can work with Apache Spark. Motivation: As enterprises embrace the value […]

This blog was co-authored by Simon K. Lutzenberger, Manager Strategic Partnerships at PTC. Today, PTC and Hortonworks announce a strategic partnership to “fast-forward” the realization of Industry 4.0 benefits including improved manufacturing quality and yield, enhanced asset and plant uptime, and optimized production flexibility and throughput. This collaboration is directed at a state-of-the-art solution […]

The AWS S3 protocol is the de facto interface for modern object stores. The Ozone-0.3.0-Alpha release adds the S3 protocol as a first-class notion to Ozone. For all practical purposes, a user of S3 can start using Ozone without any change to code or tools. A Bit of History: When we started building Ozone, there were a lot […]
