cta

Get Started

cloud

Ready to Get Started?

Download sandbox

How can we help you?

closeClose button

Announcements

I’ve said it before and I’ll say it again, we are OPEN, we are PUBLIC and we are PROUD.  Hortonworks Data Platform is 100% open source. Hortonworks Data Flow is 100% open source. Apache Metron, the incubating cybersecurity effort Hortonworks is stewarding, is 100% open source. Our strategy remains committed to 100% open, our products […]

Top 3 articles this week: (or see the whole list here) Cheat Sheet and Tips for a Custom Install of Hortonworks Data Platform like a Pro. Some great tips and tricks on helping your install go flawlessly. A must read for anyone installing HDP. Recommended reading for anyone looking at Hadoop distributions.  How to install […]

I had the pleasure to speak at Spark Summit in New York today about accelerating the adoption of Spark by mainstream enterprises. I had to admit at the beginning of my talk that I’m an “open source addict” — over the past 12 years I’ve been blessed to have called JBoss, Red Hat, SpringSource, and Hortonworks […]

Attunity is an ISV partner of Hortonworks focused provides data optimization and data integration software that helps Hortonworks customers address exploding data growth and efficiently manage the performance of BI and data warehouse systems. As our guest blogger today, Carole Gunst, Marketing Director at Attunity, introduces the findings of a recent report on Data Lake Adoption, the […]

Register now for the February 25th Webinar at 10am PST/1pm EST. Data is a natural resource for insurance companies. It is acquired, exchanged, and analyzed on an unprecedented scale. Insurance brokers around the world now rely on data obtained from: Mobile devices Wearable devices Telematics Clickstream Social media Claims notes and diaries Call center recordings […]

This year’s Insurance Analytics USA Summit has an exciting new format with presentations and panels that focus on using data to its full potential, creating a data-conscious culture, and applying innovative modeling techniques. Sessions include “The Future of Insurance: Using Analytics to Take Advantage of the Data-Driven Age of Insurance” and “New and Big Data: […]

People have been asking us – Is Google Cloud Dataflow the same thing as Hortonworks DataFlow (HDF)? So we thought we’d take the opportunity to share with you how we see these two products work together. Both have the word dataflow in their name, and both systems are rooted in the premise of dataflow programming, […]

This year’s Insurance Canada Technology Conference will focus on the impact of new technologies in the insurance industry. Key topics include telematics, analytics, the Internet of Things (IoT), and how these capabilities enable insurance companies to improve underwriting and reduce risk. A recent article at Strategy Meets Action identified digital transformation in the insurance industry […]

It was 10 years ago today (Feb 2) that my first patch (https://issues.apache.org/jira/browse/NUTCH-197) went into the code that two days later became Hadoop (https://issues.apache.org/jira/browse/HADOOP-1). I had been working on Yahoo Search’s WebMap, which was the back end that analyzed the web for the search engine.  We had been working on a C++ implementation of GFS and MapReduce, but […]

Hortonworks® Continues European Expansion, Opens for Business in Ireland I’m very excited about our announcement earlier today that we have opened another European office by expanding our operations with a new office in Cork, Ireland. Building on our presence at the heart of London, we are continuing to grow internationally in Europe and beyond as […]

A couple of months ago I joined Hortonworks. There was an undeniable pull to go into the fire of crazy fast innovation and growth. About four seconds in, I realized there was so much more than just the pace of execution and growth but rather a bigger opportunity to be a part of something game-changing. […]

We take pride in producing valuable technical blogs and sharing them with a wider audience. Of all the blogs published in 2015 on our website, the following were most popular: Take a look at 5 techniques enabling Hive to support both batch and interactive workloads at speed and scale. 5 Ways to Make Your Hive Queries […]

Santa will be busy this year. On December 24th he’s scheduled to deliver presents to billions of children globally. Buddy and the Keeblers will be working overtime to meet the demand, and Santa has called in temp work from Legolas and Dobby. There’s little doubt that Santa is a master of lean manufacturing, but there’s […]

We are pleased to announce that the 2nd release of Hortonworks DataFlow is now available. Hortonworks DataFlow is a data-source agnostic, real time data collection and dataflow management platform designed to meet the practical challenges of collecting and moving data securely and efficiently. HDF 1.1 builds on the strength of the initial GA version of […]

We are in the midst of the third industrial revolution, driven by IoT and Big Data analytics. This is a fundamental blurring of boundaries between the physical and digital worlds, which has resulted in disruptive new business models. Register now for the Webinar on Thursday, December 10th , at 11:00am PST, with guest speakers Frank […]