Announcements

What an exciting time for Hadoop, for the Community, and for Hortonworks. Last week, we announced our strategy around Open and Connected Data Platforms and followed up with the latest release of our flagship product, the Hortonworks Data Platform 2.4. This included the release of Apache Ambari 2.2, which will further enable enterprises to harness the […]

Today our guest blogger is Keith Manthey, CTO from EMC. As part of my job, I regularly meet with clients around their Apache Hadoop journey. I often meet executives after they have encountered a catalytic event. In one particular meeting I vividly remember, the client had suffered over 24 hours of downtime on their Hadoop […]

As Apache Spark continues to gain popularity, the rapid march of new Spark releases continues. With HDP 2.4, we are announcing the general availability of Spark 1.6, which is the latest Spark version from the community. With Spark proving an incredibly useful data access engine running on top of Hadoop, data scientists and business analysts […]

The world’s data now doubles in volume every two years. We’re living in an Age of Data fed by the Internet of Anything. Life in the Age of Data is always-on and always-connected with easy access to incredibly rich sources of analyzed information coming from the Internet, mobile devices, servers, machines, sensors, and so on. […]

Hadoop just turned 10: the first code check-in was on Feb. 2, 2006, by our very own co-founder, Owen O’Malley. I am tremendously proud to have been a part of these first 10 years, and even more excited about where this open movement is going to take us. Congratulations to everyone in the Community! We […]

During 2015, the pace of adoption of the Hortonworks Data Platform (HDP) continued to accelerate. Both existing and new customers brought massive amounts of data under management using Apache Hadoop technologies. I found it really interesting that the new requirements discussed by our customers showed much less consistency than in previous years. In particular, we heard […]

I’ve said it before and I’ll say it again: we are OPEN, we are PUBLIC and we are PROUD. Hortonworks Data Platform is 100% open source. Hortonworks Data Flow is 100% open source. Apache Metron, the incubating cybersecurity effort Hortonworks is stewarding, is 100% open source. Our strategy remains committed to 100% open, our products […]

Top 3 articles this week: (or see the whole list here) Cheat Sheet and Tips for a Custom Install of Hortonworks Data Platform like a Pro. Some great tips and tricks on helping your install go flawlessly. A must read for anyone installing HDP. Recommended reading for anyone looking at Hadoop distributions.  How to install […]

I had the pleasure of speaking at Spark Summit in New York today about accelerating the adoption of Spark by mainstream enterprises. I had to admit at the beginning of my talk that I’m an “open source addict”: over the past 12 years I’ve been blessed to have called JBoss, Red Hat, SpringSource, and Hortonworks […]

Attunity is an ISV partner of Hortonworks that provides data optimization and data integration software to help Hortonworks customers address exploding data growth and efficiently manage the performance of BI and data warehouse systems. As our guest blogger today, Carole Gunst, Marketing Director at Attunity, introduces the findings of a recent report on Data Lake Adoption, the […]

Register now for the February 25th Webinar at 10am PST/1pm EST. Data is a natural resource for insurance companies. It is acquired, exchanged, and analyzed on an unprecedented scale. Insurance brokers around the world now rely on data obtained from mobile devices, wearable devices, telematics, clickstream, social media, claims notes and diaries, call center recordings […]

This year’s Insurance Analytics USA Summit has an exciting new format with presentations and panels that focus on using data to its full potential, creating a data-conscious culture, and applying innovative modeling techniques. Sessions include “The Future of Insurance: Using Analytics to Take Advantage of the Data-Driven Age of Insurance” and “New and Big Data: […]

People have been asking us: is Google Cloud Dataflow the same thing as Hortonworks DataFlow (HDF)? So we thought we’d take the opportunity to share how we see these two products working together. Both have the word dataflow in their name, and both systems are rooted in the premise of dataflow programming, […]

This year’s Insurance Canada Technology Conference will focus on the impact of new technologies in the insurance industry. Key topics include telematics, analytics, the Internet of Things (IoT), and how these capabilities enable insurance companies to improve underwriting and reduce risk. A recent article at Strategy Meets Action identified digital transformation in the insurance industry […]

It was 10 years ago today (Feb 2) that my first patch (https://issues.apache.org/jira/browse/NUTCH-197) went into the code that two days later became Hadoop (https://issues.apache.org/jira/browse/HADOOP-1). I had been working on Yahoo Search’s WebMap, which was the back end that analyzed the web for the search engine.  We had been working on a C++ implementation of GFS and MapReduce, but […]