cta

Get Started

cloud

Ready to Get Started?

Download sandbox

How can we help you?

closeClose button

The Hortonworks Blog

Please join Hortonworks for the GENIVI Demonstration Showcase event on 27 April in Paris, France. The showcase will be all about Connected Data Platforms for Connected Cars and will run from 5:00pm to 8:30pm, hosted by Grant Bodley, GM Global Automotive & Manufacturing Solutions at Hortonworks. You can register here. Be sure to visit Hortonworks […]

Blog post by Vamsi Chemitiganti, General Manager-Financial Services, Hortonworks and Joe Gillespie, Anti-Money Laundering Leader, Booz Allen Hamilton Now more than ever, advanced analytic technology is a necessity in creating an effective anti-money laundering (AML) compliance program. As data volumes grow from a high number of customers using multiple channels (including ATM kiosks, online and branch […]

The Financial Services industry is undergoing a major transformation. Innovation in data technologies is driving growth of predictive analytics and data mining techniques that will dramatically change banking over the next few years. This is the second in a series of posts that will describe that transformation. The first can be found here. This second […]

by Tendu Yogurtcu, PhD – General Manager of Big Data, Syncsort This week, Hortonworks announced an exciting expansion of our long-standing partnership. Hortonworks will now resell Syncsort’s leading Hadoop data integration software, DMX-h for onboarding ETL processing in Hadoop. DMX-h will enable our joint customers to easily access and collect data from a diverse set […]

The Hortonworks Connected Data Platform, serves as the unifying system for enterprises looking to power data processing and predictive analytic applications that leverage both Data In Motion along with Data at Rest with agility and flexibility. The Data in motion capability is provided by Hortonworks Data Flow (HDF), which is powered by Apache NiFi. HDF […]

Public Preview – Apache Atlas and Apache Ranger are now integrated to drive dynamic classification-based security    Why Governance and Security are better together? How do you keep track of large number of diverse data objects (think hundred thousand data entities) in your data lake that continue to increase every day. Now that Apache Hadoop […]

Hortonworks is proud and committed to being 100% open, we break down silos, push boundaries and enable the entire ecosystem to flourish and innovate (read Shaun Connolly’s blog). That belief extends to our commitment with Open Data Platform initiative (ODPi) as well, we are proud to be part of ODPi because it operates under an open governance model […]

Hadoop just turned 10, the first code check-in was on Feb. 2, 2006 by our very own co-founder, Owen O’Malley. I am tremendously proud to have been a part of this first 10 years, and even more excited on where this open movement is going to take us. Congratulations to everyone in the Community! We […]

I’ve said it before and I’ll say it again, we are OPEN, we are PUBLIC and we are PROUD.  Hortonworks Data Platform is 100% open source. Hortonworks Data Flow is 100% open source. Apache Metron, the incubating cybersecurity effort Hortonworks is stewarding, is 100% open source. Our strategy remains committed to 100% open, our products […]

This is another great European customer guest blog post  authored by Joan Viladrosa, Tech Lead & Senior Big Data Engineer at Billy Mobile. You can hear more about their solution by joining our live webcast February 23rd. Register here.   About Billy Mobile As a mobile ad exchange with a large marketplace of direct publishers […]

We started Hortonworks Community Connection at the end of 2015, and there is some amazing content that any data developer or data administrator should read and bookmark. I will publish this blog weekly and highlight the top technical articles that are on HCC based on community activity and votes.  Top 3 articles on the site:  Sample […]

Hortonworks has achieved quite a bit of success with online dating. Personally, I haven’t just yet, but hey it warms my heart to think about all those that we’ve helped bring together. Valentine’s Day is upon us and so I wanted to launch this Cupid’s arrow with a missive about how Hortonworks Data Platform (HDP) […]

Author: Michael Bironneau, Data Scientist, Open Energi At Open Energi, we think of our service as an automated, virtual power station. Whenever the electric grid experiences sudden, unforeseen surges in supply or demand, assets under the control of our Dynamic Demand algorithm automatically pick up the slack – just like a power station would but cheaper […]

A Beginners Guide to Becoming an Apache Contributor Venkatesh Sellappa, Teradata My name is Venkatesh Sellappa. My background is primarily application of analytics in the Big Data Space, before either of them was called that. We used to just call it programming. My session is an account of my personal journey into the often contentious […]

Advanced Execution Visualization of Spark jobs Author: Zoltán Zvara, Márton Balassi, András Garzó, Hungarian Academy of Sciences in collaboration with Ericsson Understanding the physical plan of a big data application is often crucial for tracking down bottlenecks and faulty behavior. Apache Spark although offering useful Web UI component for monitoring and understanding the logical plan […]