Hadoop Insights

I decided to take a break from my Cybersecurity Architecture series and CISO’s View series to give my thoughts on this year’s RSA conference while things are still fresh. First off, I enjoyed meeting with old colleagues and many security people whom I respect, which justified the trip as far as I’m concerned. I’m really amazed […]

With the introduction of the Hortonworks Data Cloud (HDCloud), deploying clusters and starting to process data has become an order of magnitude faster. When Apache Hadoop evolved from being an on-premises solution to a cloud-based solution, the time it took to create a cluster went from weeks to days. The same magnitude of […]

Hortonworks has achieved quite a bit of success with online dating. Personally, I haven’t just yet, but hey it warms my heart to think about all those that we’ve helped bring together. Valentine’s Day is upon us and so I wanted to launch this cupid’s arrow with a missive about how Hortonworks Data Platform (HDP) […]

It is that time of year again, right before Christmas in Las Vegas, where nearly 30,000 technologists gather to see the latest in innovation around the Cloud. Hortonworks is honored to participate as an exhibitor for the first time. If you are in Vegas this week for the AWS re:Invent, please stop by our booth #2732 […]

Provenance, Lineage & Chain of Custody

The models of Provenance, Lineage and Chain of Custody are used in fine art to determine when a piece was created, the sequence of locations where it was held, how it was touched along the way, and who has owned it since creation, all with the purpose of authenticating the piece. […]

A guest blog post from Scott Schlesinger, Principal, America’s EY Advisory

EY and Hortonworks formed a strategic business alliance in August 2015 that is focused on helping our valued clients turn big data challenges into big business opportunities. Recognizing that big data is transforming business and technology is driving that change, EY plays a significant role in […]

Debugging distributed systems can be difficult, largely because they are designed to run on many (possibly thousands of) hosts in a cluster. This process typically involves monitoring and analyzing log files spread across the cluster, and if the necessary information is not being logged, service restarts and job redeployment may be required. Not only is […]

Dinah Washington sang “what a difference a day makes” and having lived in London for a year this month, I’m feeling that multiplied by 365!  And what a year it has been…. I joined Hortonworks back in 2012 when the company was barely 8 months old and moved to be part of the International team […]

Part 1: A Little History

In this series of blog posts, we will provide an in-depth look at select features introduced with the release of Apache Storm (Storm) 1.0. To kick off the series, we’ll take a look at how Storm has evolved over the years from its beginnings as an open source project, up to the […]

This blog focuses on moving streaming analytics outside the confines of the traditional data center. Moving streaming analytics closer to where data originates can be accomplished by pairing an enterprise-grade data movement application with an extremely lightweight streaming engine. This combination is being used by forward-looking organizations to solve use cases in a […]

Apache Hadoop® exists within a broader ecosystem of enterprise analytical packages. This includes ETL tools, ERP and CRM systems, enterprise data warehouses, data marts and others. Modern workloads flow from these various traditional analytical sources into Hadoop and then often back out again. What dataset came from which system, when and how did it change over […]

The world’s data now doubles in volume every two years.  We’re living in an Age of Data fed by the Internet of Anything. Life in the Age of Data is always-on and always-connected with easy access to incredibly rich sources of analyzed information coming from the Internet, mobile devices, servers, machines, sensors, and so on. […]

Welcome back to my blogging adventure. If you’ve been reading my Cybersecurity series (“echo: hello world”, “Cybersecurity: the end of rules are nigh”, and “Cybersecurity: why context matters and how do we find it”), you know just how much time I’ve spent explaining why an integrated cybersecurity analytic solution should focus on delivering value and […]

Hadoop just turned 10; the first code check-in was on Feb. 2, 2006 by our very own co-founder, Owen O’Malley. I am tremendously proud to have been a part of these first 10 years, and even more excited about where this open movement is going to take us. Congratulations to everyone in the Community! We […]

I’ve said it before and I’ll say it again, we are OPEN, we are PUBLIC and we are PROUD.  Hortonworks Data Platform is 100% open source. Hortonworks Data Flow is 100% open source. Apache Metron, the incubating cybersecurity effort Hortonworks is stewarding, is 100% open source. Our strategy remains committed to 100% open, our products […]