cta

Get Started

cloud

Ready to Get Started?

Download sandbox

How can we help you?

closeClose button

Hadoop Ecosystem

Debugging distributed systems can be difficult largely because they are designed to run on many (possibly thousands) of hosts in a cluster. This process typically involves monitoring and analyzing log files spread across the cluster, and if the necessary information is not being logged, service restarts and job redeployment may be required. Not only is […]

There were a lot of great activities and sessions at the recent Apache: Big Data North America in Vancouver, B.C. I enjoyed the technical level of the sessions and meeting others who contribute to projects in the Apache Software Foundation (ASF). The sessions I went to had a high level of interesting technical content, with […]

The world’s top authorities on Apache Hadoop convene at Hadoop Summit San Jose and one of the top questions that will be answered will be around the future and direction of Hadoop. Sanjay Radia – Founder and Architect, Hortonworks lead the track which selected 13 sessions around this topic. I asked Sanjay what he hoped would […]

At Hadoop Summit San Jose the goal of the Data Science, Analytics and Spark track is sure to be packed. Ram Sriharsha – Product Manager Apache Spark, Databricks generalizes the 16 sessions in the track as providing technical guidance around: Leveraging Hadoop for analytics is a key use case across industries and represents a critical value proposition for Hadoop. This track […]

Hadoop Summit San Jose, is just around the corner. I am amazed at the depth and breadth of the technical sessions and was looking at the Application Development track: Application Development YARN has transformed Hadoop into a multi-tenant data platform. It is the foundation for a wide range of processing engines that empowers businesses to […]

In preparation for Hadoop Summit San Jose, I asked the Chair for the Apache Committer Insights track, Andy Feng – VP Architecture, Yahoo! which were the top 3 sessions he would recommend. Although it was a tough choose only 3, he recommended: HDFS: Optimization, Stabilization and Supportability Speakers: Chris Nauroth from Hortonworks and Arpit Agarwal […]

Welcome back to my blogging adventure.  In my Cybersecurity Architecture series, we’ve spent some time discussing the value an analytic approach to the incident response process. In the last article, Conceptual Cybersecurity Architecture for analytic response, we started to drill into the solution space by giving a high level architecture to drive our discussion.  Let’s […]

A guest blog post from Scott Schlesinger, Principal, Ernst & Young LLP In July 2015, EY announced its EY Warranty Analytics service offering for the SAP HANA® platform. The service includes EY’s advanced analytics for use with SAP® technology to monitor warranty claims, with the goals of identifying fraudulent activity, reducing costs and improving quality. Automobile […]

To compete in the age of IoAT, organizations are tapping into data sources from a network of physical objects to design new customer experiences. The companies that are furthest along are removing operational inefficiencies from their internal processes. They are using self learning algorithms and dynamic model deployment for predictive maintenance to accelerate success in […]

Introduction The community recently announced the release of Apache Storm 1.0.0 Stable. This is a significant release that delivers several features that pertain to enterprise readiness, operational simplicity and ease of use by dramatically enhancing areas around performance, scalability, debug-abilty and manageability. Highlights Here are some of the highlights of features introduced in Storm 1.0 […]

Hortonworks is proud and committed to being 100% open, we break down silos, push boundaries and enable the entire ecosystem to flourish and innovate (read Shaun Connolly’s blog). That belief extends to our commitment with Open Data Platform initiative (ODPi) as well, we are proud to be part of ODPi because it operates under an open governance model […]

Today’s blog post comes from our partners at Neustar and is by Jonjie Sena When it comes to mobile Caller ID, smartphone owners are feeling outnumbered. In the U.S. market, where Caller ID has the largest uptake, over ninety percent of landline phones feature Caller ID service, versus two percent for mobile phones. The remaining […]

I had the pleasure to speak at Spark Summit in New York today about accelerating the adoption of Spark by mainstream enterprises. I had to admit at the beginning of my talk that I’m an “open source addict” — over the past 12 years I’ve been blessed to have called JBoss, Red Hat, SpringSource, and Hortonworks […]

We started Hortonworks Community Connection at the end of 2015, and there is some amazing content that any data developer or data administrator should read and bookmark. I will publish this blog weekly and highlight the top technical articles that are on HCC based on community activity and votes.  Top 3 articles on the site:  Sample […]

Hortonworks has achieved quite a bit of success with online dating. Personally, I haven’t just yet, but hey it warms my heart to think about all those that we’ve helped bring together. Valentine’s Day is upon us and so I wanted to launch this Cupid’s arrow with a missive about how Hortonworks Data Platform (HDP) […]