cta

Get Started

cloud

Ready to Get Started?

Download sandbox

How can we help you?

closeClose button

The Hortonworks Blog

At Hadoop Summit San Jose the goal of the Data Science, Analytics and Spark track is sure to be packed. Ram Sriharsha – Product Manager Apache Spark, Databricks generalizes the 16 sessions in the track as providing technical guidance around: Leveraging Hadoop for analytics is a key use case across industries and represents a critical value proposition for Hadoop. This track […]

Hadoop Summit San Jose, is just around the corner. I am amazed at the depth and breadth of the technical sessions and was looking at the Application Development track: Application Development YARN has transformed Hadoop into a multi-tenant data platform. It is the foundation for a wide range of processing engines that empowers businesses to […]

In preparation for Hadoop Summit San Jose, I asked the Chair for the Apache Committer Insights track, Andy Feng – VP Architecture, Yahoo! which were the top 3 sessions he would recommend. Although it was a tough choose only 3, he recommended: HDFS: Optimization, Stabilization and Supportability Speakers: Chris Nauroth from Hortonworks and Arpit Agarwal […]

The Ambari Metrics System (AMS) released with Ambari 2.0 about a year ago is an Ambari-native pluggable and scalable system for collecting and querying Hadoop Metrics (AMBARI-5707). Since that time, the community has been working hard at adding new capabilities to the system and recently announced the availability of Ambari 2.2.2 where AMS now includes […]

I’m just reaching the end of my first month at Hortonworks based in our London office.  Most of that time has been spent with our customers understanding their use cases, reading about trends and developments in data analytics or watching videos about everything from connected data platforms to modern data apps to the bits and bytes […]

The first post in this three part series brought to the fore critical strategic trends in the Wealth & Asset Management (WM) space – the most lucrative portion of Banking. This second post will describe an innovation framework for a forward looking WM institution.The final post will cover technology architecture and business strategy recommendations for WM CXO’s. Introduction: […]

Part 1: A Little History In this series of blog posts, we will provide an in-depth look select features introduced with the release of Apache Storm (Storm) 1.0. To kick off the series, we’ll take a look how Storm has evolved over the years from its beginnings as an open source project, up to the […]

Before we drill down into how Hortonworks partnered with Arizona State University (ASU) to design and develop a platform to discover genomic links to cancer, let’s take a look at a few of cancer’s fundamental attributes. Cancer is both a complicated and complex disease.  Cancer is complicated because it is not actually a single disease, but rather the […]

Welcome back to my blogging adventure.  In my Cybersecurity Architecture series, we’ve spent some time discussing the value an analytic approach to the incident response process. In the last article, Conceptual Cybersecurity Architecture for analytic response, we started to drill into the solution space by giving a high level architecture to drive our discussion.  Let’s […]

This blog focuses on moving streaming analytics outside the confines of the traditional data center. Moving streaming analytics closer to where data originates can be accomplished by leveraging an enterprise grade data movement application, married with an extremely lightweight streaming engine. This combination is being used by forward-looking organizations to solve usage cases in a […]

A guest blog post from Scott Schlesinger, Principal, Ernst & Young LLP In July 2015, EY announced its EY Warranty Analytics service offering for the SAP HANA® platform. The service includes EY’s advanced analytics for use with SAP® technology to monitor warranty claims, with the goals of identifying fraudulent activity, reducing costs and improving quality. Automobile […]

Apache Hadoop® exists within a broader ecosystem of enterprise analytical packages. This includes ETL tools, ERP and CRM systems, enterprise data warehouses, data marts and others. Modern workloads flow from these various traditional analytical sources into Hadoop and then often back out again. What dataset came from which system, when and how did it change over […]

  At Hortonworks, we work with hundreds of enterprises to ensure they get the most out of Apache Hadoop and the Hortonworks Data Platform. A critical part of making that possible is ensuring operators can quickly identify the root cause if something goes wrong. A few weeks ago, we presented our vision for Streamlining Apache […]

After years of experience with the entire Hadoop stack, Hortonworks solutions engineer Paul Hargis became interested in the math and statistics behind machine learning. He has since morphed into a Big Data Architect Extraordinaire and Spark Subject Matter Expert. In a recent interview, Paul shared his background and unique insights around Hadoop, Spark and Machine […]

“If (wealth management advisors) continue to work the way you have been, you may not be in business in five years” – Industry leader Joe Duran, 2015 TD Ameritrade Wealth Advisor Conference. The wealth management segment is a potential high growth business for any financial institution. It is the highest customer touch segment of banking and is fostered on long term and extremely lucrative advisory relationships. […]