The Hortonworks Blog

Opportunity abounds

According to the enterprise data usage experts at Appfluent, the typical Enterprise Data Warehouse (EDW) dedicates 70% of its storage volume to unused data and 55% of its processing capacity to low value ETL workloads. This represents a waste of what could otherwise be a high performance, finely tuned analytics and reporting environment that supports enterprise priorities. Even worse, EDW environments often cannot deal with the varied structures of new data sources that offer so much untapped value.…

The recent post by Jayush Luniya announced the community release of Apache Ambari 2.0. One of the three key Ambari features that Jayush discussed was Rolling Upgrades, enabling Hadoop operators to upgrade from one version of HDP to the next, with minimal disruption to the cluster.

The Hortonworks development team worked long and hard to make the Hadoop platform “rolling upgradeable”. That groundwork was available in Hortonworks Data Platform 2.2 as described in this previous post.…

This is the third post in a series exploring recent innovations in the Hadoop ecosystem that are included in Hortonworks Data Platform (HDP) 2.2. In this post, we introduce the theme of supporting rolling upgrades and downgrades of a HDFS cluster. See this previous post for an introduction on enterprise-grade rolling upgrades in HDP 2.2.

Hortonworks Data Platform provides centralized enterprise services for consistent operations of Hadoop clusters for a reliable enterprise-ready data lake.…

Advances in Hadoop security, governance and operations have accelerated adoption of the platform by enterprises everywhere. Apache Ambari is the open source operational platform for provisioning, managing and monitoring Hadoop clusters from a single pane of glass, and with the Apache Ambari 1.7.0 release last year, Ambari made it far easier for enterprises to adopt Hadoop.

Today, we are excited to announce the community release of Apache Ambari 2.0, which will further accelerate enterprise Hadoop usage by simplifying the technical challenges that slow adoption the most.…

Hortonworks subscribers across all major industries use Hortonworks Data Platform (HDP) to power advanced analytics applications for data discovery and predictive analytics. The insurance industry uses Hadoop to drive this type of innovation for usage-based insurance (UBI).

Cindy Maike is the GM for Insurance Solutions at Hortonworks, and later this month she will present on big data for UBI at Insurance Telematics Canada 2015. The conference begins on April 23rd in Toronto, and Cindy will present with Tammy Chen from Towers Watson and Gerry Lee from QA Consultants on “Data Makes the (UBI) World Go Round.”

Register for the Conference & Exhibition

Here’s a preview of what Cindy and the panel will discuss.…

As we are finalizing our preparations for what will surely be another successful Hadoop Summit Europe event, one thing has become unequivocally clear: the Hadoop challenge is no longer about acceptance. It’s no longer about adoption. It’s about Hadoop being pervasive. Hadoop is everywhere.

As Mike Gualtieri of Forrester wrote in a recent report:

Hadoop is a must-have for large enterprises

I couldn’t agree more with Mike’s assessment, and I encourage you to read the report: “Predictions 2015: Hadoop Will Become a Cornerstone of Your Business Technology Agenda”.…

Hortonworks is excited to announce that our first hands-on, performance based certification exam is now available! The HDP Certified Developer (HDPCD) exam is designed for Hadoop developers working with frameworks like Pig, Hive, Sqoop and Flume. This new approach to Hadoop certification is designed to allow individuals an opportunity to prove their Hadoop skills in a way that is recognized in the industry as meaningful and relevant to on-the-job performance.…

In this guest blog, Dale Glover, vice president of Industry Consulting at Teradata, discusses the evolution of Internet of Things, how sensor data are used for diagnostics and repair as well as for prescriptive and proactive data analytics.

The Internet of Things (IOT) is upon us, and we see new participants joining this evolution of technology every day. Whether it is some new consumer device/appliance contributing to the ultimate automation of our homes and vehicles, or the deluge of new smartphone-connected devices (Fitbits, watches, or even smart clothing), the number of “things” generating volumes of data in an automatic and consistent manner increases daily.…

Newbold Advisors is a Hortonworks® services partner that works with customers in the oil and gas industry. The company delivers big data analytics strategies and solutions across all segments of the oil and gas industry. I recently spoke with Ram Seetepalli, Senior Director at Newbold Advisors. We discussed the challenges facing midstream companies and how organizations in that sector aggressively leverage all of their data assets with a modern data architecture with Apache Hadoop at its core.…

This guest post is from Vamsi Chemitiganti, chief architect of Red Hat’s Financial Services Vertical. Vamsi is responsible for driving Red Hat’s technology vision from a client standpoint. His areas of focus range from platform, middleware, storage to big data and cloud (IaaS and PaaS). The clients Vamsi engages with on a daily basis span marquee names on Wall Street, including businesses in capital markets, core banking, wealth management and IT operations.…

This three part series is co-authored by Ofer Mendelevitch, director of data science at Hortonworks, and Jiwon Seo, Ph.D. and research assistant at Stanford University.

Introduction

This is the third part of the blog-post series about anomaly detection from healthcare data.

In part 1, we described the dataset, the business use-case and our general approach of applying graph algorithms (specifically the personalized-PageRank algorithm) to detect anomalies in the Medicare-B dataset.…

As retailers embark on taking advantage of big data, they are increasingly looking to the Apache Hadoop platform and the partner ecosystem that surrounds it to solve their most pressing challenges. Our partner Microsoft is helping organizations around the world in Retail gain better insights into their big data with Azure HDInsight and additional Azure analytic services.

As an example, Azure HDInsights, powered by Hortonworks Data Platform (HDP), was a key component in helping Pier 1 Imports realize value from unstructured and structured data to get a 360 degree view of their customers.…

Apache Ambari is the only 100% open source management and provisioning tool for Apache Hadoop. Recent innovations of Apache Ambari have focused on opening Apache Ambari into a pluggable management platform that can automate cluster provisioning, deploy 3rd party software and provide custom operational and developers’ views to the end user.

Join us Thursday March 26 at 10am PT, for an online technical workshop where we will cover 3 key integration points of Apache Ambari including Stacks, Views and Blueprints and deliver working examples of each.…

At the beginning of February, HP announced their intent to acquire Voltage Security to expand data encryption security solutions for Cloud and Big Data. Today, both companies share their thoughts about the acquisition. Carole Murphy, Director Product Marketing at Voltage Security, and Albert Biketi, Vice President and General Manager at HP Atalla, tell us more about how HP extends the capabilities of every product in the Voltage portfolio, including Voltage’s leadership in securing Hadoop data with data-centric, standards-based technologies.…

Today EMC is launching their EMC® Business Data Lake solution, the first fully-engineered, enterprise-grade solution for a Data Lake running on EMC infrastructure. At Hortonworks, we’ve been assisting customers on their journey to a data lake via a Modern Data Architecture (MDA) and our vision and EMC’s vision are highly complementary and so we’re delighted to be part of the EMC Business Data Lake.

The Data Lake enabled by a Modern Data Architecture allows enterprises to be a Data-First Enterprise.…