cta

Get Started

cloud

Ready to Get Started?

Download sandbox

How can we help you?

closeClose button

The Hortonworks Blog

This is the first blog in a series written by Richard Proctor, GM of Global Healthcare at Hortonworks, Inc. The series will discuss the reasons for Healthcare’s surging interest in, and rapid adoption of, Hadoop. We’ve all heard a lot lately about Hadoop, especially within the Healthcare industry. Well, this is really important stuff! Healthcare represents the […]

Another busy week on Hortonworks Community Connection, here is the hot content for this week (based on community activity and votes): Top 3 articles this week: (or see the whole list here) Map Hive jobs to YARN queues Using a Hive hook to map jobs to YARN queues when using hive.server2.enable.doAs = false (security best practices and […]

Denodo, providers of a data virtualization solution, has partnered with Hortonworks to help customers access their data stored in HDP – and any other data source in the organization – seamlessly and in real-time. Hadoop and the Need for Speed The amount of data continues to explode, doubling every two years, and this flood of […]

Another busy week on Hortonworks Community Connection, here is the hot content for this week (based on community activity and votes): Top 3 articles this week: (or see the whole list here) Quickly enable SSL encryption for Hadoop components in HDP Sandbox Nice collection of scripts and advice on using Sandbox with SSL.  Hive on Tez Performance Tuning […]

As all of us know, the advent of big data has revolutionized analytics and data science allowing enterprises to store, access and analyze vast amounts of historical data.  But existing data platforms need to evolve to deal with the tsunami of data-in-motion being generated by the Internet of Anything (IoAT). We need a fresh approach to maximize […]

The Hortonworks Data Platform (HDP)[1] compliments core Apache Hadoop with Enterprise level governance, security, and operations capabilities. At HDP’s architectural center is YARN, providing resource management and a pluggable architecture that enables batch, interactive and real-time workloads to run effectively together, in the same cluster. SAS® Event Stream Processing[1] is embeddable software that analyzes streaming […]

Big Data, the Internet of Anything (IoAT) and the Connected Car have created a new Information Superhighway that fundamentally changes the relationship between automakers and drivers. Previously, automakers had an incomplete feedback loop after they sold a vehicle but the connected car has changed all of that. Now, automakers can establish a complete feedback from each […]

The Financial Services industry is undergoing a major transformation. Innovation in data technologies is driving growth of predictive analytics and data mining techniques that will dramatically change banking over the next few years. This is the first of three blogs that will describe that transformation. In this one, I’ll cover the importance of data science […]

Another busy week on Hortonworks Community Connection, here is the hot content for this week (based on community activity and votes): Top 3 articles this week: (or see the whole list here) Cheat Sheet and Tips for a Custom Install of Hortonworks Data Platform like a Pro Great article on the necessary steps to consider before undertaking a […]

Hortonworks launched SmartSense in 2015 to help customers quickly collect cluster configuration, metrics, and logs to proactively detect issues, and expedite support cases troubleshooting.  This diagnostic information is packaged into an encrypted and anonymized bundle and sent to Hortonworks for analysis.  The result of that analysis is available as customized recommendations to help prevent issues […]

What an exciting time for Hadoop, for the Community and for Hortonworks. Last week, we announced our strategy around Open and Connected Data Platforms. And followed-up with the latest release of our flagship product, the Hortonworks Data Platform 2.4. This included the release of Apache Ambari 2.2, which will further enable enterprises to harness the […]

Today our guest blogger is Keith Manthey, CTO from EMC. As part of my job, I regularly meet with clients around their Apache Hadoop journey. I often meet executives after they have encountered a catalytic event. In one particular meeting I vividly remember, the client had suffered over 24 hours of downtime on their Hadoop […]

How much time will shoppers spend online versus in stores? Are online shoppers mostly men or women? How old are they? Who are they shopping for? How do the answers change based on weekends, weather, holidays and geographic location? Utilizing this and so much more, retailers can tailor advertising efforts and specials to target the […]

It has a been a busy week on Hortonworks Community Connection, here is the hot content for this week (based on community activity and votes): Top 3 articles this week: (or see the whole list here) Visualize patients’ complaints to their doctors using NiFi and Solr/Banana: Solution to a very typical problem, how to take advantage […]

As Apache Spark continues to gain popularity, the rapid march of new Spark releases continues. With HDP 2.4, we are announcing the general availability of Spark 1.6, which is the latest Spark version from the community. With Spark proving an incredibly useful data access engine running on top of Hadoop, data scientists and business analysts […]