cta

Get Started

cloud

Ready to Get Started?

Download sandbox

How can we help you?

closeClose button

The Hortonworks Blog

Hortonworks launched SmartSense in 2015 to help customers quickly collect cluster configuration, metrics, and logs to proactively detect issues, and expedite support cases troubleshooting.  This diagnostic information is packaged into an encrypted and anonymized bundle and sent to Hortonworks for analysis.  The result of that analysis is available as customized recommendations to help prevent issues […]

What an exciting time for Hadoop, for the Community and for Hortonworks. Last week, we announced our strategy around Open and Connected Data Platforms. And followed-up with the latest release of our flagship product, the Hortonworks Data Platform 2.4. This included the release of Apache Ambari 2.2, which will further enable enterprises to harness the […]

Today our guest blogger is Keith Manthey, CTO from EMC. As part of my job, I regularly meet with clients around their Apache Hadoop journey. I often meet executives after they have encountered a catalytic event. In one particular meeting I vividly remember, the client had suffered over 24 hours of downtime on their Hadoop […]

 

Women in Big Data

I’ve had an unbelievably amazing time since joining  Hortonworks.   One of the great things about a fast growth company, is that we are able to create new initiatives, try new approaches, and move the needle quickly.  We formed the women@hortonworks group late last summer after a few of us were asked to attend the inaugural Women in […]

How much time will shoppers spend online versus in stores? Are online shoppers mostly men or women? How old are they? Who are they shopping for? How do the answers change based on weekends, weather, holidays and geographic location? Utilizing this and so much more, retailers can tailor advertising efforts and specials to target the […]

It has a been a busy week on Hortonworks Community Connection, here is the hot content for this week (based on community activity and votes): Top 3 articles this week: (or see the whole list here) Visualize patients’ complaints to their doctors using NiFi and Solr/Banana: Solution to a very typical problem, how to take advantage […]

As Apache Spark continues to gain popularity, the rapid march of new Spark releases continues. With HDP 2.4, we are announcing the general availability of Spark 1.6, which is the latest Spark version from the community. With Spark proving an incredibly useful data access engine running on top of Hadoop, data scientists and business analysts […]

The world’s data now doubles in volume every two years.  We’re living in an Age of Data fed by the Internet of Anything. Life in the Age of Data is always-on and always-connected with easy access to incredibly rich sources of analyzed information coming from the Internet, mobile devices, servers, machines, sensors, and so on. […]

Welcome back to my blogging adventure.  If you’ve been reading my Cybersecurity series;  “echo: hello world”, “Cybersecurity: the end of rules are nigh”, and Cybersecurity: why context matters and how do we find it you know just how much time I’ve spent explaining why an integrated cybersecurity analytic solution should focus on delivering value and […]

Hadoop just turned 10, the first code check-in was on Feb. 2, 2006 by our very own co-founder, Owen O’Malley. I am tremendously proud to have been a part of this first 10 years, and even more excited on where this open movement is going to take us. Congratulations to everyone in the Community! We […]

During 2015, the pace of adoption of the Hortonworks Data Platform (HDP) continued to accelerate. Both existing and new customers brought massive amounts of data under management using Apache Hadoop technologies. I found it really interesting that new requirements being discussed by our customers showed much less consistency than previous years. In particular, we heard […]

I’ve said it before and I’ll say it again, we are OPEN, we are PUBLIC and we are PROUD.  Hortonworks Data Platform is 100% open source. Hortonworks Data Flow is 100% open source. Apache Metron, the incubating cybersecurity effort Hortonworks is stewarding, is 100% open source. Our strategy remains committed to 100% open, our products […]

It has a been a busy week on Hortonworks Community Connection, here is the hot content for this week (based on community activity and votes): Top 3 articles this week: (or see the whole list here) How to limit the size of ranger log and number of log files to retain? By default, Ranger uses the log4j […]

This is the third part of a series written by Guest Blogger Charles Boicey from UC Irvine Health (part 1, part 2). The series will demonstrate a real case study for Apache Hadoop in healthcare and also journal the architecture and technical considerations presented during implementation. During the summer of 2012 I made a commitment to the folks […]

Top 3 articles this week: (or see the whole list here) Cheat Sheet and Tips for a Custom Install of Hortonworks Data Platform like a Pro. Some great tips and tricks on helping your install go flawlessly. A must read for anyone installing HDP. Recommended reading for anyone looking at Hadoop distributions.  How to install […]