The Hortonworks Blog

Thank you for reading our Data Lake 3.0 series! In part 1 of the series, we introduced what a Data Lake 3.0 is. In part 2 of the series, we talked about how a multi-colored YARN will play a critical role in building a successful Data Lake 3.0. In part 3 of the series, we […]

Thank you for reading our Data Lake 3.0 series! In part 1 of the series, we introduced what a Data Lake 3.0 is and in part 2 of the series, we talked about how a multi-colored YARN will play a critical role in building a successful Data Lake 3.0. In this blog, we will take a […]

Thank you for reading our Data Lake 3.0 series! In part 1 of the series, we briefly introduced the power of leveraging prepackaged applications in Data Lake 3.0 and how the focus will shift from the platform management to solving the business problems. In this post, we further deliberate on this idea to help answer […]

Apache Ranger’s graduation to TLP is just one step in a longer journey to help enterprises across industries secure their big data platforms using a modern, open-source authorization and audit framework. Below are the highlights of the breadth of capabilities currently available in Apache Ranger: Apache Ranger is a centralized framework to define, administer […]

About two years ago, Hortonworks donated the entire code base of about 440,000 lines from its XA Secure acquisition to the Apache Software Foundation (ASF) in order to help jump start Apache Ranger as an Apache Incubator project. Hortonworks made this decision because our enterprise customers need an extensible and robust open source security framework […]

We are very excited about the release of Apache Zeppelin 0.7.0 and want to thank the Apache Software Foundation along with the Apache Zeppelin community. The long-awaited release introduces several key features, which are highlighted below. The most notable improvements in this release are in the areas of multi-user enhancements, pluggable visualization, Apache Spark & security […]

We are pleased to announce the latest release of Hortonworks Data Cloud for AWS. This release (version 1.11 for those who are keeping score) continues to drive towards the goal of making data processing easy and cost-effective in the cloud. For those who aren’t familiar with Hortonworks Data Cloud for AWS (or “HDCloud” for […]

Today, Hortonworks announced the Hortonworks EDW Optimization Solution to help extend and accelerate return on investment for business intelligence, e.g., the data warehouse. The solution brings together technologies from Hortonworks and partners Syncsort and AtScale. But before I dig into the details of this solution, it is worth understanding the vision Hortonworks is revealing here. […]

Apache Spark 2.1 was released recently in the community. The main focus of this release was improvements in Structured Streaming and Machine Learning. Structured Streaming: Kafka 0.10 support, metrics and stability improvements. Machine Learning: SparkR improvements, including new ML algorithms for LDA, random forests, GMM, etc. Wanna try Spark 2.1 now? Well, you are in […]

Last year Hadoop celebrated its tenth birthday, young in the land of data technologies. But the growth in popularity of Apache Hadoop is not slowing down anytime soon. In fact, results from the 2016 Big Data Maturity Survey indicate that 97% of respondents plan to do more big data initiatives in the next 3 months. The […]

The NRF Big Show is here and it’s no surprise that retail data analytics are a hot topic. It’s an exciting time for retailers as we continue to discover the power of data to improve our ability to personalize the customer experience, drive brand loyalty and increase sales. Two key trends are emerging – retailers […]

As we kick off the new year I wanted to thank our customers, partners, Apache community members, and of course the amazing Hortonworks team, for an amazing 2016. Let’s take a step back and look at some of the Hortonworks highlights from last year… IN THE ECOSYSTEM there was tremendous acceleration. At the beginning of […]

Bob Glithero, Analytics Product Marketing Manager, Pivotal. Over the last five years, mobile network operators (MNOs) realized 15% lower compound revenue growth on average than other types of communication service providers. With few exceptions, MNOs globally have seen a long-term decline in average revenue per user (ARPU). To reinvigorate growth, innovative MNOs are searching for […]

As the hectic holiday season nears, we’re all looking for ways to have a little more time for friends and family, to enjoy the season and perhaps slow down a little. But as many of us know, business doesn’t always wait. And for some, it’s one of the busiest times of the year. Wouldn’t it be […]

Apache Spark has ignited an explosion of data exploration on very large data sets. Spark played a big role in making general purpose distributed compute accessible. Anyone with some level of skill in Python, Scala, Java, and now R, can just sit down and start exploring data at scale. It also democratized Data Science by […]