The Hortonworks Blog

We at Hortonworks have spent countless hours working with customers as they use Apache Hadoop, Spark and Hive in the cloud, to help them better leverage the cloud platforms they use for these data processing workloads. In the interest of community and sharing, we wanted to share some of the “top reasons” we’ve heard. Enjoy! Cloud […]

Streaming on Apache Spark and Interactive SQL on Apache Hive now run even faster. In 2016, Hortonworks announced Microsoft Azure HDInsight as its Premier Connected Data Platforms cloud solution to give customers Apache™ Hadoop® in cloud environments. Microsoft and Hortonworks have been pioneering cloud solutions together for the past four years through a strategic partnership […]

With the introduction of the Hortonworks Data Cloud (HDCloud), deploying clusters and starting to process data has become an order of magnitude faster. When Apache Hadoop evolved from being an on-premises solution to a cloud-based solution, the time it took to make a cluster went from weeks to days. The same magnitude of […]

We are pleased to announce the latest release of Hortonworks Data Cloud for AWS. This release (version 1.11 for those who are keeping score) continues to drive towards the goal of making data processing easy and cost-effective in the cloud. For those who aren’t familiar with Hortonworks Data Cloud for AWS (or “HDCloud” for […]

Today, Hortonworks announced the Hortonworks EDW Optimization Solution to help extend and accelerate return on investment for business intelligence, e.g., the data warehouse. The solution brings together technologies from Hortonworks and partners Syncsort and AtScale. But before I dig into the details of this solution, it is worth understanding the vision Hortonworks is revealing here. […]

Apache Spark 2.1 was released recently in the community. The main focus of this release was improvements in Structured Streaming and Machine Learning. Structured Streaming: Kafka 0.10 support, metrics and stability improvements. Machine Learning: SparkR improvements, including new ML algorithms for LDA, random forests, GMM, etc. Wanna try Spark 2.1 now? Well, you are in […]

As the hectic holiday season nears, we’re all looking for ways to have a little more time for friends and family, to enjoy the season and perhaps slow down a little. But as many of us know, business doesn’t always wait. And for some, it’s one of the busiest times of year. Wouldn’t it be […]

We recently concluded our highly attended How to Get Started with Hortonworks Data Cloud for AWS webinars. Thank you, Jeff Sposetti and Sean Roberts, for hosting the sessions. The webinars provided a very informative overview of the offering and included a detailed demonstration to show how the product works. Some great questions came across during […]

It is that time of year again, right before Christmas in Las Vegas, where nearly 30,000 technologists gather to see the latest in innovation around the Cloud. Hortonworks is honored to participate as an exhibitor for the first time. If you are in Vegas this week for AWS re:Invent, please stop by our booth #2732 […]

Earlier this year, we started making Technical Previews of Hortonworks Data Cloud for AWS available. The feedback and response have been incredible, and over the past few months, we performed many Technical Preview refreshes. Now we are ready to make it official and release the product into AWS Marketplace. Therefore, we are excited to announce […]

It’s no secret that there is a data explosion. A recent IDC analyst report from April 2014 indicated the volume of data, known as the digital universe, is doubling in size every two years. And by 2020, there will be as many digital bits as there are stars in the universe. There are many reasons […]

Guest author: Jeff Kelly, Data Strategist, Pivotal. The phrase “digital transformation” gets bandied about a lot these days, but what exactly does it mean? When you strip away the hyperbole, I believe digital transformation is the process by which enterprises evolve from using traditional information technology to merely support existing business models to adopting modern […]

People often think about cloud architecture in simplistic terms: you’re either public, private, or hybrid. (In fact, there’s even confusion about the meaning of the term “hybrid” itself; this video helps clear it up.) In the real world, of course, virtually every implementation is hybrid: no company puts 100% of its IT environment into one single cloud. […]

Hadoop’s ability to work with Amazon S3 storage goes back to 2006 and the issue HADOOP-574, “FileSystem implementation for Amazon S3”. This filesystem client, “s3://”, implemented an inode-style filesystem atop S3: it could support bigger files than S3 could then support, and some of its operations (directory rename and delete) were fast. The s3 filesystem allowed Hadoop […]

This is the first of a three-part series on the evolution of the Hortonworks and Microsoft relationship. Microsoft has led one tech industry revolution after another, from the dawn of personal computing to the cloud. Hortonworks is defining a new generation of innovation and impact with its pioneering work in Big Data. You already […]