The Hortonworks Blog

People often think about cloud architecture in simplistic terms: you’re either public, private, or hybrid. (In fact, there’s even confusion about the meaning of the term “hybrid” itself; this video helps clear it up.) In the real world, of course, virtually every implementation is hybrid: no company puts 100% of its IT environment into one single cloud. […]

The 100% open source, community-driven innovation of Apache Hive 2.0 and LLAP (Live Long and Process) brings agile analytics to the next level. It lets customers run sub-second interactive queries without additional SQL-based analytical tools, enabling rapid analytical iterations and significant time-to-value. Read about […]

Last week, we had a jam-packed webinar on Hortonworks DataFlow, with over 700 registrants, so we were unable to get back to everyone to answer their questions. We’ve grouped the questions (and answers) below into the following categories, and if you have more questions, anytime, we encourage you to check out the Data Ingestion […]

Apache Hive™ is the most complete SQL on Hadoop system, supporting comprehensive SQL, a sophisticated cost-based optimizer, ACID transactions and fine-grained dynamic security. Though Hive has proven itself on multi-petabyte datasets spanning thousands of nodes, many interesting use cases demand more interactive performance on smaller datasets, requiring a shift to in-memory. Hive 2 marks the […]

Original post in HCC. I had a few hours in the morning before the Strata + Hadoop World conference schedule kicked in, so I decided to write a little HDF 2.0 flow to grab all the tweets about the Strata Hadoop conference. First up, I used GetTwitter to read tweets and filtered on these terms: strata, […]

One of the most enjoyable parts of my job is working with customers and partners who have innovated on the Hortonworks Connected Data Platform. Companies like Servient. Here’s a great real example of a recent use case in the energy vertical, for a customer we worked with together. I’ve removed the actual name for obvious reasons. […]

Hadoop’s ability to work with Amazon S3 storage goes back to 2006 and the issue HADOOP-574, “FileSystem implementation for Amazon S3”. This filesystem client, “s3://”, implemented an inode-style filesystem atop S3: it could support bigger files than S3 could then support, and some of its operations (directory rename and delete) were fast. The s3 filesystem allowed Hadoop […]

This is the first of a three-part series on the evolution of the Hortonworks and Microsoft relationship. Microsoft has led one tech industry revolution after another, from the dawn of personal computing to the cloud. Hortonworks is defining a new generation of innovation and impact with its pioneering work in Big Data. You already […]

Hortonworks Big-Data Maturity Scorecard v2.0. The fourth Industrial Revolution is here, and competing to succeed in the 4.0 ‘digital’ world entails making the right decisions, based on data-driven pointers, to successfully implement your strategy. As we work with the entire stack of Fortune 100 organizations, we often see companies, particularly those operating across business lines […]

Cloud Computing is one of the big three trends impacting IT architectures today.  What some may not realize is that an underlying connected data architecture is not only essential for cloud, but sits at the confluence of all three trends. Here’s why. The first big trend is IoT. According to BI Intelligence, we can now […]

In the US fast food industry, this is a common question when you order a burger: ‘You want fries with that?’ It’s in the American psyche at this point, and has become common parlance. I recently heard this exchange: ‘Hey, can I get a copy of your targeted promos report?’ ‘Sure! You want […]

It has been another exciting week on Hortonworks Community Connection (HCC). We continue to see great activity and recommend the following assets from last week. Top Articles from HCC: Supporting Custom Properties for Expression Language in Apache NiFi, by ydavis. NiFi has previously supported the ability to refer to flow file attributes, system properties and environment […]

We recently concluded this webinar series, with 7 webinars and 77 questions answered. All webinars, slides, Q&A and related info are available below. Should you have any more questions, anytime, we encourage you to check out the Data Ingestion & Streaming track of Hortonworks Community Connection where an entire community of folks are monitoring and […]

With the release of Hortonworks 2.5 Sandbox, several exciting new features have been added to Apache Spark and Apache Zeppelin. Apache Spark Updates: One of the most powerful new Hortonworks 2.5 Sandbox features is the ability to run two versions of Spark side by side in the same environment: a Generally Available (GA) Spark 1.6.2 and a […]

How Hortonworks can help the hotel industry capture value through Insights Aggregation and Predictive Analytics. Big Data has transformed every industry, including the hospitality vertical. Through customer analytics, targeted segmentation, and campaigning, hotels would like to focus on delivering personalized promotions and on cross-selling and up-selling travel services. Our objective is to address these challenges through an open-source […]