cta

Get Started

cloud

Ready to Get Started?

Download sandbox

How can we help you?

closeClose button

The Hortonworks Blog

“Millennials want “finance at their fingertips”..they want to be able to email and text the financial advisors and talk to them on a real-time basis,”– Greg Fleming, Ex-Morgan Stanley Executive The first post in this series on Robo-advisors,  touched on the fact that Wealth Management has been an area largely untouched by automation as far as […]

Hortonworks DataFlow has been seeing great success being deployed in multiple use cases. We recently shared a set of real-world use cases on a webinar, and also wanted to share here so readers can peruse which types of uses cases are being implemented and see if there are parallels between current and future users of […]

HDF makes streaming analytics faster and easier, by enabling accelerated data collection, curation, analysis and delivery in real-time, on-premises or in the cloud through an integrated solution with Apache NiFi, Kafka and Storm. This 7-part webinar series takes you through tutorials, workshops, and real business use cases.

Wealth Management is the highest growth businesses for any medium to large financial institution. It also is the highest customer touch segment of banking and is fostered on long term (read extremely lucrative advisory) relationships. This three part series explores the automated “Robo-advisor” movement in the first post. We will cover the business background and some definitions . The second post will focus on the […]

We recently hosted a webinar on the topic of  HDF 2.0 and the integration between Apache NiFi, Apache Ambari and Apache Ranger.  We thought we would share the questions & answers from the webinar, and also compile relevant data into a single place to make it easy to find and reference. Should you have any […]

It’s no secret that there is a data explosion. A recent IDC analyst report from April 2014 indicated the volume of data, known as the digital universe, is doubling in size every two years. And by 2020, there will be as many digital bits as there are stars in the universe. There are many reasons […]

Guest author: Jeff Kelly, Data Strategist, Pivotal The phrase “digital transformation” gets bandied about a lot these days, but what exactly does it mean? When you strip away the hyperbole, I believe digital transformation is the process by which enterprises evolve from using traditional information technology to merely support existing business models to adopting modern […]

People often think about cloud architecture in simplistic terms: you’re either public, private, or hybrid. (In fact, there’s even confusion about the meaning of the term “hybrid” itself—this video helps clear it up: In the real world, of course, virtually every implementation is hybrid—no company puts 100% of its IT environment into one single cloud. […]

The 100% open source and community driven innovation of Apache Hive 2.0 and LLAP (Long Last and Process) truly brings agile analytics to the next level. It enables customers to perform sub-second interactive queries without the need for additional SQL-based analytical tools, enabling rapid analytical iterations and providing significant time-to-value. TRY HIVE LLAP TODAY Read about […]

Last week, we had a jam-packed webinar on Hortonworks DataFlow, with over 700 registrants and so we were unable to get back to everyone to answer their questions. We’ve grouped the questions (and answers) below into the following categories, and  if you have more questions, anytime, we encourage you to check out the Data Ingestion […]

Apache Hive(™) is the most complete SQL on Hadoop system, supporting comprehensive SQL, a sophisticated cost-based optimizer, ACID transactions and fine-grained dynamic security. Though Hive has proven itself on multi-petabyte datasets spanning thousands of nodes many interesting use cases demand more interactive performance on smaller datasets, requiring a shift to in-memory. Hive 2 marks the […]

 Original post in HCC I had a few hours in the morning before the Strata+ Hadoop World conference schedule kicked in, so I decided to write a little HDF 2.0 flow to grab all the tweets about the Strata Hadoop conference. First up, I used GetTwitter to read tweets and filtered on these terms: strata, […]

One the most enjoyable parts of my job is working with customers and partners who have innovated on the Hortonworks Connected Data Platform.  Companies like Servient. Here’s a great real example of a recent use case for a customer we worked together on in the energy vertical.  I’ve removed the actual name for obvious reasons. […]

Hadoop’s ability to work with Amazon S3 storage goes back to 2006 and the issue HADOOP-574, “FileSystem implementation for Amazon S3”. This filesystem client, “s3://” implemented an inode-style filesystem atop S3: it could support bigger files than S3 could then support, some its operations (directory rename and delete) were fast. The s3 filesystem allowed Hadoop […]

This is the first of a three part series of the evolution of the Hortonworks and Microsoft relationship. Microsoft has led one tech industry revolution after another from the dawn of personal computing to the cloud. Hortonworks is defining a new generation of innovation and impact with its pioneering work in Big Data. You already […]