The Hortonworks Blog

One of the privileges of my job is working with many different technology leaders who are major users of big data, in all types of companies and industries. I get a bird’s eye view of the playing field. There are definitely a number of skirmishes happening at the moment in the cultural revolution created by the data […]

If you’ve been reading along, you’re aware of the lightbulb moments from my article, “echo: hello world”, that allowed me to discover the benefits of an analytic approach to cybersecurity. This is the start of my new blog series, The CISO’s View, where I focus on the executive-level business concerns facing security leaders. Today, […]

Apache Spark has been open source’s new kid on the block. Companies are using Spark to develop sophisticated models that enable them to discover new opportunities or avoid risk. But what does the future, or at least the near future, hold for Spark? In this blog we have outlined five trends we see in […]

“Millennials want ‘finance at their fingertips’... they want to be able to email and text the financial advisors and talk to them on a real-time basis.” – Greg Fleming, Ex-Morgan Stanley Executive

The first post in this series on Robo-advisors touched on the fact that Wealth Management has been an area largely untouched by automation as far as […]

I am super excited to be in my first month at Hortonworks, heading up the product and solutions marketing team. In addition to joining a superstar team, I am joining one of the leading innovators in the modern data landscape. My love affair with all things data started in the early 1990s when I […]

Hortonworks DataFlow has seen great success in deployments across multiple use cases. We recently shared a set of real-world use cases in a webinar and wanted to share them here as well, so readers can peruse which types of use cases are being implemented and see if there are parallels between current and future users of […]

Wealth Management is the highest-growth business for any medium to large financial institution. It is also the highest customer-touch segment of banking and is built on long-term (read: extremely lucrative advisory) relationships. This three-part series explores the automated “Robo-advisor” movement. In the first post, we will cover the business background and some definitions. The second post will focus on the […]

It’s no secret that there is a data explosion. A recent IDC analyst report from April 2014 indicated the volume of data, known as the digital universe, is doubling in size every two years. And by 2020, there will be as many digital bits as there are stars in the universe. There are many reasons […]

Guest author: Jeff Kelly, Data Strategist, Pivotal

The phrase “digital transformation” gets bandied about a lot these days, but what exactly does it mean? When you strip away the hyperbole, I believe digital transformation is the process by which enterprises evolve from using traditional information technology to merely support existing business models to adopting modern […]

Provenance, Lineage & Chain of Custody

The models of Provenance, Lineage and Chain of Custody are used in fine art to determine when a piece was created, the sequence of locations where it was held, how it was touched along the way, and who has owned it since creation, all with the purpose of authenticating the piece. […]

The first post in this three-part series on Digital Foundations introduces the concept of Customer 360 or Single View of Customer (SVC). We will discuss the need for and the definition of the SVC as part of the first step in any Digital Transformation endeavor. We will also discuss specific benefits from both a […]

People often think about cloud architecture in simplistic terms: you’re either public, private, or hybrid. (In fact, there’s even confusion about the meaning of the term “hybrid” itself; this video helps clear it up.) In the real world, of course, virtually every implementation is hybrid: no company puts 100% of its IT environment into one single cloud. […]

The 100% open source and community-driven innovation of Apache Hive 2.0 and LLAP (Live Long and Process) truly brings agile analytics to the next level. It enables customers to perform sub-second interactive queries without the need for additional SQL-based analytical tools, enabling rapid analytical iterations and providing significant time-to-value. TRY HIVE LLAP TODAY Read about […]

Apache Hive™ is the most complete SQL on Hadoop system, supporting comprehensive SQL, a sophisticated cost-based optimizer, ACID transactions and fine-grained dynamic security. Though Hive has proven itself on multi-petabyte datasets spanning thousands of nodes, many interesting use cases demand more interactive performance on smaller datasets, requiring a shift to in-memory. Hive 2 marks the […]

The Financial Regulators Are Driving a Data Evolution

Traditionally, technology moves fast and regulators react slowly. When technology leaps forward, it enables financial firms to change the nature of their business, often into unregulated territory; regulators then pass regulation to catch up. This model can work in slow-moving markets, but in today’s interconnected […]