March 13, 2019 |

How Data Analysis in Sports Is Changing the Game

February 5, 2019 | Dinesh Chandrasekhar

Measuring the Success of Your Blockchain Implementation

January 29, 2019 | Abhas Ricky

How the Sharing-Economy Business Model Fosters Regulatory Engagement



Any baseball fan knows that data analysis in sports is a big part of the experience. This article looks at how teams, from baseball to football, are using data both on and off the field to change the game.

With many companies spreading their resources thinly between different emerging technologies, a mature approach to evaluating your blockchain implementation is crucial.

The sharing-economy business model promises to revolutionize the concept of capital and ownership. Companies that want to embrace it must form collaborative relationships with governments and regulators, among others.

Hortonworks’ DataWorks Summit 2018 events have offered a unique opportunity for attendees to sample the technology zeitgeist.

There is a common misconception that you need a PhD or a high level of technical expertise to use deep learning, but that’s not necessarily the case. Employees with basic mathematical aptitude can develop deep learning skills.

This is part seven of an ongoing series about the Open Hybrid Architecture Initiative. You can learn more about the vision, key tenets, real-world use case, new storage environment of O3, participation in the Cloud Native Computing Foundation, and running stateful containers on YARN by reading blogs from earlier in the series. In today’s […]

It is that time of the year – to call out predictions and trends for the year. The two hot areas that are enabling digital transformation across all industry verticals are IoT and edge computing. Let us look at what to expect in 2019 and beyond for IoT. In this post, I am going to […]

Insurance risk assessment relies on having a rich history of data. The more we know about what happened in the past, the more we can predict risk in the future. But when it comes to the risks and liabilities of connected devices, there isn’t a past to look back on. How can insurers adapt?

At DataWorks Summit in Barcelona, Spain in March 2019, we are pleased to have two innovators and industry leaders, Commerzbank and Airbus, share their digital transformation strategies. Commerzbank’s digital transformation strategy is about developing its multi-channel approach to helping global customers. By updating all its processes, designing new products and services to better […]

It’s not just a new year. It’s a new era. Yesterday we were Hortonworks. Today, with the formal completion of our merger, we are Cloudera, which is now the second-largest open source software company in the world. My personal journey in the Apache Hadoop ecosystem started in early 2006. It all started with […]

1. Motivation: The HiveWarehouseConnector (HWC) is an open-source library which provides new interoperability capabilities between Hive and Spark. In practice, Hive and Spark are often leveraged together by companies to provide a scalable infrastructure for data warehousing and data analytics. However, as they both continue to expand their capabilities, interoperability between the two becomes difficult. […]

Organizations commonly use a plethora of data storage and processing systems today. These different systems offer cost-effective performance for their respective use cases. Besides traditional RDBMSs such as Oracle DB, Teradata, or PostgreSQL, teams use Apache Kafka for streams and events data, Apache Druid for real-time series data, and Apache Phoenix for quick index lookups. […]

(This blog post is coauthored by Xun Liu and Quan Zhou from Netease.) Introduction: Hadoop is the most popular open source framework for the distributed processing of large, enterprise data sets. It is heavily used in both on-prem and on-cloud environments. Deep learning is useful for enterprise tasks in the field of speech recognition, image classification, […]

We are excited to announce the release of the first Hortonworks Data Platform (HDP) 3 Sandbox. The Hortonworks sandbox is a great way to test drive some of the latest features found in HDP 3. The sandbox, a single node environment, is packed with 100% open-source Apache Projects that will allow you to explore Big […]

Our last few blogs as part of the Kafka Analytics blog series focused on the addition of Kafka Streams to HDP and HDF and how to build, secure, and monitor Kafka Streams apps / microservices. In this blog, we focus on the SQL access pattern for Kafka with the new Kafka Hive Integration work. Kafka SQL […]
