The Hortonworks Blog

Analysts and data scientists⎯not to mention business executives⎯want Big Data not for the sake of the data itself, but for the ability to work with and learn from that data. As other users become more savvy, they also want more access. But too many inefficient queries can create a bottleneck in the system.

The good news is that Apache™ Hive 0.14—the standard SQL interface for processing, accessing and analyzing Apache Hadoop® data sets—is now powered by Apache Calcite.…

Managing online security for companies is a big task. In a world of increasing cyber threats, the risks to financial organizations are greater than they have ever been. Data breaches result not only in financial loss from data theft and misuse, but in significant reputation damage to the organizations that experience them. How can such organizations quickly and accurately identify risks to protect their data, their assets, and their customers? Threats to your network and vital data sets are constantly evolving to be more sophisticated, which makes them more difficult to detect, especially when you are relying on traditional tools.…

Leading enterprise organizations have concluded that YARN-enabled Hadoop is foundational to their modern data architectures. These companies subscribe with Hortonworks (and implement Hortonworks Data Platform) to bring additional types of data under management, merge those with legacy datasets, and unlock new business insight.

But don’t take our word for it.

Watch these brief videos and hear our customers describe how a data-first approach is transforming their businesses.

Advertising

Luminar is the leading big data analytics and modeling provider uniquely focused on delivering actionable insights on U.S.…

This is the third post in a series exploring recent innovations in the Hadoop ecosystem that are included in Hortonworks Data Platform (HDP) 2.2. In this post, we introduce the theme of supporting rolling upgrades and downgrades of a Hadoop YARN cluster.

HDP 2.2 offers substantial innovations in Apache™ Hadoop YARN, enabling Hadoop users to efficiently store and interact with their data in a single repository, simultaneously using a wide variety of engines.…

Hortonworks provides enterprise Hadoop for the telecommunications service provider, and Hortonworks Data Platform (HDP) is architected from the ground up with the centralized YARN-based architecture and core enterprise services for data governance, security and cluster operations that can revolutionize your telecommunications business.

As the originators of Hadoop, leaders in the developer community, and partners for your success, nobody is better to help you become a data-centric telecommunications enterprise.

Hortonworks supports most of the largest North American carriers.…

As a data scientist working with Hadoop, I often use Apache Hive to explore data, make ad-hoc queries or build data pipelines.

Until recently, optimizing Hive queries focused mostly on data layout techniques such as partitioning and bucketing or using custom file formats.

In the last couple of years, driven largely by the innovation of the Hive community around the Stinger initiative, Hive query time has improved dramatically, enabling Hive to support both batch and interactive workloads at speed and at scale.…

The Apache HBase community has released Apache HBase 1.0.0. Seven years in the making, it marks a major milestone in the Apache HBase project’s development, offers some exciting features and new API’s without sacrificing stability, and is both on-wire and on-disk compatible with HBase 0.98.x.

In this blog, which is a cross post from from Apache HBase Blog, we look at the past, present and future of Apache HBase project.…

In this guest blog, Kumar Srivastava, senior director of product management at ClearStory Data, shares his thoughts on ClearStory’s integration with Hortonworks Data Platform (HDP)

We are excited to be working with and announcing ClearStory Data’s integration with Hortonworks Data Platform (HDP) during Strata + Hadoop World 2015. This partnership with Hortonworks is significant as it brings ClearStory’s business-ready, fast-cycle, scalable analysis on Hadoop Data Lakes and specifically on the Hortonworks Data Platform (HDP).…

This is a unique moment in time. Fueled by open source, Apache Hadoop has become an essential part of the modern enterprise data architecture and the Hadoop market is accelerating at an amazing rate.

The impressive thing about successful open source projects is the pace of the “release early, release often” development cycle, also known as upstream innovation. The process moves through major and minor releases at a regular clip and the downstream users get to pick the releases and versions they want to consume for their specific needs.…

Today we’re excited to be jointly announcing with EMC that the Isilon OneFS file system has been certified to work with the Hortonworks Data Platform (HDP). Now Isilon customers who are looking for a robust, enterprise-ready, stable Apache Hadoop platform can use HDP on their Isilon implementations.

Joint Engineering Delivering Choice

We’re excited to see the results of the months of engineering and testing efforts that now provide customers even greater deployment choice for their Hadoop projects as they are implementing a modern data architecture towards a data lake.…

OspreyData is a Hortonworks® technology partner whose solution is certified both for Hortonworks Data Platform and YARN. The company delivers agile big data analytics solutions for the oil and gas industry. In this blog, Al Brown, CTO at OspreyData, shares his thoughts on how the industry is addressing a big problem: unplanned interruptions to production.

A Mandate for Operational Efficiency and Margin Growth

The oil and gas industry is constantly challenged with a mandate to operate more efficiently—both in the oilfield and within the data center.…

Today Microsoft announced two important new updates to their Azure HDInsight Service with Apache Hadoop 2.6, now available on new clusters.

We are excited to continue to work alongside Microsoft in expanding the deployment options to the Linux Operating System for managed Hadoop as a Service Azure HDInsight clusters. The HDInsight on Linux Preview leverages the completely open Apache Ambari framework to deploy, manage and monitor Hadoop clusters on premise or in the cloud.…

Today, SAS and Hortonworks, two long-time partners and innovators in the Big Data and Analytics space, have announced the certification and release of SAS® Data Loader for Hadoop.

Read the guest blog post below and learn more about SAS and Hortonworks’ joint efforts, thanks to Keith Renison, Senior Solutions Architect for SAS Global Technology Practice.

The New Analytics Culture

Let’s talk about three key elements that drive data management for Hadoop.…

There are lots of ways to interact with Hortonworks at this weeks Strata +Hadoop World event.

Exhibitor Booth 1321

While at our booth you can talk with our experts and get the latest on Hortonworks, get an overview of Apache Hadoop or hear more about how we are helping organizations drive success with Hadoop. You can also get one of the popular Hortonworks elephants!

Passport Program

While at our booth you can pick up a Passport Card to that you can enter for a chance to win some great prizes from one of the 24!…

This guest post is from Gavin Sherry, Vice President of Engineering, Data, at Pivotal. A long time contributor to database technology, Gavin was one of the early contributors to the PostgreSQL project. This led him to join the Greenplum Database R&D team. More recently, Gavin launched Pivotal HAWQ, Pivotal’s SQL on Hadoop engine.

Intro

In the ten years since Hadoop was first conceived at Yahoo!, the big data market has taken off.…