The Hortonworks Blog

Did you know every Hortonworks HDP support subscription comes with SmartSense? Advanced Analytics of Diagnostic Data Prevents Issues: SmartSense uses advanced analytics to make suggestions and recommendations based on the deep knowledge of our Hortonworks engineers and committers to prevent issues and improve performance of your HDP cluster. Based on the diagnostic data collected from […]

In this blog, we will discuss SAS® Grid Manager for Hadoop. There are some very compelling reasons to modernize data architectures with Hadoop. Anyone responsible for administering SAS workloads on Hadoop or considering this path should know about SAS Grid Manager for Hadoop. What is SAS Grid Computing? SAS Grid Computing has been offering […]

Hortonworks SmartSense provides proactive recommendations that improve cluster performance, security and operations. As 30% of issues are configuration related, Hortonworks SmartSense makes an immediate impact on Hadoop system performance and availability. Join us on Tuesday, Feb 28, 2017 to learn how SmartSense helps increase the efficiency of Hadoop hardware through customized cluster recommendations. Paul Codding, Product […]

As we kick off the new year I wanted to thank our customers, partners, Apache community members, and of course the amazing Hortonworks team, for an amazing 2016. Let’s take a step back and look at some of the Hortonworks highlights from last year… IN THE ECOSYSTEM there was tremendous acceleration. At the beginning of […]

Originally posted in HCC. 1. Introduction: NiFi is a powerful and easy-to-use technology to build dataflows from diverse sources to diverse targets while transforming and dynamically routing in between. NiFi is packaged in HDF 2.0 which (in addition to bundling Kafka and Storm for a complete data movement platform) pushes NiFi to enterprise […]

Apache Spark has been Open Source’s new kid on the block. Companies are using Spark to develop sophisticated models that would enable them to discover new opportunities or avoid risk. But what does the future, or at least the near future, hold for Spark? In this blog we have outlined five trends we see in […]

The 100% open source and community-driven innovation of Apache Hive 2.0 and LLAP (Live Long and Process) truly brings agile analytics to the next level. It enables customers to perform sub-second interactive queries without the need for additional SQL-based analytical tools, enabling rapid analytical iterations and providing significant time-to-value. TRY HIVE LLAP TODAY Read about […]

Apache Hive™ is the most complete SQL on Hadoop system, supporting comprehensive SQL, a sophisticated cost-based optimizer, ACID transactions and fine-grained dynamic security. Though Hive has proven itself on multi-petabyte datasets spanning thousands of nodes, many interesting use cases demand more interactive performance on smaller datasets, requiring a shift to in-memory. Hive 2 marks the […]

Financial regulators are driving a Data Evolution: Traditionally, technology moves fast and regulators react slowly. When technology leaps forward, it enables financial firms to change the nature of their business, often into unregulated territory; regulators react by passing regulation to catch up. This model can work in slow-moving markets, but in today’s interconnected […]

Hortonworks Big-Data Maturity Scorecard v2.0: The fourth Industrial Revolution is here, and competing to succeed in the 4.0 ‘digital’ world entails making the right decisions, based on data-driven pointers, to successfully implement your strategy. As we work with the entire stack of Fortune 100 organizations, we often see companies, particularly those operating across business lines […]

With the release of Hortonworks 2.5 Sandbox, several exciting new features have been added to Apache Spark and Apache Zeppelin. Apache Spark Updates: One of the most powerful new Hortonworks 2.5 Sandbox features is the ability to run two versions of Spark side by side in the same environment: a Generally Available (GA) Spark 1.6.2 and a […]

It’s never been easier to get started with Apache Hadoop. The Hortonworks Sandbox combines 100% open-source Apache Hadoop and its data access engines (Apache Spark, Apache Hive, Apache HBase, Apache Solr, Apache Pig) with enterprise-grade Operations (Apache Ambari), Security (Apache Ranger and Apache Knox) and Governance (Apache Atlas). The Sandbox also provides tools for DevOps, […]

Five key capabilities for IoT implementations: IoT connected devices are turning up everywhere. Every major communications carrier is offering its own IoT platform. And hundreds of technology companies are offering capabilities for IoT use case implementation. But the crux of the matter is not what options there are, but how to make it all work together. […]

As enterprises around the world bring more of their sensitive data into Hadoop data lakes, balancing the need for democratization of access to data without sacrificing strong security principles becomes paramount. According to a recent research report by Securosis, “Hadoop has (mostly) reached security parity with the relational platforms of old, and that’s saying a […]

A lot has been said about Data Lakes over the past five years. The call to action from our industry to customers was to take all your data-at-rest in databases and warehouses, and add to this the data-in-motion from everything in your ecosystem. Then store all of the resulting terabytes and petabytes in a […]