The Hortonworks Blog

It has been another exciting week on Hortonworks Community Connection (HCC). We continue to see great activity and recommend the following assets from last week. Top Articles from HCC: HDF installation on EC2, by mpandit. Hortonworks DataFlow (HDF), powered by Apache NiFi, Kafka and Storm, collects, curates, analyzes and delivers real-time data from the IoAT to […]

Apache Spark 2.0 was released yesterday in the community. This is a long-awaited release that delivers several key features. We are really excited about this release and sincerely thank the Apache Software Foundation and Apache Spark communities for making this release possible. The most notable improvements in this release are in the areas of API, […]

It has been another exciting week on Hortonworks Community Connection (HCC). We continue to see great activity and recommend the following assets from last week. Top Articles from HCC: Phoenix HBase Tuning – Quick Hits, by smanjee. HBase tuning, like any other service within the ecosystem, requires understanding of the configurations and the impact (good or […]

Apache Hive 2.1 was released about a month ago and it’s a great opportunity to review how Hive 2 is drastically changing the landscape for SQL on Hadoop. There is so much new in Hive it’s hard to pick highlights, but here are a few: Interactive query with Hive LLAP. LLAP was introduced in Hive […]

The need to address Business Continuity and Disaster Recovery (BCDR) concerns is well known to anyone who runs production systems. This blog introduces HBase’s new backup and restore capabilities, which give HBase the ability to perform full and incremental backups across clusters and into the cloud. When combined with real-time replication, this new incremental backup […]

Following the success of our sold-out 2015 Roadshow, we are pleased to announce our worldwide Future of Data Roadshow 2016! The Roadshow brings the innovators driving the future of data to you and offers insightful content for both business and technical attendees. This is an invaluable opportunity to network with leaders who are transforming their business […]

It has been another exciting week on Hortonworks Community Connection (HCC). We continue to see great activity and recommend the following assets from last week. Top Articles from HCC: Horses for Courses: Apache Spark Streaming and Apache NiFi, by vvaks. Comparing Apache NiFi and Apache Spark Streaming for different streaming and IoT use cases. Data Analysis […]

Significant Throughput and Latency Gains Between Apache Storm 0.9 and 1.0. The release of version 1.0 marks another major milestone for Apache Storm. Since becoming an Apache project in September 2013, much work has gone into maturing the feature set and also improving performance by reworking or tweaking various components. (See A Brief History of […]

It has been another exciting week on Hortonworks Community Connection (HCC). We have lots of great technical content and are continuing to see great activity. We recommend the following assets from last week: Top Articles from HCC: Disaster recovery and Backup best practices in a typical Hadoop Cluster: Series 1 Introduction, by rbiswas. Disaster recovery plan […]

It has been another exciting week on Hortonworks Community Connection (HCC). We have lots of great technical content and are continuing to see great activity. We recommend the following assets from last week: Top Articles from HCC: Adding KDC Administrator Credentials to the Ambari Credential Store, by rlevas; Rack Awareness, by rbiswas; Spark+Pycharm+Pybuilder on Docker, by smanjee; YARN […]

According to Strategy Meets Action (SMA), the value and disruption do not come from the “things” or the technology itself. New, actionable insights can be gleaned from massive amounts of new data being collected and analyzed. Insurers must build strong enterprise-wide data management and analytics capabilities to be in a position to capitalize on these […]

The first decade is over and we’re entering the second. One industry watcher makes a great point: Awkward teenage years ahead? I don’t believe we’ll be one of those ‘difficult’ teenagers. We might be a bit of a nerd, but we’ll be the well balanced one. The one with friends, the one that goes to […]

This week we made a huge step forward in accelerating genomics-based precision medicine in research and clinical care, starting a consortium of experts and organizations who will help to define the next generation of genomics research. We’ve already been joined by Arizona State University, Baylor College of Medicine, Booz Allen Hamilton, Mayo Clinic, OneOme and […]

Big data is changing the way enterprises interact with and consume data. Modern data platforms, such as Hortonworks Data Platform (HDP) and Hortonworks DataFlow (HDF), are driving a data revolution by powering new workloads and analytic applications. This week, there are thousands of attendees in San Jose at Hadoop Summit 2016 learning about the […]

The most significant new feature in Apache Hive 2, to be included in the upcoming HDP 2.5 release, is a technical preview of LLAP (Live Long and Process). LLAP enables SQL analytics on Hadoop as fast as sub-second by intelligently caching data in memory with persistent servers that instantly process SQL queries. Since LLAP is […]