The Hortonworks Blog

Today we’re delighted to announce our acquisition of XA Secure to provide comprehensive security capabilities for Enterprise Hadoop. Please join us in welcoming XA Secure to the Hortonworks family.

Register for the Webinar

Hortonworks Data Platform has seen phenomenal adoption across an ever-growing number of organizations. As part of that adoption, and thanks to Apache Hadoop YARN, businesses are moving from single-purpose Hadoop clusters to a versatile, integrated data platform hosting multiple business applications – combining data sets with diverse processing needs in one place.…

Last week Vinay Shukla and Kevin Minder hosted the first of our seven Discover HDP 2.1 webinars. Vinay and Kevin covered three important topics related to new Apache Hadoop security features in HDP 2.1:

  • REST API security with Apache Knox Gateway
  • HDFS security with Access Control Lists (ACLs)
  • SQL security and next-generation Hive authorization

Here is the complete recording of the webinar.

Here are the presentation slides: http://www.slideshare.net/hortonworks/discoverhdp21security

Attend our next Discover HDP 2.1 webinar tomorrow, Thursday, May 15 at 10am Pacific Time: Interactive SQL Query in Hadoop with Apache Hive

We’re grateful to the many participants who joined and asked excellent questions.…

For years, experts in the healthcare industry have been searching for ways to detect (and possibly cure) Alzheimer’s disease, the most common form of dementia. Current estimates indicate that 35.6 million people are living with dementia, projected to jump to 135 million by 2050, according to the Global CEO Initiative on Alzheimer’s Disease. At a projected cost of over $600 billion each year, it’s a looming global health and fiscal crisis.…

Rainstor is a Hortonworks Certified Technology Partner and provides an efficient database that reduces the cost, complexity and compliance risk of managing enterprise data. RainStor’s patented technology enables customers to cut infrastructure costs and scales anywhere; on-premise or in the cloud and natively on Hadoop. RainStor’s customers are 20 of the world’s largest communications providers and 10 of the biggest banks and financial services organizations. 

Rainstor’s Mark Cusack, Chief Architect, writes about the benefits of certification on HDP 2.1.…

Join us for an update on Hortonworks strategy and market perspective, successes and key learnings exclusively for Hortonworks partners and prospective partners. Hortonworks’ executives are hosting this informative afternoon with the goal of helping grow the Hadoop ecosystem and your big data business. The event will take place the day before Summit begins on June 2nd, from 2-5pm at the San Jose Convention Center. Register here, – space is limited.

Note: This is not part of the Hadoop Summit.…

Syncsort is a Hortonworks Certified Technology Partner and has over 40 years of experience helping organizations integrate big data…smarter. Keith Kohl, Director of Product Management, Syncsort, is our guest blogger. Below he talks about the importance of certification and how it benefits Syncsort’s customers and prospects interested in Hadoop.

Back in January, Syncsort announced our partnership with Hortonworks and the certification of DMX-h on HDP 2.0. I was also given the opportunity to write a guest BLOG on the Hortonworks site about HDP 2 and the GA of YARN (thanks Hortonworks!).…

There’s no denying that the information collected by Big Data architectures such as Hortonworks Data Platform (HDP) is revolutionizing how enterprises view and understand their business. The data contains deep insights into many aspects of the business such as sales, customer trends and buying patterns.

The problem has been not only how to extract those insights from the data but how to get it quickly and easily into the hands of the people who need it the most. …

Fino Consulting is a new Consulting and Systems Integration Partner of Hortonworks serving Fortune 1000 companies with winning business solutions through data science. Fino is an early mover in cloud computing, challenging clients to “Re-think what they know about cloud-computing” to build high-performance sustainable applications and stretch the boundaries of enterprise data. Fino uses HDInsight from Microsoft for client solutions because of its versatile, cloud-based data platform that manages data of any type, while leveraging all the features and functionality of Microsoft’s resources.…

I’m a pretty heavy Unix user and I tend to prefer doing things the Unix Way™, which is to say, composing many small command line oriented utilities. With composability comes power and with specialization comes simplicity. Although, sometimes if two utilities are used all the time, sometimes it makes sense for either:

  • A utility that specializes in a very common use-case
  • One utility to provide basic functionality from another utility

For example, one thing that I find myself doing a lot of is searching a directory recursively for files that contain an expression:

find /path/to/root -exec grep -l "search phrase" {} \;

Despite the fact that you can do this, specialized utilities, such as ack have come up to simplify this style of querying.…

Hadoop 2 and its YARN-based architecture has increased the interest in new engines to be run on Hadoop and one such workload is in-memory computing for machine learning and data science use cases. Apache Spark has emerged as an attractive option for this type of processing and today, we announce availability of our HDP 2.1 Tech Preview Component of Apache Spark.  This is a key addition to the platform and brings another workload supported by YARN on HDP.…

The first use of the term BoF session was used at the Digital Equipment Users’ Society (DECUS) conference in the 1960s. Its essence was to bring together like minds and thought leaders—just as birds of the feather flock together— to share and exchange computing ideas, in an informal yet spirited way. Since then, the organizers and sponsors of most computing conferences have been loyal to its essence and spirit.

For ideas and innovation happen in collaboration—not in isolation. …

This is the second in our series on the motivations and architecture for improvements to the Apache Hadoop YARN’s Resource Manager Restart resiliency. Other in the series are:

Introduction: Phase I – Preserve Application-queues

In the introductory blog, we previewed what RM Restart Phase I entails. In essence, we preserve the application-queue state into a persistent store and reread it upon RM restart, eliminating the need for users to resubmit their applications.…

This is the first post in our series on the motivations and architecture for improvements to the Apache Hadoop YARN’s Resource Manager Restart resiliency. Other in the series are:

Resource Manager (RM) is the central authority of Apache Hadoop YARN for resource management and scheduling. It is responsible for allocation of resources to applications like Hadoop MapReduce jobs, Apache TEZ DAGs, and other applications running atop YARN.…

Last week’s release of HDP 2.1 was packed with countless new features for enterprise Hadoop. These included new processing capabilities with Tez and Hive on YARN, Solr and Storm, to operations with Ambari, governance with Falcon and security with Knox.

To guide you through these capabilities, Hortonworks is hosting a new series of webinars beginning on May 8 and running to June 26.

You can join any or all of the webinars listed below, and we’ve provided a simple way of signing up for all 7.…

Hortonworks Data Platform 2.1 for Windows is the 100% open source data management platform based on Apache Hadoop and available for the Microsoft Windows Server platform. I have built a helper tool that automates the process of deploying a multi-node Hadoop cluster – utilizing the MSI available in HDP 2.1 for Windows.

Download HDP 2.1 for Windows

HDP on Windows MSI Overview

HDP on Windows installation package comes in the format of MSI, Microsoft’s MSI format utilizes the installation and configuration service provided with Windows called Windows Installer.…

Go to page:« First...56789...203040...Last »