The Hortonworks Blog

Posts categorized by: Administrator

LDAP provides a central source for maintaining users and groups within an enterprise. There are two ways to use LDAP groups within Hadoop. The first is to use OS level configuration to read LDAP groups. The second is to explicitly configure Hadoop to use LDAP-based group mapping.

Here is an overview of steps to configure Hadoop explicitly to use groups stored in LDAP.

  • Create Hadoop service accounts in LDAP
  • Shut down HDFS NameNode & YARN ResourceManager
  • Modify core-site.xml to point to LDAP for group mapping
  • Restart HDFS NameNode & YARN ResourceManager
  • Verify LDAP-based group mapping
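
As a rough illustration of the core-site.xml step above, the following Python sketch renders the kind of properties involved. The property names are Hadoop's LdapGroupsMapping settings; every value (LDAP URL, bind user, password file path, search base) is a hypothetical placeholder, and the helper `render_properties` is introduced here purely for illustration. A real deployment will likely need additional search filter and attribute settings that match its directory schema.

```python
import xml.etree.ElementTree as ET

# Property names are Hadoop's LdapGroupsMapping settings. Every value below is
# a hypothetical placeholder and must be replaced with real connection details.
LDAP_GROUP_MAPPING = {
    "hadoop.security.group.mapping":
        "org.apache.hadoop.security.LdapGroupsMapping",
    "hadoop.security.group.mapping.ldap.url":
        "ldap://ldap.example.com:389",
    "hadoop.security.group.mapping.ldap.bind.user":
        "cn=hadoop,ou=services,dc=example,dc=com",
    "hadoop.security.group.mapping.ldap.bind.password.file":
        "/etc/hadoop/conf/ldap-bind.pw",
    "hadoop.security.group.mapping.ldap.base":
        "dc=example,dc=com",
}


def render_properties(props):
    """Return a <configuration> document with one <property> per entry."""
    root = ET.Element("configuration")
    for name, value in props.items():
        prop = ET.SubElement(root, "property")
        ET.SubElement(prop, "name").text = name
        ET.SubElement(prop, "value").text = value
    return ET.tostring(root, encoding="unicode")


if __name__ == "__main__":
    # Print the fragment so it can be reviewed before merging into core-site.xml.
    print(render_properties(LDAP_GROUP_MAPPING))
```

After the NameNode and ResourceManager are restarted, running `hdfs groups <username>` for a user that exists in LDAP is one way to check that the groups are resolved as expected.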

Prerequisites: Access to LDAP and the connection details are available.…

Luminar is one of Hortonworks’ original customers. Apache Hadoop is a pillar of their modern data architecture, and since choosing Hortonworks in 2012, the Luminar team has become expert users of Hortonworks Data Platform version 1.

They were eager to migrate to HDP2 after it launched in October 2013.

I recently spoke with Juan Manuel Alonso, Luminar’s Manager of Insights. Juan Manuel worked with the Hortonworks professional services team to plan and execute the migration from HDP1 to HDP2.…

Compuware is a Hortonworks Technology Partner and this week announced the availability of the newest release of APM for Big Data. This release provides enhanced support for Hadoop 2.0 and Hortonworks Data Platform (HDP) 2.0.

Compuware’s APM for Big Data now provides greater visibility into Hadoop job details with out-of-the-box dashboards that require no configuration. The graphical dashboards expand insight into Hadoop deployments and make them easier to analyze. With the Hadoop-focused dashboards, customers can get information about any Hadoop cluster and summarized overviews of cluster utilization across users, jobs, pools, queues and more.…

Today, Forrester Research published The Forrester Wave™: Big Data Hadoop Solutions, Q1 2014. While not surprised, we are delighted that this leading analyst firm recognized us as a clear leader in the Hadoop market. We could not be prouder of our unwavering strategy and hard work that is propelling us to the forefront of this burgeoning Hadoop market.

Download and review the report here.

Forrester evaluated nine vendors across a range of criteria, from strategy to product and market presence. We scored a very balanced report across all categories and are way “up and to the right”.…

It gives me great pleasure to announce that the Apache Hadoop community has voted to release Apache Hadoop 2.3.0!

hadoop-2.3.0 is the first release for the year 2014, and brings a number of enhancements to the core platform, in particular to HDFS.

With this release, there are two significant enhancements to HDFS:

  • Support for Heterogeneous Storage Hierarchy in HDFS (HDFS-2832)
  • In-memory Cache for data resident in HDFS via Datanodes (HDFS-4949)

With support for heterogeneous storage classes in HDFS, we now can take advantage of different storage types on the same Hadoop clusters.…

Hadoop Summit Europe in Amsterdam is approaching fast. From Falcons to Pigs, we have a menagerie of meetups covering all things Hadoop – all with fantastic speakers. This year, we’re also delighted to expand the discussion with meetups from Splunk, SAS and Revolution Analytics.

You can sign up for any and all of the meetups below. Remember, these are open to everyone.

Tuesday, April 1st

At the Krasnapolsky Hotel, from 5pm onwards:

  • Reception and Cocktails.

With over 230 JIRA tickets resolved, the Apache HBase community yesterday released 0.98.0, the next major version after the 0.96.x series.

HBase 0.98.0 comes with an exciting set of new features while keeping the stability improvements and features of 0.96. In addition to the usual bug fixes, some of the major improvements include:

  • Reverse Scans (HBASE-4811): for use cases where both forward and reverse iteration are required, HBase now supports scans in reverse mode.

Actuate is a Hortonworks Technology Partner that founded and co-leads the BIRT open source project, which is used by more than 2.5 million developers around the globe and serves as the foundation of Actuate’s commercial offerings. Applications built with BIRT and BIRT iHub deliver more business and consumer insights to more people than all BI companies combined.

The deployment of Big Data architectures has become more prevalent as organizations realize the power of what Big Data can bring to their businesses and to their profitability. …

Hadoop can be a great complement to existing data warehouse platforms, such as Teradata, as it naturally helps to address two key storage challenges:

The purpose of this article is to detail some of the key integration points and to show how data can be easily exchanged for enrichment between the two platforms.

As a data integrator who is familiar with RDBMS systems and is new to the Hadoop platform, I was looking for a simple way (i.e.…

With the growing number of large-scale enterprise deployments of big data, certain limitations have become more apparent, bringing to light some weaknesses in this first phase of analytics infrastructures. Hadoop, clearly a very valuable tool for the collection of unstructured data, poses some challenges that need to be overcome for widespread, successful enterprise adoption.

In our upcoming webinar on Tuesday Feb 19 at 10 am PT, we will address these issues and highlight how to solve them using Hortonworks Data Platform and our partner Actian.…

Earlier this week Microsoft announced via their blog that a new version of Windows Azure HDInsight is available in public preview.

Microsoft recognizes the importance of the technical innovation in and around YARN, as well as Hortonworks’ leadership in this area, and we have worked collaboratively to bring Hadoop 2.2 to Azure via our Hortonworks Data Platform 2.0 for Windows release.

Apache Hadoop YARN is the data operating system for Hadoop and greatly expands the possible applications of this emerging technology by allowing multiple processing frameworks, such as streaming or graph processing, to plug in natively.…

We are excited to announce an expansion of our relationship with open source leader Red Hat to a deeper, more strategic alliance. The main goal is to help organizations adopt enterprise Apache Hadoop more quickly. This is a natural progression of our relationship with Red Hat because we are so closely aligned around a strategy of innovating in the open and applying enterprise rigor to open source software, thereby de-risking it for the enterprise.…

In this post, we will explore how to quickly and easily spin up our own VM with Vagrant and Apache Ambari. Vagrant is very popular with developers as it lets one mirror the production environment in a VM while staying with all the IDEs and tools in the comfort of the host OS.

If you’re just looking to get started with Hadoop in a VM, then you can simply download the Hortonworks Sandbox.…

This guest post is from Steve Ratay, Viewpoint Architect, Teradata Corporation

Teradata’s Unified Data Architecture is a powerful combination of the Teradata Enterprise Data Warehouse, the Aster Discovery Platform, Apache Hadoop (via the Hortonworks Data Platform) and Teradata Enterprise Management tools in a single architecture. 

If you are a Teradata user managing an Enterprise Data Warehouse or Data Discovery platform, chances are that you are using Teradata Viewpoint, a monitoring and management platform for Teradata Systems. …

I recently sat down with Mahadev Konar and Jeff Sposetti to discuss Apache Ambari v1.4.1. Ambari 1.4.1 is a single framework to provision, manage and monitor clusters based on the Hadoop 2 stack, with YARN and NameNode HA on HDFS.

Mahadev is one of the original architects of Apache Hadoop, a co-founder of Hortonworks, and a committer on Apache Ambari and Apache ZooKeeper. Jeff is the Hortonworks product manager focused on Apache Ambari and Apache Falcon.…
