Apache Ambari: Provision, Manage and Monitor Hadoop

A Single Pane of Glass for Hadoop Operations

I recently sat down with Mahadev Konar and Jeff Sposetti to discuss Apache Ambari v1.4.1. Ambari 1.4.1 is a single framework to provision, manage and monitor clusters based on the Hadoop 2 stack, with YARN and NameNode HA on HDFS.

Mahadev is one of the original architects of Apache Hadoop, a co-founder of Hortonworks, and a committer on Apache Ambari and Apache ZooKeeper. Jeff is the Hortonworks product manager focused on Apache Ambari and Apache Falcon.

Together, Mahadev and Jeff explain how Ambari works, innovations included in version 1.4.1, and the future of the Ambari roadmap.

Other highlights include:

  • Recollections of the challenges managing a Hadoop cluster in the early days, before Apache Ambari existed
  • The two core use cases for Ambari: unification of all Hadoop ecosystem projects under a single point of control & a single point of integration for other software vendors developing applications to run in Apache Hadoop
  • The relationship between Ambari & Apache Hadoop YARN
  • How Ambari allows users to manage NameNode HA for cluster stability
  • Future plans for extensibility for new services and apps running in Apache Hadoop YARN and integrated into Ambari
  • Planned improvements to the Ambari user experience
  • How Ambari provides insight into historical cluster performance & operations

Visit our Apache Ambari project page to learn more.

Categorized by :
Ambari Hadoop Operations & Management Sandbox

Leave a Reply

Your email address will not be published. Required fields are marked *

If you have specific technical questions, please post them in the Forums

You may use these HTML tags and attributes: <a href="" title=""> <abbr title=""> <acronym title=""> <b> <blockquote cite=""> <cite> <code> <del datetime=""> <em> <i> <q cite=""> <s> <strike> <strong>

Hortonworks Data Platform
The Hortonworks Data Platform is a 100% open source distribution of Apache Hadoop that is truly enterprise grade having been built, tested and hardened with enterprise rigor.
Get started with Sandbox
Hortonworks Sandbox is a self-contained virtual machine with Apache Hadoop pre-configured alongside a set of hands-on, step-by-step Hadoop tutorials.
Modern Data Architecture
Tackle the challenges of big data. Hadoop integrates with existing EDW, RDBMS and MPP systems to deliver lower cost, higher capacity infrastructure.