Meet the Committer: 3 Minutes on Apache HBase with Enis Soztutar

We’re continuing our series of quick interviews with Apache Hadoop project committers at Hortonworks.

This week Enis Soztutar discusses Apache HBase, built for random read/write access to data in billions of rows and millions of columns.

Enis began using Apache Hadoop in 2006. Now, Enis is a Hortonworks engineer and Apache HBase project management chair. He has also been a committer to Apache Hadoop since 2007 and to HBase since 2012.

In this brief video, Enis describes what HBase is, why it was created, and how it works.

What is HBase?

  • A NoSQL (non-relational) database that runs on top of the Hadoop Distributed File System (HDFS)
  • Designed to scale out on commodity hardware
  • Designed to be distributed: file storage can be spread out among an array of independent machines
  • Intended to run on top of Hadoop

Why was HBase created?

  • Apache HBase is an open source implementation of the design described in Google’s Bigtable paper
  • Built for random read/write access to enormous data sets, with billions of rows and millions of columns

How does HBase work?

  • HBase indexes data with a row key, a column key and a time stamp
  • Key/value pairs are stored sorted by key in lexicographic (byte) order, which reads as alphabetical order in this fictional example with only three data elements:
    • “aaa” : “This is the value in the first row”
    • “abc” : “Second row”
    • “zzz” : “A quick brown fox jumps over the lazy dog”
  • Used by many enterprises such as Yahoo!, Facebook and Twitter
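
To make the sorted key/value idea concrete, here is a minimal sketch in Python. It is a toy in-memory model, not the HBase client API: the class `TinySortedStore`, its methods, and the column name `d:text` are all hypothetical names chosen for illustration. It assumes each cell is addressed by a row key, a column key and a time stamp, that cells are kept sorted by row key, and that newer versions of a cell sort ahead of older ones.

```python
from bisect import bisect_left, insort


class TinySortedStore:
    """Toy model of a sorted key/value store (hypothetical, not HBase's API)."""

    def __init__(self):
        self._keys = []    # sorted list of (row, column, -timestamp) tuples
        self._values = {}  # maps the same key tuple to its value

    def put(self, row, column, timestamp, value):
        # Negate the timestamp so newer versions sort before older ones.
        key = (row, column, -timestamp)
        if key not in self._values:
            insort(self._keys, key)
        self._values[key] = value

    def get(self, row, column):
        """Return the newest value stored for (row, column), or None."""
        i = bisect_left(self._keys, (row, column, float("-inf")))
        if i < len(self._keys) and self._keys[i][:2] == (row, column):
            return self._values[self._keys[i]]
        return None

    def scan(self, start_row, stop_row):
        """Yield (row, column, timestamp, value) for start_row <= row < stop_row."""
        i = bisect_left(self._keys, (start_row,))
        while i < len(self._keys) and self._keys[i][0] < stop_row:
            row, column, neg_ts = self._keys[i]
            yield row, column, -neg_ts, self._values[self._keys[i]]
            i += 1


# Insert the three fictional rows out of order; the store keeps them sorted.
store = TinySortedStore()
store.put(b"zzz", b"d:text", 1, b"A quick brown fox jumps over the lazy dog")
store.put(b"aaa", b"d:text", 1, b"This is the value in the first row")
store.put(b"abc", b"d:text", 1, b"Second row")
store.put(b"abc", b"d:text", 2, b"Second row, updated")

newest = store.get(b"abc", b"d:text")
rows_in_range = [row for row, _, _, _ in store.scan(b"a", b"b")]
```

Because the keys are kept in order, a random read is a binary search and a range scan is a walk over neighbouring entries. That is the property the sorted layout is meant to provide; a real deployment spreads the sorted key space across many machines, which this sketch does not attempt to model.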

Watch the Hortonworks blog for a technical post about the upcoming release of HBase version 0.96.

In the meantime, learn more about HBase here or at the Apache HBase project site.

Also, take a look at past Hortonworks blog posts discussing HBase.
