Apache HBase

A non-relational (NoSQL) database that runs on top of HDFS

Apache™ HBase is a non-relational (NoSQL) database that runs on top of the Hadoop® Distributed File System (HDFS).  It is columnar and provides fault-tolerant storage and quick access to large quantities of sparse data. It also adds transactional capabilities to Hadoop, allowing users to conduct updates, inserts and deletes.

What HBase Does

HBase provides random, real time access to your Big Data. HBase was created for hosting very large tables with billions of rows and millions of columns.

HBase provides the following benefits:

  • Fault tolerant storage for large quantities of data
  • Flexible data model
  • Easy Java API as well as Thrift, or REST gateway APIs
  • Near real-time lookups
  • Atomic and strongly consistent row-level operations
  • Automatic sharding and load balancing of tables
  • Metrics exports via File and Ganglia plugins
  • High availability through automatic failover
  • In-memory caching via block cache and bloom filters
  • Server side processing via filters and co-processors
  • Replication across the data center

How HBase Works

Apache HBase uses Log Structured Merge trees (LSM trees) to store and query the data. It features, compression, in-memory caching, bloom filters, and very fast scans. HBase tables can serve as both the input and output for MapReduce jobs.

Apache Top-Level Project Since
May 2010
Hortonworks Committers & Contributors
9

Try it with Sandbox

Hortonworks Sandbox is a self-contained virtual machine with Apache Hadoop pre-configured alongside a set of hands-on, step-by-step Hadoop tutorials.

Get Sandbox

Recently in the Blog

Get started with Sandbox
Hortonworks Sandbox is a self-contained virtual machine with Apache Hadoop pre-configured alongside a set of hands-on, step-by-step Hadoop tutorials.
Integrate with existing systems
Hortonworks maintains and works with an extensive partner ecosystem from broad enterprise platform vendors to specialized solutions and systems integrators.
Contact Us
Hortonworks provides enterprise-grade support, services and training. Discuss how to leverage Hadoop in your business with our sales team.

Thank you for subscribing!