Data Governance Initiative

Ensuring a common approach to data governance across all systems and data

Enterprises that include Hadoop in their modern data architecture must address certain realities when bringing legacy and new data from disparate platforms under management in their cluster.

The Data Governance Initiative is working to develop an extensible foundation that addresses enterprise requirements for comprehensive data governance and assures that Hadoop:

  • Snaps into existing frameworks to openly exchange metadata
  • Addresses enterprise data governance requirements within its own stack of technologies

Data Governance Initiative Diagram

The DGI solution will feature deep integration with Apache Falcon for data lifecycle management and Apache Ranger for centralized security policies. It will also interoperate with and extend existing third-party data governance and management tools by shedding light on the data access patterns within the Hadoop cluster.

In addition to Hortonworks, the members of DGI are enterprise Hadoop users Aetna, Merck, Schlumberger, and Target and also Hortonworks’ technology partner SAS.

White Paper
Protect the value of information assets and manage risk with effective governance tools and data architecture
Hortonworks Data Platform
The Hortonworks Data Platform is a 100% open source distribution of Apache Hadoop that is truly enterprise grade having been built, tested and hardened with enterprise rigor.
Get started with Sandbox
Hortonworks Sandbox is a self-contained virtual machine with Apache Hadoop pre-configured alongside a set of hands-on, step-by-step Hadoop tutorials.
Modern Data Architecture
Tackle the challenges of big data. Hadoop integrates with existing EDW, RDBMS and MPP systems to deliver lower cost, higher capacity infrastructure.