Banner

Building Governance for Big Data

As organizations pursue Hadoop initiatives in order to capture new opportunities for data-driven insight, data governance requirements can pose a key challenge. To manage data risk, organizations need a comprehensive and effective way to ensure full visibility, control and compliance for the corporate and customer information in the Data Lake. Recognizing data governance as an essential element of Open Enterprise Hadoop, Hortonworks has collaborated with industry partners to create a flexible, open framework based on metadata and taxonomy to ensure the auditability, transparency, reproducibility and consistency of the Data Lake and the information it contains.

This white paper outlines the metadata-based approach embodied in Apache Atlas, an open source project developed collaboratively by Hortonworks and a diverse group of large enterprises.

Download the White Paper