Get fresh updates from Hortonworks by email

Once a month, receive latest insights, trends, analytics information and knowledge of Big Data.


Sign up for the Developers Newsletter

Once a month, receive latest insights, trends, analytics information and knowledge of Big Data.


Get Started


Ready to Get Started?

Download sandbox

How can we help you?

* I understand I can unsubscribe at any time. I also acknowledge the additional information found in Hortonworks Privacy Policy.
closeClose button

Hortonworks Data Steward Studio

Understand, Secure, and Govern Data Across Enterprise Data Lakes


Data Steward Studio (DSS) is a DataPlane Service that empowers users to understand, secure, and govern data across enterprise data lakes. DSS empowers enterprises to precisely identify and evaluate the integrity of their data in order to securely collaborate and confidently democratize it across the enterprise.

DSS enables enterprises to contextualize knowledge about the data located across hybrid data lakes which empowers them to generate actionable insights and take meaningful actions about their business operations.

video imgvideo button

Data Steward Studio


Discover and classify data across data lakes

DSS features out-of-the-box profilers that can run as a pipeline of operations on data located across multiple data lakes. Customers can install the profiler agent in a data lake and set up a specific schedule to generate various types of data profiles. DSS empowers data stewards to:

  • Understand enterprise data based on sensitivity and distribution characteristics
  • Get visibility into the number of tables that have been added every day
  • Receive operational metrics including the number of partitions, time of creation, table size, number of rows, input and output format

Blog: Understand your hybrid data lakes to exploit their business value!
Blog: Forrester Recognizes Hortonworks as a Strong Performer in Big Data Fabric Wave
Discover Data Sources Across Data Lakes
Understand enterprise data

DSS provides all the metadata associated with a particular data asset tracked by Apache Atlas. With DSS, data stewards are able to:

  • Get end-to-end visibility into data provenance, origin, lineage 
and impact
  • Understand how data is created and modified
  • Visualize upstream lineage and downstream impact
  • Discern how schema or data has evolved over time

Webinar: Path to GDPR Compliance Begins with Data Governance – Live Panel
Understand Enterprise Data
Comply with regulations

DSS displays all the audit events associated with a particular data asset through Apache Ranger. With DSS, internal and external auditors are empowered to:

  • Get visibility into who has accessed which data from a forensic audit or compliance perspective
  • Visualize access patterns, identify anomalies and ensure proper control mechanisms
  • View the most recent raw audit events, as well as summarized views of audits by type of access and access outcome

White Paper: Path to GDPR Compliance Begins with Data Governance
Comply With Regulation
Make trusted data available to business

DSS enables data consumers and stewards to create Asset Collections to group heterogenous data assets based on business definition. Asset Collections can be created based on categories such as customer profiles, sales assets, financials, PII, and HR data. By creating Asset Collections, data stewards and data consumers can:

  • Automate data use, retention, 
and recovery strategies
  • Organize data into asset collections based on business classifications, purpose, protections and relevance
  • Search data in the data lake using tags, attribute facets, or free text
  • Get an overview of data assets within an asset collection through intuitive dashboards

Press Release: Data Steward Studio Helps Enterprises Across Cloud and On-Prem Data Lakes
Make Trusted Data Available to Business