Get fresh updates from Hortonworks by email

Once a month, receive latest insights, trends, analytics, offering information and knowledge of the Big Data.

cta

Get Started

cloud

Ready to Get Started?

Download sandbox

How can we help you?

closeClose button
Open Source Projects
Schema Registry

Schema Registry

MENU

OVERVIEW

Hortonworks Schema Registry is a shared repository of schemas that allows applications to flexibly interact with each other – in order to save or retrieve schemas for the data they need to access. Having a common Schema Registry provides end to end data governance and introduce operational efficiency by providing reusable schema,defining relationships between schemas and enabling data providers and consumers to evolve at different speed.

What Hortonworks Schema Registry Does?

Hortonworks Schema Repository that allows streaming analytics applications to flexibly interact with each other. This enables users to save, edit, or reuse schemas for the data they need. This also allows easy attachment of schemas to each piece of data without incurring additional overhead for greater operational efficiency. With schema version management, data consumers and data producers can evolve at different rates. And, through schema validation, data quality is greatly improved. Schema Registry is integrated with Apache NiFI and Hortonworks  Streaming Analytics Manager.

Key Features:

Apache NiFi Integration with Schema Registry Functions include:

  • Fetch Schema
  • Serialize/Deserialize with Schema
Streaming Analytics Manager (SAM)  integration with Schema Registry For Example:

  • Every SAM builder component requires a schema to function.
  • SAM’s primary mechanism for connecting to a stream of data is Kafka, but Kafka does not have a schema.
  • When you configure a Kafka source and supply Kafka topic, SAM calls the Schema Registry.
  • Using the Kafka topic as the key, SAM will retrieve the schema.
  • This schema is then displayed on the tile component, and is passed to downstream components.

Hortonworks Focus on Schema Registry

Hortonworks Schema Registry is a shared repository of schemas that allows applications to flexibly interact with each other. Schema Registry is integrated with Apache NiFI and Hortonworks Streaming Analytics Manager.

Following are the key benefits for having a centralized schema registry:

  • End to End Data Governance
    • Centralized registry to provide reusable schema
    • Version management to define relationship between schemas
    • Validation to enable generic format conversion and generic routing
  • Operational Efficiency
    • Centralized registry to avoid attaching schema to every piece of data
    • Version management to enable consumers and producers can evolve at different rates
    • Validation to ensure data quality

Below are some great screen shots showcase how Hortonworks Schema Registry works:

1- Schema Dashboard showcase all the schemas

2- A specific schema that is attached to a Kafka topic

3- How to add a new schema

4- Schema Registry integration with Apache NiFi

Forums

Schema in our Blog