Back in September, we presented a 3-part webinar series on our collaborations with Red Hat. Close to a thousand registrants and attendees participated and provided rich interaction to our series. The content included an overview of our strategic partnership, demonstrated a couple of demos, and provided tutorials to get you started on your Big Data journey with Red Hat and Apache Hadoop.
In this blog, Kenneth Peeples, JBoss technology evangelist and principal marketing manager for Data Virtualization and Fuse Service Works at Red Hat, recaps the webinar series and offers insights into JBoss Data Virtualization and HDP. For those of you who missed one of our webinars (or want to review it), you can find recordings of all sessions on the Red Hat partner page.
I have had the privilege to work with the Hortonworks team designing, building and testing multiple use cases with Red Hat JBoss Data Virtualization (DV) and Hortonworks Data Platform (HDP). We designed and built four use cases that are easy to duplicate and run. During the three-webinar series with Red Hat and Hortonworks, all four use cases were covered. Webinar 2, Discover Red Hat and Apache Hadoop for the Modern Data Architecture, covers use cases 1 and 2, while Webinar 3, Discover Red Hat and Apache Hadoop for the Modern Data Architecture, covers use cases 3 and 4. This blog post covers the first use case and provides the sources and videos for you to try.
Our strategic alliance between Hortonworks and Red Hat makes it easier for organizations to adopt Apache Hadoop in the enterprise. The alliance has three main areas of collaboration:
The primary benefits to the enterprise include:
Several technologies are tightly integrated with HDP – Red Hat Storage, Enterprise Linux, OpenJDK, OpenStack Platform and Data Virtualization. Our use cases and webinars focused on HDP and DV.
There are several benefits to using DV and HDP together.
Before going into the use cases, I want to cover a couple of DV concepts. The upstream community project for DV is Teiid and the Teiid Designer for JBoss Developer Studio. Teiid is a set of open source enterprise information integration tools noted for their ability to rapidly create data services that can quickly adapt to changes in your IT environment. A virtual database (VDB) is a container for components used to integrate data from multiple data sources, so that they can be accessed in an integrated manner through a single, uniform API. A VDB contains models, which define the structural characteristics of data sources, views, and web services. Once a VDB is created, it is deployed to the DV Server. A Translator provides an abstraction layer between Teiid Query Engine and a physical data source that knows how to convert Teiid issued query commands into source specific commands and execute them using the Resource Adaptor. It also has smarts to convert the result data that came from the physical source into a form that Teiid Query engine is expecting. A Resouce Adaptor provides the connectivity to the physical data source. This also gives a native way to issue commands to the source and gather results. A Resource Adaptor can be a RDBMS data source, web service, text file, connection to mainframe or to a custom source you defined. A Translator along with its Resource Adaptor must be configured on a Source Model. Several components worth highlighting for DV are described below:
We designed the material like the how to guide, videos and tutorials to make it easy to see the demonstrations in action as well as allow you to duplicate the demonstrations. The software required to run these examples are:
So let’s go through each use case to provide an overview, more technical detail and references to get you started.
To learn more, listen to the replays of the webinars listed below and look at the Red Hat page on Hortonworks.com: