September 04, 2014

Informatica and Big Data Innovations for the Modern Data Architecture

In this partner guest blog, John Haddad, senior director of product marketing at Informatica, explains how the Stinger project’s key innovations in Enterprise SQL at Hadoop scale will augment Informatica’s Big Data Edition integration with Hortonworks’ Modern Data Architecture.

Informatica is excited about the new innovations Hortonworks is including in the Stinger project, such as Hive transactions, Hive-Spark integration, and sub-second queries. The Informatica Big Data Edition helps our customers take advantage of these innovations without having to rebuild their data pipelines for Big Data analytics. This is achieved through a visual development environment and a metadata-driven approach built on the Informatica Vibe virtual data machine, which runs natively in Hadoop. For example:

  • We expect that the enhancements to Tez, block-based processing, and support for Spark will improve performance for specific data integration and data quality transformations.
  • The introduction of ACID support with Stinger will simplify the implementation of update strategies on Hadoop.
  • Sub-second SQL queries on Hadoop will make it easier for analytics teams to do visualizations and data analysis.

These are just some of the key benefits as the Stinger project continues to evolve. Our joint customers expect corporate IT standards for governance, security, and reliability to extend throughout the Modern Data Architecture as they look to complement their existing information management infrastructure with Hadoop. Transaction-level support (i.e., ACID), sub-second queries, and continuous performance improvements are major milestones for applications running on Hadoop, such as the Informatica Big Data Edition. Informatica running on Hortonworks ensures our customers can implement a Modern Data Architecture (see Figure 1) that supports the demands of enterprise-ready deployments with regard to performance, scalability, security, and governance.


Figure 1: Informatica + Hortonworks reference architecture for data integration and data quality to support the demands of enterprise-ready deployments.



Dustin Smith says:

This makes me realize that the implications for what tomorrow’s data ecosystems will look like are staggering. The potential for organizations to leverage massive data to drive analytics at incredible speeds, with the whole thing resting on enterprise-ready pipeline infrastructure... simply awesome.
