Hadoop and the Data Warehouse: When to Use Which

As a preview to the April 30th webinar: Hadoop & the EDW: When to Use Which, Chad Meley, Global Director of Marketing at Teradata, interviewed the two luminary speakers, Eric Baldeschwieler (aka “eric14”) and Stephen Brobst, about the purpose of their presentation and what you can expect to take away from their shared experiences.

Chad:  “Eric, in this webinar you’re going to talk about the strategic role of relational big data technologies, which have come under fire in some circles with the rise of Hadoop.  As the Founder & CTO of Hortonworks, and former VP of Hadoop Software Engineering at Yahoo!, why do you feel this is an important message?”

Eric Baldeschwieler (eric14):  “We at Hortonworks are very optimistic about the continued growth of Hadoop, and there’s certainly a lot of media coverage, events, and communities that are aiding adoption and contributing to the future of Hadoop.  I think what is getting lost at this point in time is how Hadoop compliments, rather than replaces, relational data warehouses that are based on massively parallel processing (MPP) architectures.  We have observed that customers do not replace their EDW with Hadoop rather they optimize the use of the EDW and move appropriate workloads to Hadoop. This frees up more appropriate processing cycles for the data warehouse. Each technology has big broad sweet spots and when combined you get a stronger solution. During this webinar we aim to bring some clarity because if they are not used optimally then ultimately that’s bad for customers and the future of Hadoop.  So, it’s not as altruistic as it seems (laughs).”

Chad:  “What are some of the key take-aways?”

Eric Baldeschwieler (eric14):  “With the rise of Hadoop, there are now options that are going to result in use cases that were previously done in Enterprise Data Warehouses that are now better handled in Hadoop resulting from a combination of economics and capabilities;  however, because there’s more data being generated and companies can now capture and analyze multi-structured data in ways that were unthinkable in the past, this will result in bringing structure to key signals in new big data sets that can be cleansed, integrated, and reused for a variety of new analytical use cases where the economics are capabilities favor an MPP RDBMS.  We’ll go into some level of technical depth as to why that is. “

Chad:  “I’ve got to ask, why are you called eric14?”

Eric Baldeschwieler (eric14): “Well the short answer is that there are 14 letters in Baldeschwieler and people find my last name difficult to pronounce so eric14 is easier to say. The longer story is that it goes back to my sisters 3rd grade class where there were two Karen B’s and the teacher shortened it to Karen 14 and Karen 5 so she could call on the two Karen’s. When I encountered the same problem in college while selecting an email address the solution seemed obvious and I’ve used Eric14 ever since.”

Chad:  “Stephen, in this webinar you’re going to talk about how Hadoop has favorably changed the landscape of the enterprise data platform.  As CTO of Teradata, why is this an important message to you?”

Stephen Brobst:  “The emerging big data philosophy is to “keep all data forever” because enterprises know that there is value to be had in these assets.  However, to make this approach financially viable we need to radically change the economics of storing and manipulating huge data volumes.  Hadoop, through a combination of clever engineering and an open source software model, provides the opportunity to deliver a dramatically improved return on investment model as part of an analytic ecosystem addressing value extraction from both traditional and non-traditional data sources.”

Chad:  “What are some of the key take-aways?”

Stephen Brobst:  “I think that the key takeaway is to use the right technology for the problem that you are solving.  No one technology solves all problems well.  A combination of Hadoop technology, relational database technology, and innovative discovery platforms can help optimize value delivery from big data assets within an enterprise.”

Chad:  “You were recently ranked as one of the top 15 CTOs in the world, you have been an author for multiple books and numerous articles in academic and industry journals, you did course and thesis research at MIT, Harvard, and U.C. Berkeley and you have served on two national committees related to science and technology.   Having said that, I’m told that you don’t have a house or apartment, no car, and all of your physical possessions are inside your suitcase.”

Stephen Brobst:  “Is that a question?”

Chad:  “Yes, please elaborate freely on this highly interesting lifestyle decision.”

Stephen Brobst:  “I put much higher value on my relationship to ‘people”’ rather than ‘stuff.’  There are only two kinds of material possessions that I really care about: books and music.  These days, I can carry these possessions in digital form – so it doesn’t take much for me to re-locate my loot from one place to another.  I also thrive on travel.  Teradata has labs and development facilities in multiple locations across the United States, Canada, China, India, Pakistan, and the Philippines.  I am a hands-on CTO and I make a point of spending time with our teams in each of these locations.  We also have customers all over the world and I believe in having direct interaction with these thought leaders to influence our product and deployment strategy rather than designing cool gadgets in a technology vacuum.”

We look forward to joining you at the webinar – you can register here.

Categorized by :
Apache Hadoop Big Data

Leave a Reply

Your email address will not be published. Required fields are marked *

You may use these HTML tags and attributes: <a href="" title=""> <abbr title=""> <acronym title=""> <b> <blockquote cite=""> <cite> <code> <del datetime=""> <em> <i> <q cite=""> <strike> <strong>

Contact Us
Hortonworks provides enterprise-grade support, services and training. Discuss how to leverage Hadoop in your business with our sales team.
Integrate with existing systems
Hortonworks maintains and works with an extensive partner ecosystem from broad enterprise platform vendors to specialized solutions and systems integrators.
Hortonworks Data Platform
The Hortonworks Data Platform is a 100% open source distribution of Apache Hadoop that is truly enterprise grade having been built, tested and hardened with enterprise rigor.

Thank you for subscribing!