Posts by John Kreisa:


Cascading for Hadoop and Hortonworks Data Platform

cascading-logo-315x97Today Concurrent announced that we have certified the Hortonworks Data Platform  against the Cascading application framework. As Hadoop adoption continues to grow more organizations are looking to take advantage of new data types and build new applications for the enterprise. By combining our enterprise-grade data platform and unparalleled growing ecosystem with the power, maturity and broad platform support of Concurrent’s Cascading application framework, we have now closed the modeling, development and production loop for all data-oriented applications.

Cascading and Big Data Applications

For those that aren’t familiar Cascading is the most widely used and deployed application framework for building robust, enterprise Big Data applications on Hadoop. Recognized companies, including The Climate Corporation, eBay, Etsy, FlightCaster, iCrossing, Razorfish, Trulia, TeleNav and Twitter, are using Cascading to streamline data processing, data filtering and workflow optimization for large volumes of unstructured and semi-structured data. Cascading is also at the core of popular language extensions including PyCascading (Python + Cascading), Scalding (Scala + Cascading) and Cascalog (Clojure + Cascading) – open source projects sponsored by Twitter. Cascading has become the most reliable and repeatable way of building and deploying Big Data applications.

Cascading and Hortonworks Data Platform

HDP is the only 100-percent open source ApacheTM Hadoop®-based data management platform. HDP allows users to capture, process and share data in any format and at scale. Built and packaged by the core architects, builders and operators of Hadoop, HDP includes all of the necessary components to manage a cluster at scale and uncover business insights from existing and new big data sources.

Together, with the simplicity and flexibility of Cascading and the reliability and stability of the HDP, companies can rapidly build, test and deploy new data transformation and refinement, data processing, analytics and machine-learning applications. Enterprises can now leverage existing skill sets, core competencies and product investments by carrying them over to HDP via the standards-based technology – Java, ANSI SQL and machine-learning standards. Analysts and data scientists familiar with these can now easily run predictive data models at scale and integrate ETL, data preparation and predictive analytics in the same application, greatly reducing time to production and unlocking access to large Hadoop data sets.

You can read more about Modern Data Architecture with Hadoop here.

10 Reasons To Put “Hadoop Summit 2013” In Your Calendar

hadoop_summit_logoHadoop Summit 2013 in San Jose is approaching quickly and in just a few weeks attendees will have the opportunity to learn all of the up and coming advances in the world of Apache Hadoop and Big Data. You can still register here!

Here are ten great reasons to pencil “Hadoop Summit 2013” into your calendar:

  1. Informative and exciting keynotes
    Keynotes will be given by Jer Thorpe, an artist and educator known for exploring the many-folded boundaries between science, data, art and culture and Merv Adrian, VP of research at Gartner who follows database, big data, NoSQL and adjacent technologies.
  2. Lightning talks
    These quick-hit informative talks will give you a broad perspective on the various applications of Hadoop.
  3. Expert panels
    Live panel discussions with industry leaders will include topics like SQL on Hadoop and Hadoop in the Enterprise.
  4. More than 90 sessions!
    The event will span two full days and will include 90+ sessions and speeches by over 50 organizations.
  5. Meet ups
    Socialize and learn at the pre-conference meet ups and attend the Birds of a Feather sessions, the Big Data Science “Machine Learning Evening”, or the Big Data Camp.
  6. Business use cases and reference architecture
    Get updated with the latest Hadoop use cases and reference architectures to gain insight for your business.
  7. Training classes
    Take a Hadoop Training class and become extra prepared for Hadoop Summit.
  8. First annual Hadoop Summit Bike Ride
    Cruise around the Silicon Valley on two wheels and get some fun exercise in at the first ever Summit bike ride.
  9. Hadoop Summit Party
    Celebrate your newly acquired knowledge and have some post-conference fun at the Tech Museum Summit party. Enjoy the museum’s many interactive exhibits as well as music, food and cocktails.
  10. Community
    Make new connections and share ideas with the rest of the Hadoop community.

Don’t miss this exciting opportunity and register now! See you there!

Boosting Big Data and the Hadoop Ecosystem with Splunk Alliance

SplunkLogoToday we announced a strategic alliance with operational intelligence leader Splunk. We are excited to be strengthening our relationship with Splunk and expanding the Apache Hadoop ecosystem and we expect this to further drive open source innovation. Additionally this alliance is further proof of Hadoop’s maturation as a key component of the next generation enterprise architecture.

One of the key benefits of the partnership is that it enables organizations to easily take advantage of the massive scale out storage and processing capabilities of Apache Hadoop with Splunk Enterprise via Splunk Hadoop Connect, which easily and reliably moves data between Splunk Enterprise and Hadoop.

This capability means the enterprise can easily use Splunk Enterprise to collect machine data from across the enterprise and deliver it to Hadoop for batch analytics. Likewise, the output of Hadoop jobs can be imported into Splunk Enterprise for rapid analysis and visualization.

Visit the Splunk website to learn more about Splunk Enterprise and Splunk Hadoop Connect.

Find out more about how Hadoop and the Hortonworks Data Platform enables next-generation data architecture.

Hadoop, Hadoop, Hurrah! HDP for Windows is Now GA!

HDP for WindowsToday we are very excited to announce that Hortonworks Data Platform for Windows (HDP for Windows) is now generally available and ready to support the most demanding production workloads.

We have been blown away with the number and size of organizations who have downloaded the beta bits of this 100% open source, and native to Windows distribution of Hadoop and engaged Hortonworks and Microsoft around evolving their data architecture to respond to the challenges of enterprise big data.

With this key milestone HDP for Windows offers the millions of customers running their business on Microsoft technologies an ecosystem-friendly Hadoop-based solution that is built for the enterprise and purpose built for Windows. This release cements Apache Hadoop’s role as a key component of the next generation enterprise data architecture, across the broadest set of datacenter configurations as HDP becomes the first production-ready Apache Hadoop distribution to run on both Windows and Linux.

Additionally, customers now also have complete portability of their Hadoop applications between on-premise and cloud deployments via HDP for Windows and Microsofts’s HDInsight Service.

Enterprise Hadoop Momentum

Since its beta availability, we’ve been working with customers across a wide range of industries including automotive, manufacturing, financial services, retail and government. Here are just a few examples of the tremendous opportunity those customers are seeing:

  • Automotive – a major automotive company wants to use HDP on Windows to create a centralized repository for all of the sensor data collected from their cars. The refinement and exploration of the data trends and patterns found through driving habits, maintenance and repair data and myriad other signals will be used to further improve the quality of their cars.
  • Healthcare – a major healthcare applications provider is looking to build the next generation of healthcare apps that integrate patient health record data with clinical study and FDA data so that the customer experience is enriched and provides a higher level of health care services at a lower cost.
  • Financial services – multiple major financial services organizations are looking to create centralized repositories across different divisions enabling them to explore and gain deeper insight into customer risk patterns.
  • Manufacturing – a major manufacturer of electronics will create a centralized repository of machine generated data coming from the production lines and compare and analyze that data with part failure and return data enabling them to identify and predict problems in production and increasing the quality of their products.

This is just a small sample of the emerging use cases for HDP on Windows. You can explore how Hadoop fits into your data architecture here.

Availability & Training

Hortonworks Data Platform for Windows is now available for download at: http://hortonworks.com/download/.

We also have training specifically designed for HDP on Windows, you can get more information here: http://hortonworks.com/hadoop-training/hadoop-on-windows-for-developers/

Hadoop and the Data Warehouse: When to Use Which

As a preview to the April 30th webinar: Hadoop & the Enterprise Data Warehouse: When to Use Which, Chad Meley, Global Director of Marketing at Teradata, interviewed the two luminary speakers, Eric Baldeschwieler (aka “eric14”) and Stephen Brobst, about the purpose of their presentation and what you can expect to take away from their shared experiences.

Chad:  “Eric, in this webinar you’re going to talk about the strategic role of relational big data technologies, which have come under fire in some circles with the rise of Hadoop.  As the Founder & CTO of Hortonworks, and former VP of Hadoop Software Engineering at Yahoo!, why do you feel this is an important message?”

Eric Baldeschwieler (eric14):  “We at Hortonworks are very optimistic about the continued growth of Hadoop, and there’s certainly a lot of media coverage, events, and communities that are aiding adoption and contributing to the future of Hadoop.  I think what is getting lost at this point in time is how Hadoop compliments, rather than replaces, relational data warehouses that are based on massively parallel processing (MPP) architectures.  We have observed that customers do not replace their EDW with Hadoop rather they optimize the use of the EDW and move appropriate workloads to Hadoop. This frees up more appropriate processing cycles for the data warehouse. Each technology has big broad sweet spots and when combined you get a stronger solution. During this webinar we aim to bring some clarity because if they are not used optimally then ultimately that’s bad for customers and the future of Hadoop.  So, it’s not as altruistic as it seems (laughs).”

Chad:  “What are some of the key take-aways?”

Eric Baldeschwieler (eric14):  “With the rise of Hadoop, there are now options that are going to result in use cases that were previously done in Enterprise Data Warehouses that are now better handled in Hadoop resulting from a combination of economics and capabilities;  however, because there’s more data being generated and companies can now capture and analyze multi-structured data in ways that were unthinkable in the past, this will result in bringing structure to key signals in new big data sets that can be cleansed, integrated, and reused for a variety of new analytical use cases where the economics are capabilities favor an MPP RDBMS.  We’ll go into some level of technical depth as to why that is. “

Chad:  “I’ve got to ask, why are you called eric14?”

Eric Baldeschwieler (eric14): “Well the short answer is that there are 14 letters in Baldeschwieler and people find my last name difficult to pronounce so eric14 is easier to say. The longer story is that it goes back to my sisters 3rd grade class where there were two Karen B’s and the teacher shortened it to Karen 14 and Karen 5 so she could call on the two Karen’s. When I encountered the same problem in college while selecting an email address the solution seemed obvious and I’ve used Eric14 ever since.”

Chad:  “Stephen, in this webinar you’re going to talk about how Hadoop has favorably changed the landscape of the enterprise data platform.  As CTO of Teradata, why is this an important message to you?”

Stephen Brobst:  “The emerging big data philosophy is to “keep all data forever” because enterprises know that there is value to be had in these assets.  However, to make this approach financially viable we need to radically change the economics of storing and manipulating huge data volumes.  Hadoop, through a combination of clever engineering and an open source software model, provides the opportunity to deliver a dramatically improved return on investment model as part of an analytic ecosystem addressing value extraction from both traditional and non-traditional data sources.”

Chad:  “What are some of the key take-aways?”

Stephen Brobst:  “I think that the key takeaway is to use the right technology for the problem that you are solving.  No one technology solves all problems well.  A combination of Hadoop technology, relational database technology, and innovative discovery platforms can help optimize value delivery from big data assets within an enterprise.”

Chad:  “You were recently ranked as one of the top 15 CTOs in the world, you have been an author for multiple books and numerous articles in academic and industry journals, you did course and thesis research at MIT, Harvard, and U.C. Berkeley and you have served on two national committees related to science and technology.   Having said that, I’m told that you don’t have a house or apartment, no car, and all of your physical possessions are inside your suitcase.”

Stephen Brobst:  “Is that a question?”

Chad:  “Yes, please elaborate freely on this highly interesting lifestyle decision.”

Stephen Brobst:  “I put much higher value on my relationship to ‘people”’ rather than ‘stuff.’  There are only two kinds of material possessions that I really care about: books and music.  These days, I can carry these possessions in digital form – so it doesn’t take much for me to re-locate my loot from one place to another.  I also thrive on travel.  Teradata has labs and development facilities in multiple locations across the United States, Canada, China, India, Pakistan, and the Philippines.  I am a hands-on CTO and I make a point of spending time with our teams in each of these locations.  We also have customers all over the world and I believe in having direct interaction with these thought leaders to influence our product and deployment strategy rather than designing cool gadgets in a technology vacuum.”

We look forward to joining you at the webinar – you can register here.

Hadoop Summit 2013 Amsterdam – It’s A Wrap!

We want to take a moment to thank everyone who attended the Hadoop Summit in Amsterdam - THANK YOU! With nearly 500 people registered for the event we think we can safely say is was a big success. We’ve had overwhelming support to do it again next year – so watch this space.

The awesome Beurs Van Berlage venue set us up for a series of fantastic conversations and really well attended sessions and talks as Hadoop continues to explode onto the enterprise scene . Outside of the main tracks, there was great attendance for NLHUG and BoF talks, and kudos to the 10 presenters who ran those lightning talks. Finally, the customer panel was also well received, with great practical advice on adopting Hadoop from HSBC, Neustar and eBay.

But of course it wouldn’t be an event without a party, and we had a great time at the Heineken Experience (from what we can remember).  We put some photos on our Facebook page, but @timoelliott did a much better job than us with this fantastic set on Flickr. This one shows the awesome venue:

hadoop summit exhibition hall

So did you enjoy the summit?  Head over to Facebook  and let us know your favorite part and why: keynotes, tracks, lightning talks, the sandbox experience in the dev cafe, or the party.

And here is a tiny selection of some of the most recent Tweets closing out the show:

Hadoop Summit Tweet

Hadoop Summit Tweet

Hadoop Summit Tweet

Hadoop Summit Tweet

With the community voting just about complete - you still have a few hours to take part – for Hadoop Summit San Jose we are barely 3 months away from a whole bunch of new content and connections and we hope you join us there too!

Thanks again!

Getting Ready for The Elephant Party in Europe

We are just under two weeks away from start of the first ever Hadoop Summit Europe and with all of the final preparations being made we thought we would highlight some of the not to be missed activities in and around the event. The event is filling fast but you can still register here.

Here are 10 great reasons to attend!

1)   Great track content – there are 35 informative sessions on Apache Hadoop and related technologies for you to choose from selected by the community and delivered by the experts themselves.

2)   Great keynotes – leading industry analyst Matt Aslett will present the opening keynote and we will also hear from open source veteran Shaun Connolly as well as Hortonworks CTO Eric Baldeschwieler

3)   Hadoop in the Enterprise expert panel – We will have a live panel discussion from industry leaders incuding eBay, HSBC and Neustar discussing how and why they use Apache Hadoop.

4)   Meetups – the NLHUG and other communities will be meeting around the event.

5)   Lightening talks – we’ve got rapid fire content coming to you in the form of community selected lightening talks. These 5 minute sessions will give you a taste of a wide range of technologies and initiatives

6)   It’s Amsterdam – historic, edgy and fun!

7)   Ecosystem – The conference has the support of the broader Hadoop ecosystem so you can come and discuss Hadoop and big data in the solutions showcase.

8)   Community – The Apache Hadoop community is big and getting bigger. Come meet and mingle with other community members to learn about the latest goings on and make new connections.

9)   Get Hadoop certified – Calling all Hadoop Experts! We’re bringing certification to you! If you are ready to take the exam to become a Hortonworks Certified Apache Hadoop Developer (HCAHD) or a Hortonworks Certified Apache Hadoop Administrator (HCAHA).

10)   Get trained on Hadoop – we’ve got a host of classes available during the event to help you learn or sharpen your Hadoop skills. This includes a newly added Applying Data Science class. Check out the classes.

11)  BONUS reason – have a beer on us at the Hadoop Summit Party at the Heineken Experience a cool venue at a historic location.

Register now, don’t miss the party hope to see you there!

Putting the Elephant in the Window

 

For several years now Apache Hadoop has been fueling the fast growing big data market and has become the defacto platform for Big Data deployments and the technology foundation for an explosion of new analytic applications. Many organizations turn to Hadoop to help tame the vast amounts of new data they are collecting but in order to do so with Hadoop they have had to use servers running the Linux operating system. That left a large number of organizations who standardize on Windows (According to IDC, Windows Server owned 73 percent of the market in 2012 – IDC, Worldwide and Regional Server 2012–2016 Forecast, Doc # 234339, May 2012) without the ability to run Hadoop natively, until today.

windoweleWe are very pleased to announce the availability of Hortonworks Data Platform for Windows providing organizations with an enterprise-grade, production-tested platform for big data deployments on Windows. HDP is the first and only Hadoop-based platform available on both Windows and Linux and provides interoperability across Windows, Linux and Windows Azure. With this release we are enabling a massive expansion of the Hadoop ecosystem. New participants in the community of developers, data scientist, data management professionals and Hadoop fans to build and run applications for Apache Hadoop natively on Windows. This is great news for Windows focused enterprises, service provides, software vendors and developers and in particular they can get going today with Hadoop simply by visiting our download page.

This release would not be possible without a strong partnership and close collaboration with Microsoft. Through the process of creating this release, we have remained true to our approach of community-driven enterprise Apache Hadoop by collecting enterprise requirements, developing them in open source and applying enterprise rigor to produce a 100-precent open source enterprise-grade Hadoop platform.

One of our goals at Hortonworks is to make Hadoop and enterprise viable data platform available on as many platforms as possible. In fact, it is already available today in a range of deployment options including: on-premise, virtual, cloud and an appliance. For organizations looking to leverage Apache Hadoop, they now have even more choices of deployment options between Linux and Windows, giving them more freedom to meet their internal policies and standards. For Microsoft Windows customers, they have complete portability of their Apache Hadoop applications between on premise and cloud deployments, as Hortonworks Data Platform for Windows and HDInsight Service on Windows Azure are built on exactly the same code line.

If you are in the SF Bay Area this week, you can talk to us live about the power of the Hortonworks Data Platform for Windows at booth #316 at the Strata Conference, taking place February 26-28 at the Santa Clara Convention Center in Santa Clara, Calif.

 We will also be conducting the webinar, “Unlocking the Other Half: Introduction to Hortonworks Data Platform for Windows,” on Tuesday, March 12 at 10 a.m. PST / 1 p.m. EST.

To register for the webinar, please visit http://info.hortonworks.com/Hortonworks_HDPonWindows_webcast.html.

 

Buzz Growing for Hadoop Summit Europe

We are now less than a month away from the kickoff of Hadoop Summit Europe, taking place March 20-21 in Amsterdam. The excitement from the community is really starting to grow and pass sales are far ahead of where we expected. Much of the buzz is tied directly to the content that will be presented during the conference.

In all, there were be 42 breakout sessions with presenters from more than 20 companies, including representatives from Adobe, eBay, Facebook, HSBC, LinkedIn, Twitter and Yahoo!. We have started to feature interviews with some of the most compelling speakers on the Hadoop Summit website. Those posted thus far include:

  • Clemens Neudecker of the National Library of the Netherlands and Sven Schlarb of the Austrian National Library (interview)
  • Alasdair Anderson of HSBC (interview)
  • Mikhail Petrenko of Adobe (interview)
  • Jason Dai of Intel (interview)
  • Steve Watt of Red Hat (interview)
  • Joydeep Sen Sarma of Qubole (interview)

The breakout sessions are broken down into four tracks, each aimed at providing valuable and educational content to meet the varied needs of the attendees. We recently featured interviews with each of the track chairs in order to provide some insight into the track sessions and the expected takeaways from each. The interviews are available on the Hadoop Summit website and also linked to below:

  • Evert Lammerts, Track Chair, Operating Hadoop (interview)
  • Isabel Drost, Track Chair, Applied Hadoop (interview)
  • Lars George, Track Chair, Integrating Hadoop (interview)
  • Steve Loughran, Track Chair, Hadoop Futures (interview)

We also recently announced the initial set of speakers for the Lightning Round, which will take place during the first evening of the conference. Speakers will have 5 minutes to cover the topics that the community voted as the ones they wanted to learn about during Hadoop Summit.

The list of the initial 8 Lightning Round sessions is available here.

You definitely don’t want to miss this powerful and exciting lineup of speakers, so REGISTER for Hadoop Summit Europe today!!

Why Microsoft is committed to Hadoop and Hortonworks

This guest blog post is from Microsoft’s Dave Campbell providing more details on why they chose Hortonworks for  HDInsights.

Last February at Strata Conference in Santa Clara we shared Microsoft’s progress on Big Data, specifically working to broaden the adoption of Hadoop with the simplicity and manageability of Windows and enabling customers to easily derive insights from their structured and unstructured data through familiar tools like Excel.

Hortonworks is a recognized pioneer in the Hadoop Community and a leading contributor to the Apache Hadoop project, and that’s why we’re excited to announce our expanded partnership with Hortonworks to give customers access to an enterprise-ready distribution of Hadoop that is 100 percent compatible with Windows Server and Windows Azure.  To provide customers with access to this Hadoop compatibility, yesterday we also released new previews of Microsoft HDInsight Server for Windows and Windows Azure HDInsight Service, our Hadoop-based solutions for Windows Server and Windows Azure.

With this expanded partnership, the Hadoop community will reap the following benefits of Hadoop on Windows:

  • Insights to all users from all data: Analyze unstructured Hadoop data with familiar tools like Excel.  Through integration with award-winning Microsoft BI tools such as PowerPivot and Power View,  HDInsight enables analysis of all your data (structured or unstructured), including data on Linux .
  • Enterprise-ready Hadoop with HDInsight: Offering the most reliable, innovative and trusted distribution available.  Microsoft and Hortonworks together deliver tighter security through integration with Windows Server Active Directory, ease of management through System Center integration, and built-in high availability with Hortonworks Data Platform 1.1. Additionally, harness your existing .NET and JavaScript developers with rich developer frameworks that enable them to write and deploy MapReduce jobs.
  • Simplicity of Windows for Hadoop: Microsoft HDInsight Server for Windows Server significantly simplifies setup and provisioning of Hadoop through streamlined packaging.  So, you don’t need to choose and test the right Hadoop projects on your own.  In the cloud, Windows Azure HDInsight Service simplifies deployment so much that you can now setup a 16-node Hadoop cluster in only 10 minutes!  System Center simplifies management through integration with the Apache Ambari project.  With this integration IT Operators can manage their Hadoop clusters side-by-side with their databases, applications and other IT assets on a single glass pane.
  • Extend your data warehouse with Hadoop: HDP 1.1 improves integration of Hadoop with relational Data Warehouses with HCatalog.  This provides SQL-like language access to Hadoop so that customers can enrich their analysis by including insights from Hadoop environments into the Enterprise Data Warehouse and BI systems.  Additionally, Microsoft enables customers to extend their Enterprise Data Warehouses with Hadoop connectors for SQL Server and Parallel Data Warehouse appliance.
  • Seamless Scale and Elasticity of the Cloud: Microsoft offers HDInsight both in the cloud and on-premise, with seamless migration across the two environments based on your needs. The cloud service offers elastic scalability, a simplified deployment and management experience and a low-cost way to experiment with Hadoop. Deploying Microsoft HDInsight Server on Windows Server provides enterprise-class security through integration with Active Directory, simplified management with System Center management and availability with a trusted and reliable Hadoop distribution.

This is a very exciting milestone, and we hope you’ll join us for the ride as we continue partnering with Hortonworks to democratize big data.  Download HDInsight today at Microsoft.com/BigData.

Hadoop & Big Data Seminar, Coming to a City Near You

Do you want to understand how Apache Hadoop can benefit your business? Do you understand the relationship between Hadoop and your Big Data initiatives? Are you struggling to explain the benefits of Hadoop to your management team?

At Hortonworks, we are constantly being asked by business and executive audiences to explain use cases, benefits and components of Hadoop. While the interest in Big Data and Hadoop grows, this urgent and often pressing demand for a map to create value and differentiation amplifies.

Good news, Hortonworks is hosting a half-day seminar series specifically targeted at IT Managers, Directors, and Executives. The focus of these sessions will be “Big Business Value from Big Data and Hadoop.

We are thrilled at the reception these events have already garnered and urge you to register before seats are full. The list of cities and dates include:

  • Seattle – Sept 19
  • Los Angeles – Sept 20
  • Chicago – Sept 25
  • Dallas – Sept 26
  • San Francisco – Sept 27
  • DC – Oct 9
  • New York – Oct 10
  • Boston – Oct 11

REGISTER

We hope to see you there!

Recap of Hadoop Summit 2012

I wanted to take this opportunity to say thanks to the more than 2,200 attendees, speakers and sponsors that helped to make Hadoop Summit 2012 a great success. There was tremendous buzz throughout the conference; exceeding the excitement levels of all past Hadoop conferences. It’s a great indicator for the future of Apache Hadoop and the broader big data ecosystem.

The content from this conference was outstanding, from the opening keynotes to the last round of breakout sessions. I wanted to thank the track chairs (Abhishek Mehta, Ashish Thusoo, Avik Dey, Ben Reed, Peter Sirota and Val Bercovici) for making the hard decisions that led to such an outstanding agenda. I thought the group did a great job selecting the right mix of technical, use case and best practices sessions for developers, operators and analysts. I would also like to thank the more than 110 speakers for putting in the time and effort to share their Apache Hadoop experiences.

All of the sessions at this year’s conference were recorded and we are in the process of editing these videos for placement on the Hadoop Summit website. We have also now posted most of the slides as well. Simply visit the Sessions page to access the slides and recordings.

I am pleased to announce that all of the keynote session recordings are now available. These include compelling presentations from the following speakers:

Geoffrey Moore (author of “Crossing the Chasm” and “Escape Velocity”)

Scott Burke (SVP, Advertising & Data, Yahoo!)

Dr. Philip Shelley (CTO, Sears)

Scott Gnau (VP and GM of R&D, Teradata)

Shaun Connolly (VP of Corporate Strategy, Hortonworks)

Eric Baldeschwieler (CTO, Hortonworks)

Also, if you have not yet seen the introductory video from Hadoop Summit, I strongly encourage you to watch it now (below). I have heard from quite a few folks that this video got them even more excited about the role they have played in the Apache Hadoop ecosystem.

(click HERE for a full screen version on Vimeo)

On behalf of this year’s co-hosts Hortonworks and Yahoo!, let me again thank everyone for their role in making Hadoop Summit 2012 such a success. Because of the emergence of Apache Hadoop as the foundation of the next generation enterprise data architecture, I have no doubt that next year’s conference will be even bigger and better. I can’t wait.

~ John Kreisa

Hortonworks @ TheCUBE

By any measure, last week’s Hadoop Summit was a tremendous success. It brought together more than 2,200 people from throughout the Apache Hadoop ecosystem to share Hadoop knowledge, ideas, best practices, and interesting use cases. It was also a great chance for big data vendors to make announcements and demonstrate new and exciting solutions.

For those of you that missed the conference, or missed a particularly interesting presentation, we have some good news. Each of the 90+ keynotes and breakout sessions were recorded and we will be posting these sessions online at hadoopsummit.org over the coming days once the editing is completed.

In the meantime, I would like to draw your attention to TheCUBE videos featured on SiliconAngle TV. As conference organizers, we were very fortunate to be able to support the team from TheCUBE, including John Furrier (@furrier) and Jeff Kelly (@jeffreyfkelly). They did an outstanding job of streaming interviews with many of the industry thought leaders and providing some excellent insight into the conference happenings for those that could not attend. These sessions are all now available via their website.

Read More

Introducing Hortonworks Data Platform v1.0

I wanted to take this opportunity to share some important news. Today, Hortonworks announced version 1.0 of the Hortonworks Data Platform, a 100% open source data management platform based on Apache Hadoop. We believe strongly that Apache Hadoop, and therefore, Hortonworks Data Platform, will become the foundation for the next generation enterprise data architecture, helping companies to load, store, process, manage and ultimately benefit from the growing volume and variety of data entering into, and flowing throughout their organizations. The imminent release of Hortonworks Data Platform v1.0 represents a major step forward for achieving this vision.

You can read the full press release here. You can also read what many of our partners have to say about this announcement here. We were extremely pleased that industry leaders such as Attunity, Dataguise, Datameer, Karmasphere, Kognitio, MarkLogic, Microsoft, NetApp, StackIQ, Syncsort, Talend, 10gen, Teradata and VMware all expressed their support and excitement for Hortonworks Data Platform.

Those who have followed Hortonworks since our initial launch already know that we are absolutely committed to open source and the Apache Software Foundation. You will be glad to know that our commitment remains the same today. We don’t hold anything back. No proprietary code is being developed at Hortonworks.

Read More

Go to page:12