Extending Apache Hadoop to Millions of New Microsoft Users
Today we announced that we were delivering on our earlier promise to help Microsoft bring Apache Hadoop to Windows. I’m pleased to share that Microsoft, with our collaboration and guidance, has now submitted a series of patches to Apache aimed at overcoming the challenges of running Apache Hadoop in Windows Server environments.
These patches, once vetted and approved by the community, will become part of the core Hadoop code base. They will also become available in the two major Apache Hadoop branches: hadoop-1.0 (the current stable branch, which is available as part of Hortonworks Data Platform v1.0) and hadoop-0.23 (the next generation of Apache Hadoop, which will be available as part of Hortonworks Data Platform v2.0).
In addition, Microsoft and Hortonworks are expanding our technical collaboration to include:
- An enhanced Hive ODBC driver that will enable Hadoop data to be analyzed using familiar tools such as Microsoft Excel and business intelligence (BI) clients such as PowerPivot for Excel.
Why is this announcement significant?
From a Hortonworks perspective, we are obviously pleased that Microsoft chose Hortonworks as their technology collaboration partner for Apache Hadoop. Microsoft identified the tremendous value that comes from working with the team that has been at the core of Apache Hadoop development since the beginning. This, however, is not what makes the announcement significant.
We have stated on many occasions our vision that Apache Hadoop will process half of the world’s data within the next five years (or less). The Microsoft contributions are a very important step in making that vision a reality.