Another week, another release… Following the release of Apache Hadoop 2.0 beta last week, we are excited to release the beta of Hortonworks Data Platform 2.0, the first commercial release of the stable YARN API and protocols on which new applications can now be built.
The latest product news and company announcements from Hortonworks
It’s my great pleasure to announce that the Apache Hadoop community has declared Hadoop 2.x as Beta with the vote closing over the weekend for the hadoop-2.1.0-beta release.
As noted in the announcement to the mailing lists, this is a significant milestone across multiple dimensions: not only is the release chock-full of significant features (see below), it also represents a very stable set of APIs and protocols on which we can continue to build for the future.…
Today we are delighted to announce the release of Hortonworks Data Platform v1.3 for Windows. With this release, our HDP distributions for Hadoop have reached parity enabling seamless application portability across Linux and Windows platforms.
Hadoop represents the future of the enterprise data platform and we have made it our mission to deliver Hadoop as far and wide as possible: from Linux to Windows, from the Enterprise Data Center to the cloud and we’re very proud of this latest product release as we deliver on that mission.…
Implementing and integrating Hadoop to complement existing EDW, RDBMS and Discovery Systems is all part of realizing a Modern Data Architecture for a business which unlocks the opportunities that big data provides for new insight and competitive edge.
That is why we were excited to take part in Cisco and NetApp’s joint announcement of their FlexPod Portfolio because it brings new engineered offerings to the market for enterprises looking to take advantage of Hadoop.…
I’d like to share some thoughts on the recent news that Eric Baldeschwieler has decided to leave Hortonworks. I’d like to start off first by thanking Eric for his contributions to the Hadoop community since its inception over 7 years ago, and I’d like to express my personal appreciation for his help in getting Hortonworks off the ground.
It’s hard to believe it’s been over two years since Hortonworks was founded by over 20 engineers from the original Yahoo!…
Last week, we published a blog about the Hadoop job marketing and evolving your SQL skills to Hadoop. To help you with that evolution, we’re delighted to offer you some special pricing on training. What better way to stay cool this summer than to be in a nice air-conditioned classroom?
- Looking for a class in the US? Hortonworks – here’s 25% for any class offered by Hortonworks in August. Use this discount code: TCAMSUMMR25
- Do you want to take a class in the UK?
Four years ago, Arun Murthy entered a JIRA ticket (MAPREDUCE -279) that outlined a re-architecture of the original MapReduce. In the ticket, he outlined a set of capabilities that allowed processes to better share resources and an architecture that would allow Hadoop to extend beyond batch data processing.
It turned out that this ticket was prescient of true enterprise requirements for Hadoop. As enterprise adoption accelerated, it became even clearer that multiple processing models – moving beyond batch – was critical for Hadoop to broaden its applicability for mainstream usage in the modern enterprise architecture.…
We are delighted to announce a new round of funding led by new investors Tenaya Capital and Dragoneer Investment Group, with participation from our existing investors Benchmark Capital, Index Ventures and Yahoo!.
I could not be more excited about the opportunity in front of us. The market reception of our business model and strategy of ensuring 100% open source Apache Hadoop becomes an enterprise viable data platform is resonating strongly with the market.…
Over the past year, customers have told us they want to store all their data in one place and interact with it in multiple ways… they want to use Hadoop, but in order to do so, it needs to extend beyond batch. It also needs to be interactive and real-time (among others).
Hortonworkers are out there – here is a rundown of events and meet ups we’ll be at in the next couple of weeks and we hope we’ll see you there. Did we miss any? Want us to attend your event? Let us know!Big Data Innovation Summit
April 10-11, 2013, San Francisco, CA
Spring into April and jump into Big Data! Be sure to meet us at Big Data Innovation Summit by the bay.…
We are very pleased to announce the Alpha 2 release of the Hortonworks Data Platform 2.0 (HDP 2.0 Alpha2) is now available for download!
A key focus in HDP 2.0 Alpha 2 is on performance as announced in the Stinger initiative, and includes a series of enhancements to the performance of Apache Hive for interactive SQL queries. In fact HDP 2.0 Alpha 2 was used to perform the tests announced yesterday, showing a 45X performance increase using Hive. …
We are just under two weeks away from start of the first ever Hadoop Summit Europe and with all of the final preparations being made we thought we would highlight some of the not to be missed activities in and around the event. The event is filling fast but you can still register here.
Here are 10 great reasons to attend!
1) Great track content – there are 35 informative sessions on Apache Hadoop and related technologies for you to choose from selected by the community and delivered by the experts themselves.…
Apache Pig version 0.11 was released last week. An Apache Pig blog post summarized the release. New features include:
- A DateTime datatype, documentation here.
- A RANK function, documentation here.
- A CUBE operator, documentation here.
- Groovy UDFs, documentation here.
And many improvements. Oink it up for Pig 0.11! Hortonworks’ Daniel Dai gave a talk on Pig 0.11 at Strata NY, check it out:
Last week, the HBase community released 0.94.5, which is the most stable release of HBase so far. The release includes 76 jira issues resolved, with 61 bug fixes, 8 improvements, and 2 new features.
Most of the bug fixes went against the REST server, replication, region assignment, secure client, flaky unit tests, 0.92 compatibility and various stability improvements. Some of the interesting patches in this release are: [HBASE-3996] – Support multiple tables and scanners as input to the mapper in map/reduce jobs [HBASE-5416] – Improve performance of scans with some kind of filters.…
As the Release Manager for hadoop-2.x, I’m very pleased to announce the next major milestone for the Apache Hadoop community, the release of hadoop-2.0.3-alpha!2.0 Enhancements in this Alpha Release
This release delivers significant major enhancements and stability over previous releases in hadoop-2.x series. Notably, it includes:
- QJM for HDFS HA for NameNode (HDFS-3077) and related stability fixes to HDFS HA
- Multi-resource scheduling (CPU and memory) for YARN (YARN-2, YARN-3 & friends)
- YARN ResourceManager Restart (YARN-230)
- Significant stability at scale for YARN (over 30,000 nodes and 14 million applications so far, at time of release – see more details from folks at Yahoo!