The Hortonworks Blog
The following is a guest post from Scott Gnau, President, Teradata Labs
I continue to be astonished by the evolution of Apache Hadoop, the software framework for large scale computing that has flourished thanks to a dynamic open source ecosystem. An army of contributors, including the smart engineers and contributors at Hortonworks, constantly refines Hadoop’s ability to manage massive amounts of data on computer clusters via MapReduce processing and the underlying Hadoop Distributed File System (HDFS).…
We are excited to announce today that Hortonworks is bringing Windows-based Hadoop Operational Management functionality via Management Packs for System Center. These management packs will enable users to deploy, manage and monitor Hortonworks Data Platform (HDP) for both Windows and Linux deployments. The new management packs for System Center will provide management and monitoring of Hadoop from a single System Center Operations Manager console, enabling customers to streamline operations and ensure quality of service levels.…
Four years ago, Arun Murthy entered a JIRA ticket (MAPREDUCE -279) that outlined a re-architecture of the original MapReduce. In the ticket, he outlined a set of capabilities that allowed processes to better share resources and an architecture that would allow Hadoop to extend beyond batch data processing.
It turned out that this ticket was prescient of true enterprise requirements for Hadoop. As enterprise adoption accelerated, it became even clearer that multiple processing models – moving beyond batch – was critical for Hadoop to broaden its applicability for mainstream usage in the modern enterprise architecture.…
Today our partner Teradata announced a new offering called the Teradata Portfolio for Hadoop, which is built upon the 100% open source Hortonworks Data Platform (HDP). The new products and expanded partnership with Hortonworks offers customers a flexible choice of deployment offerings for Apache Hadoop from one of the most trusted vendors in the data management market worldwide.
Teradata have been helping their customers to get more value from their data for more than 30 years so this is a natural next step as organizations are looking to evolve their data architectures to capture net new data sources and create new applications.…
In the last few weeks, we have been getting together a prototype, Hoya, running HBase On YARN. This is driven by a few top level use cases that we have been trying to address. Some of them are:
- Be able to create on-demand HBase clusters easily -by and or in apps
- With different versions of HBase potentially (for testing etc.)
- Be able to configure different Hbase instances differently
- For example, different configs for read/write workload instances
- Better isolation
- Run arbitrary co-processors in user’s private cluster
- User will own the data that the hbase daemons create
- MR jobs should find it simple to create (transient) HBase clusters
- For Map-side joins where table data is all in HBase, for example
- Elasticity of clusters for analytic / batch workload processing
- Stop / Suspend / Resume clusters as needed
- Expand / shrink clusters as needed
- Be able to utilize cluster resources better
- Run MR jobs while maintaining HBase’s low latency SLAs
The Hoya tool is a Java tool, and is currently CLI driven.…
We are delighted to announce a new round of funding led by new investors Tenaya Capital and Dragoneer Investment Group, with participation from our existing investors Benchmark Capital, Index Ventures and Yahoo!.
I could not be more excited about the opportunity in front of us. The market reception of our business model and strategy of ensuring 100% open source Apache Hadoop becomes an enterprise viable data platform is resonating strongly with the market.…
What is the value of Hadoop to your business? What value lies in your big data?
There are a MANY definitions of big data out there. In fact, we have published two of them to our blog alone and I am sure we can dream up of a few more. However, when it comes down to it, our customers know best. After all, they are the users of Hadoop.
New Whitepaper: “Business Value of Hadoop”.…
In case you haven’t heard, Hadoop 2.0 is on the way! There are loads more new features than I can begin to enumerate, including lots of interesting enhancements to HDFS for online applications like HBase. One of the most anticipated new features is YARN, an entirely new way to think about deploying applications across your Hadoop cluster. It’s easy to think of YARN as the infrastructure necessary to turn Hadoop into a cloud-like runtime for deploying and scaling data-centric applications.…
Today Concurrent announced that we have certified the Hortonworks Data Platform against the Cascading application framework. As Hadoop adoption continues to grow more organizations are looking to take advantage of new data types and build new applications for the enterprise. By combining our enterprise-grade data platform and unparalleled growing ecosystem with the power, maturity and broad platform support of Concurrent’s Cascading application framework, we have now closed the modeling, development and production loop for all data-oriented applications.…
Over the past year, customers have told us they want to store all their data in one place and interact with it in multiple ways… they want to use Hadoop, but in order to do so, it needs to extend beyond batch. It also needs to be interactive and real-time (among others).
There are plenty of server and storage options for the wave of data that is being collected and analyzed. New platforms such as Apache™ Hadoop® provide the opportunity to make all the new data types being collected useful. However, like any other platform, performance varies depending on the underlying servers being used. There is great promise in what Hadoop can deliver in terms of business value, and the ecosystem is continuously growing with companies making strides to make Hadoop easier to deploy and manage.…
This week we’re at the Red Hat Summit along with many others enjoying the great discussions within the community. As part of the summit, we are delighted to announce extended collaboration with Red Hat to continue to advance open source big data community projects.
Some details on the the three areas of collaboration forming the announcement:
- Enhancing Apache Ambari to support the management of Hadoop-compatible file systems, such as GlusterFS. With this integration, users will be able to provision, deploy, monitor and manage alternative file systems with Ambari, further cementing Ambari’s position as the standard for Hadoop management.