The Hortonworks Blog

Today we announced the expansion of our strategic relationship with HP enabling HP to resell Hortonworks Data Platform (HDP). As data volumes grow and new data sources emerge it is important for enterprises have access to production ready enterprise Apache Hadoop to meet their big data needs.

With HDP, HP customers can now seamlessly incorporate Hadoop into their modern data architectures to power a variety of new applications and to support existing ones with additional data sources.…

Today our partner Rackspace announced their Big Data solution for dedicated and cloud environments, powered by Hortonworks Data Platform. This collaboration between Hortonworks and Rackspace provides customers a flexible choice of deployment offerings of Apache Hadoop from one of the most trusted vendors in the cloud computing market.

Enterprise adoption of Apache Hadoop

This expanded collaboration is a strong indicator of the ecosystem rallying around Hortonworks Data Platform and our goal at Hortonworks of making Apache Hadoop a core component of the modern data architecture, whether on premise, in a VM, as an appliance, or in the cloud.…

Now that Hortonworks Data Platform 2.0 is GA, you may be looking to migrate your Hadoop stack from another version to take advantage of Hadoop 2’s YARN-based architecture. Fortunately, our Professional Services & Support teams are getting a lot of practice at migration from other distributions as more and more customers turn to 100% enterprise-hardened Apache Hadoop for their big data platform.

While any specific migration may have a few gotchas from a vendor lock-in, or business integration perspective, this high-level process overview is battle tested on large-scale production clusters and we hope it helps you plan for your own migration.…

Behind all the Big Data hype, there is one common thread: Apache Hadoop and its associated components ARE the technology platform of choice. And here at Hortonworks, that’s what we do: Hadoop.

That is also why we are so excited about the incredible growth in customers who have chosen to work with us to ensure their implementation of Hadoop and realize their vision of a modern data architecture.

Here are the key reasons we believe that we can best help your enterprise with Apache Hadoop.…

Today our strategic partner Microsoft announced the General Availability of HDInsight Service: the Azure based enterprise Hadoop cloud service is now ready for production workloads! We are excited to see this service move into general availability as a key milestone in a longstanding engineering partnership between our two organizations.

Application Portability: Deploy Hadoop On-Premise and/or Scale with the Cloud

With HDP for Windows and HDInsight Service there is unprecedented choice for Windows enterprises for their Hadoop deployments.…

I’d like to take a quick moment to welcome Julian Hyde as the latest addition to the Hortonworks engineering team. Julian has a long history of working on data platforms, including development of SQL engines at Oracle, Broadbase, and SQLstream. He was also the architect and primary developer of the Mondrian OLAP engine, part of the Pentaho BI suite.

Julian’s latest role has been as the author and architect of the Optiq project – an Apache licensed open source framework.…

Whew, what a couple of week it had been! We released Hortonworks Data Platform 2.0 to much excitement from our customers, partners and the community at large. With HDP 2.0 and its YARN-based architecture delivered, enterprises can finally realize the vision of gathering all data in Hadoop and interacting with that data in ways that was previously not possible!

We are carrying forward that ebullience through to O’Reilly Strata Conference in New York!…

It’s been a huge couple of weeks for us at Hortonworks HQ. We’ve talked about the GA of Hadoop 2, the subsequent release of Hortonworks Data Platform 2.0, and a little of the future with Apache Storm. We’ve been staggered by the support, goodwill and enthusiasm we’ve seen from you all.

We hope you’re as excited about Hadoop as we are, and we wanted to say thanks to our amazing team, amazing customers, amazing partners and the most amazing community for doing Hadoop with us – THANK YOU.…

We’re continuing our series of quick interviews with Apache Hadoop project committers at Hortonworks.

This week Mahadev Konar discusses Apache Ambari, the open source Apache project to simplify management of a Hadoop cluster.

Mahadev was on the team at Yahoo! in 2006 that started developing what became Apache Hadoop. Since then, he has also held leadership positions in the Apache Zookeeper and Apache Ambari projects. He is an architect and project management committee member for Apache Ambari, Apache ZooKeeper and Apache Hadoop.…

What a difference a year makes! Last Fall Ambari was a nascent Apache project that had recently shipped an inaugural release in the community. Fast forward a bit, at the beginning of this year Ambari shipped what has become the foundation for rapid innovation. Now Ambari has become a key member of the Apache Hadoop project ecosystem and a trusted operational platform for many companies.

Let’s take a brief look at the community’s amazing accomplishments over the past year, and then take some time to look forward.…

The Hadoop Distributed File System is the reliable and scalable data core of the Hortonworks Data Platform. In HDP 2.0, YARN + HDFS combine to form the distributed operating system for your Data Platform, providing resource management and scalable data storage to the next generation of analytical applications.

Over the past six months, HDFS has introduced a slew of major features to HDFS covering Enterprise Multi-tenancy, Business Continuity Processing and Enterprise Integration:

  • Enabled automated failover with a hot standby and full stack resiliency for the NameNode master service
  • Added enterprise standard NFS read/write access to HDFS
  • Enabled point in time recovery with Snapshots in HDFS
  • Wire Encryption for HDFS Data Transfer Protocol

Looking forward, there are evolving patterns in Data Center infrastructure and Analytical applications that are driving the evolution of HDFS.…

This post is the sixth in our series on the motivations, architecture and performance gains of Apache Tez for data processing in Hadoop. The series has the following posts:

Motivation

Tez follows the traditional Hadoop model of dividing a job into individual tasks, all of which are run as processes via YARN, on the users’ behalf – for isolation, among other reasons.…

Today, with overwhelming partner support, we announced GA of Hortonworks Data Platform 2.0 (HDP 2.0).  With 17 certified partners and many more in the works, organizations can confidently get started taking advantage of Hadoop 2.0 its YARN based architecture knowing that the technologies they rely on, run on HDP 2.0.

With a YARN-based architecture that serves as the operating system for Hadoop, HDP 2.0 takes Hadoop beyond single-use, batch processing to a fully functional,  multi-use platform that enables batch, interactive, online and stream processing.…

Typical delivery of enterprise software involves a very controlled date with a secret roadmap designed to wow prospects, customers, press and analysts…or at least that is the way it usually works.  Open source, however, changes this equation.

As described here, the vision for extending Hadoop beyond its batch-only roots in support of interactive and real-time workloads was set by Arun Murthy back in 2008. The initiation of YARN, the key technology for enabling this vision, started in earnest in 2011, was declared GA by the community in the recent Apache Hadoop 2.2 release, and is now delivered for mainstream enterprises and the broader commercial ecosystem with the release of Hortonworks Data Platform 2.0.…

The last couple of weeks have been a period of intense activity around the Apache projects that comprise the Hadoop ecosystem. While most of the headlines were accorded to Apache Hadoop 2 going GA, it would be remiss not to pay attention to the great progress being made in the Apache projects that complement Hadoop.

We have blogged about these over the course of the past week and the list below provides a quick summary of the phenomenal work contributed in the open by the folks driving these diverse and vital communities.…

Go to page:« First...10...1516171819...3040...Last »