Hortonworks Announces Apache Spark is ‘YARN Ready’

Apache Spark is Latest Technology to Achieve Certification on YARN—the Architectural Center of Hadoop

PALO ALTO, Calif. - June 26, 2014 - Hortonworks, the leading contributor to and provider of enterprise Apache™ Hadoop®, today announced that Apache Spark is YARN Ready, thereby certifying that it is fully compatible with the Hortonworks Data Platform, the only complete and 100-percent open source Enterprise Hadoop platform.

Apache Spark provides a unique and powerful framework that enables Enterprise Hadoop users to build and execute iterative algorithms for advanced analytics such as clustering and classification of datasets. The Hortonworks YARN Ready Program is the latest addition to the Hortonworks Partner Certification Program. Certifying Apache Spark as YARN Ready helps enterprises run memory- and CPU-intensive Spark applications alongside other workloads on a single Hadoop cluster, avoiding the need to deploy Spark applications in separate, siloed clusters.

“Hortonworks believes that innovation at the core of enterprise Apache Hadoop is key to delivering a fast, scalable and manageable data platform as the foundation of a modern data architecture,” said Shaun Connolly, vice president of strategy at Hortonworks. “Enterprises now view YARN as the architectural center of Hadoop, upon which promising technologies such as Apache Spark can be supported and centrally managed.”

Concurrent with this news, Hortonworks is an inaugural member of the Databricks “Certified Spark Distribution” program. The combination of both programs provides enterprises and the broader Hadoop ecosystem with the assurance that their tools and applications are fully compatible with Apache Spark, Apache Hadoop YARN, and the Hortonworks Data Platform.

“As the company founded by the creators of Spark, we’re committed to ensuring all Spark users have a terrific experience – and we’re thrilled that Hortonworks shares this vision as part of the ‘Certified Spark Distribution’ program,” said Arsalan Tavakoli-Shiraji, Business Development, Databricks. “Additionally, with the designation of Apache Spark as YARN Ready, enterprises can rest assured that Spark can run simultaneously and effectively with other mission-critical applications.”

Hortonworks will be attending the Spark Summit next week, June 30-July 2 at the Westin St. Francis on Union Square in San Francisco. Together with the Hadoop community, Hortonworks will participate in discussions regarding the various applications of Spark and how the project can become even easier for enterprises to adopt.

For more information on Spark integrating with Hortonworks Data Platform, please read the blog post announcing “HDP 2.1 Tech Preview Component: Apache Spark” as well as the blog post announcing Apache Spark as YARN Ready.

Further reading

Apache Spark Launch Blog: http://hortonworks.com/blog/announcing-hdp-2-1-tech-preview-component-apache-spark/

Enterprise Hadoop: www.hortonworks.com/hadoop

Hadoop and a Modern Data Architecture: www.hortonworks.com/mda

YARN Ready: http://hortonworks.com/partners/yarn-ready/

Become a Hortonworks Partner: http://hortonworks.com/partners/become-a-partner/

About Hortonworks

Hortonworks develops, distributes and supports the only 100% open source Apache Hadoop data platform. Our team comprises the largest contingent of builders and architects within the Hadoop ecosystem who represent and lead the broader enterprise requirements within these communities.

The Hortonworks Data Platform provides an open platform that deeply integrates with existing IT investments and upon which enterprises can build and deploy Hadoop-based applications.

Hortonworks has deep relationships with the key strategic data center partners that enable our customers to unlock the broadest opportunities from Hadoop.