Bit Refinery is a Hortonworks Technical Partner and recently certified with HDP. Bit Refinery is a VMware© Cloud Infrastructure-as-a-Service (IaaS) provider featuring virtualization technology hosted within their fully redundant virtual data centers. Bit Refinery offers a hosted Hortonworks Sandbox providing an easy way to experience and learn Hadoop with ease. All the tutorials available from the Hortonworks Sandbox work just as if you were running a localized version of the Sandbox.…
The Hortonworks Blog
- Business Values of Hadoop
- Why Hortonworks
- Industry Verticals
- Industry Happenings
- Deployment Options
- Types of Data
This is the third post in a series that explores the theme of enabling diverse workloads in YARN. Our introductory post to understand the context around all the new features for diverse workloads as part of YARN in HDP 2.2, and a related post on CPU scheduling.Introduction
One of the core responsibilities of YARN is monitoring and limiting resource usage of application containers. When it comes to resource management there are two parts:
TU-Automotive Detroit (formerly Telematics Detroit) is the premier industry show focused on connected car and telematics and Hortonworks is proud to be a Platinum Sponsor of the conference. We hope you can visit us at the show, to learn more about Hadoop for the connected car and infotainment in the vehicle.
Hortonworks counts some of the world’s premier automakers among its subscribers, and at TU-Automotive Detroit, on Wednesday June 3, Hortonworks President Herb Cunitz will deliver a keynote presentation Leveraging Telematics Data in a Connected World that will discuss some common automotive use cases.…
Kristen Hardwick, Vice President of Big Data Solutions at Spry, Inc is our guest blogger. In this blog, Kristen shares performance analysis during Spryinc’s evaluation of Apache Hive with Tez as a fast query engine.
In early 2014, Spry developed a solution that heavily utilized Hive for data transformations. When the project was complete, three distinct data sources were integrated through a series of HiveQL queries using Hive 0.11 on HDP 2.0.…
With YARN and HDFS at the architectural center, Hadoop has emerged as a key component of any modern data architecture. Today, enterprises utilize Hadoop to store critical datasets and power many of their critical workloads. With this in mind, the services and data within a Hadoop cluster needed to be highly available in face of failures and continue to function while the upgrading to the latest software version.
With the Hortonworks Data Platform (HDP) 2.2, we have enhanced the core platform packaging to put in place support for rolling upgrades of the HDP stack while the cluster is actively servicing users.…
This is the fourth post in a series that explores the theme of enabling diverse workloads in YARN. See the introductory post to understand the context around all the new features for diverse workloads as part of YARN in HDP 2.2.Introduction
When it comes to managing resources in YARN, there are two aspects that we, the YARN platform developers, are primarily concerned with:
From its beginning in Hadoop 1, all the way to Hadoop 2 today, the compute platform has always supported memory based allocation and isolation.…
All segments of the oil and gas industry are adopting Hadoop, from exploration through to drilling, production, transportation, refining, and retail.
The Hortonworks Oil and Gas team will be demonstrating some of the Hadoop-based advanced analytics applications for the upstream oil and gas industry at PNEC Houston (the International Conference on Petroleum Data Integration, Information, and Data Management) running from May 19-21.A Transformation in O&G
On a daily basis, the geological and geophysical discipline in upstream oil and gas must deal with a significant number of disparate datasets.…
In this guest blog, Sumeet Kumar Agrawal, principal product manager for Big Data Edition product at Informatica, explains how Informatica’s Big Data Edition integrates with Hortonworks’ security projects, and how you can secure your big data projects.
Many companies already use big data technology like Hadoop for their production environments, so they can store and analyze petabytes of data including transactional data, weblog data, and social media content to gain better insights about their customers and business.…
Historically, the strength of a platform lies in the abilities of developers to learn, try, and build against the platform APIs and capabilities. As Apache Hadoop matures as a platform, it’s the creativity and efforts of the developer community that is driving the innovation that makes Hadoop a vibrant and impactful foundation of a modern data architecture.
A successful developer community leads to a successful platform, and at Hortonworks we are committed to reducing the friction to speed up the success of our customers.…
With Apache Hadoop YARN as its architectural center, Apache Hadoop continues to attract new engines to run within the data platform, as organizations want to efficiently store their data in a single repository and interact with it in different ways. As YARN propels Hadoop’s emergence as a business-critical data platform, the enterprise requires more stringent data security capabilities. The Apache Knox Gateway (“Knox”) provides HTTP based access to resources of the Hadoop ecosystem so that enterprises can confidently extend Hadoop access to more users, while maintaining compliance with enterprise security policies.…
Since the launch of Hortonworks Data Platform (HDP) three years ago, we have seen first hand how Enterprises are embracing Apache Hadoop to enable their modern data architecture’s and power new analytics applications. Hadoop is helping organizations transform their business by providing them with a pervasive, enterprise ready data platform to meet their big data challenges.
Apache Hadoop’s ability to process any data (i.e., clickstream, web and social, IoT, etc.) allows an Enterprise to derive insights in ways that were previously either technologically or economically not possible. …
It’s been a busy few weeks here at Hortonworks and much of that busyness comes from all of the things we’ve been doing with our partners. This has been a stretch of time that we’ve affectionately been calling May-magedon with 9 major partner related events in a two and a half week span. We love telling the story of of the transformative nature of Apache Hadoop along with the increasing pervasiveness of enterprise Hadoop driven through a vibrant ecosystem.…
Next week, in Las Vegas, thousands of attendees will join Informatica World to explore just how far data can take them. Many companies already rely on massive volumes of internal and external data to create new insights and build innovative and profitable business models. Where are you on your journey?
To learn more about how Hortonworks and Informatica partner to optimize the entire big data supply chain on Hadoop and can help you turn data into actionable information to drive business value, join the following sessions:
- On Tuesday, May 12, during the Big Data Ready Summit, John Kreisa, VP Strategic Marketing at Hortonworks, will be part of the Succeeding with Big Data and Avoiding the Pitfalls panel.
Two weeks ago, Apache ORC became an Apache top-level project within the Apache Software Foundation (ASF). This step represents a major step forward for the project, and it is representative of its momentum been built by a broad community of developers.What is ORC and why is it useful?
The connected and collected vehicle data, emitted through embedded smart sensors, are transforming the automotive industry. Is this hype or reality?
To discuss the reality of this transformation, to tackle management of streams of data from connected cars, and to share new data architectures that process, manage and analyze volumes of data, automakers and key industry innovators will gather in Berlin for Telematics Berlin 2015 on May 11-12th.Data Deluge
Because legacy architectures have limited capacity to store streams of unstructured and varied data at petabyte scale, lack the ability to analyze data in real-time and offer value and insights, automakers are looking to next generation data platforms.…