
Viewing posts by: Neeraj Sabharwal


In his blog, Tim Hall wrote, “Enterprises are embracing Apache Hadoop to enable their modern data architectures and power new analytic applications. The freedom to choose the on-premises or cloud environments for Hadoop that best meets the business needs is a critical requirement.” One of the choices in deploying Hadoop in the cloud environment is with Microsoft Azure using […]

Mayank Bansal, of eBay, is a guest contributing author of this collaborative blog. This is the fourth post in a series that explores the theme of enabling diverse workloads in YARN. See the introductory post to understand the context around all the new features for diverse workloads as part of Apache Hadoop YARN in HDP. Background: In Hadoop YARN’s […]

Introduction: Multihoming is the practice of connecting a host to more than a single network. This is frequently used to provide network-level fault tolerance: if hosts are able to communicate on more than one network, the failure of one network will not render the hosts inaccessible. There are other use cases for multihoming as […]

The Apache community released Apache Pig 0.15.0 last week. Although there are many new features in Apache Pig 0.15.0, we would like to highlight two major improvements: Pig on Tez enhancements, and using Hive UDFs inside Pig. Below are some details about these important features. For the complete list of features, improvements, and bug fixes, please […]

As businesses continue to create data at an ever-increasing pace, data architectures are strained under the loads placed upon them. Data volumes continue to grow considerably, low-value workloads like ETL consume more and more processing resources, and new types of data can’t easily be captured and put to use. Organizations struggle with escalating costs, increasing […]

Apache Hadoop has emerged as a critical data platform to deliver business insights hidden in big data. Because Hadoop is a relatively new technology, system administrators hold it to higher security standards. There are several reasons for this scrutiny: The external ecosystem of data repositories and operational systems that feed Hadoop deployments is highly dynamic and […]

Hadoop isn’t optional for today’s enterprises—that much is clear. But as companies race to get control over the significantly growing volumes of unstructured data in their organizations, they’ve been less certain about the right way to put Hadoop to work in their environment. We’ve already seen a variety of wrong approaches with proprietary extensions that […]

Over the past two quarters, Hortonworks has been able to attract over 200 new customers. We are attempting to feed the hunger our customers have shown for Hadoop over the past two years. We are seeing truly transformational business outcomes delivered through the use of Hadoop across all industries. The most prominent use cases are […]

Sumeet Kumar Agrawal, principal product manager for the Big Data Edition product at Informatica, is our guest blogger. In this blog, Sumeet explains how Informatica’s Big Data Edition integrates with Tez and allows for significant performance gains. Informatica Big Data Edition’s codeless visual development environment accelerates the ability of enterprises to take advantage of amazing innovations in big […]

Our guest blogger today is Sean Anderson, Manager of Data Service at Rackspace, the managed cloud company. Sean will share with us all the work Rackspace is doing with Hortonworks Data Platform (HDP) for an Enterprise-ready Hadoop solution. Rackspace is excited to be joining the open source data platform community for Hadoop Summit 2015 hosted by […]

Last week, the Apache Slider community released Apache Slider 0.80.0. Although there are many new features in Slider 0.80.0, a few innovations are particularly notable: containerized application onboarding; seamless zero-downtime application upgrade; adding co-processors to app packages without reinstallation; and simplified application onboarding without any packaging requirement. Below are some details about these important features. For the […]

This is a guest blog post from Jerry Megaro, Merck’s Director of Innovation and Manufacturing Analytics. Jerry established the practice of Data Excellence and Data Sciences within the Merck Manufacturing Division and now leads initiatives to transform Merck Manufacturing into a data-driven organization that enhances the company’s performance across the supply chain. Hortonworks experience working […]

As we approach Hadoop Summit in San Jose next week, the debate continues over where Hadoop really is on its adoption curve. George Leopold from Datanami was one of the first to beat the hornet’s nest with his article entitled Gartner: Hadoop Adoption ‘Fairly Anemic’. Matt Asay from TechRepublic and Virginia Backaitis from CMSWire volleyed […]

Today I am excited to announce that we have made a significant expansion of our operations in Australia in response to growing demand for open enterprise Hadoop in Australia and around the APAC region. Focused on Sydney but with the ability to execute across Australia, this year we have hired several senior sales and technical […]

Not a day passes without someone tweeting or re-tweeting a blog on the virtues of Apache Spark. At a Memorial Day BBQ, an old friend proclaimed: “Spark is the new rub, just as Java was two decades ago. It’s a developers’ delight.” Spark as a distributed data processing and computing platform offers much of what […]