The Hortonworks Blog

Posts categorized by: Hadoop in the Enterprise

Today our strategic partner Microsoft announced the General Availability of HDInsight Service: the Azure-based enterprise Hadoop cloud service is now ready for production workloads! We are excited to see this service move into general availability as a key milestone in a longstanding engineering partnership between our two organizations.

Application Portability: Deploy Hadoop On-Premise and/or Scale with the Cloud

With HDP for Windows and HDInsight Service there is unprecedented choice for Windows enterprises for their Hadoop deployments.…

Whew, what a couple of weeks it has been! We released Hortonworks Data Platform 2.0 to much excitement from our customers, partners and the community at large. With HDP 2.0 and its YARN-based architecture delivered, enterprises can finally realize the vision of gathering all data in Hadoop and interacting with that data in ways that were previously not possible!

We are carrying forward that ebullience through to the O’Reilly Strata Conference in New York!…

What a difference a year makes! Last Fall Ambari was a nascent Apache project that had recently shipped an inaugural release in the community. Fast forward a bit, at the beginning of this year Ambari shipped what has become the foundation for rapid innovation. Now Ambari has become a key member of the Apache Hadoop project ecosystem and a trusted operational platform for many companies.

Let’s take a brief look at the community’s amazing accomplishments over the past year, and then take some time to look forward.…

The Hadoop Distributed File System is the reliable and scalable data core of the Hortonworks Data Platform. In HDP 2.0, YARN + HDFS combine to form the distributed operating system for your Data Platform, providing resource management and scalable data storage to the next generation of analytical applications.

Over the past six months, HDFS has gained a slew of major features covering Enterprise Multi-tenancy, Business Continuity Processing and Enterprise Integration:

  • Enabled automated failover with a hot standby and full stack resiliency for the NameNode master service
  • Added enterprise standard NFS read/write access to HDFS
  • Enabled point in time recovery with Snapshots in HDFS
  • Added wire encryption for the HDFS Data Transfer Protocol

Looking forward, there are evolving patterns in Data Center infrastructure and Analytical applications that are driving the evolution of HDFS.…

Today, with overwhelming partner support, we announced GA of Hortonworks Data Platform 2.0 (HDP 2.0). With 17 certified partners and many more in the works, organizations can confidently get started taking advantage of Hadoop 2.0 and its YARN-based architecture, knowing that the technologies they rely on run on HDP 2.0.

With a YARN-based architecture that serves as the operating system for Hadoop, HDP 2.0 takes Hadoop beyond single-use, batch processing to a fully functional, multi-use platform that enables batch, interactive, online and stream processing.…

Today we are proud to announce the delivery of Apache Ambari 1.4.1. Ambari 1.4.1 combines many months of work in the community advancing the Ambari codebase. Over 760 JIRAs have been resolved since the Ambari 1.2.5 release. We would like to thank the nearly 40 engineers who contributed to help make this release possible.

Hello Hadoop 2, Meet Apache Ambari
The most important addition to Ambari 1.4.1 is support for installing, managing and monitoring a cluster based on the Hadoop 2 stack.…

This post’s Principal Author: Ming Ma, Software Development Manager, eBay.  With contribution from Mayank Bansal (eBay), Devaraj Das (Hortonworks), Nicolas Liochon (Scaled Risk), Michael Weng (eBay), Ted Yu (Hortonworks), John Zhao (eBay)

eBay runs Apache Hadoop at extreme scale, with tens of petabytes of data. Hadoop was created for computing challenges like ours, and eBay runs some of the largest Hadoop clusters in existence.

Our business uses Apache HBase to deliver value to our customers in real time, and we are sensitive to any failures because prolonged recovery times significantly degrade site performance and result in material loss of revenue. …

Today, we are pleased to announce a strategic alliance between Hortonworks and SAS. Through this alliance we are committing to expand the integration between the SAS business analytics and data management capabilities and the Hortonworks Data Platform (HDP).

By better integrating SAS Business Analytics and HDP, SAS users can easily incorporate Hadoop as a component of their data architecture to capture, process and analyze data of any type and scale. This allows businesses to leverage powerful SAS analytic and data management capabilities across massive data sets, including new data sources that previously could not be captured and analyzed.…

Designed for senior IT executives, IT architects, technology planners, and business technologists, Knowledgent’s three-day facilitated Big Data Immersion workshop, recently held in New York City, provided participants with an intensive deep dive answering the big data questions:

  • Why Big Data? What are the issues that brought it all about?
  • Demystifying Big Data: How can Hadoop help with big data issues?
  • Implementation: How do I operationalize big data? How is big data analytics different?

We’ve been invited to join our partner SAP on October 16 to talk Big Data and how the integrated SAP HANA + Hadoop approach can solve your big data challenges. This chat will be a live Google Hangout with:

  • Irfan Khan, SVP & GM SAP Global Big Data at SAP (@i_kHANA)
  • Ari Zilka, CTO at Hortonworks (@ikarzali)
  • Timo Elliott, Innovation Evangelist at SAP (@timoelliott)

When: Wednesday, October 16, 8am PT / 11am ET / 5pm CET…

This is a guest blog post from our partner, Actuate. They’ve been generous enough to create some great Hadoop tutorials on the Open Source BIRT project that use the Hortonworks Sandbox.

By now, Apache™ Hadoop® has become synonymous with the first stage of Big Data: storing, processing and managing huge volumes and varieties of structured and unstructured data. Yet the data stored by Hadoop remains unreadable to the average business user.…

Thanks to all those who joined in person and virtually for the Apache Ambari Meetup at Hortonworks this week. We talked tech, we saw demos, we laughed, we cried, we ate pizza.

The central theme of the night was the newly added support for Hadoop 2. Ambari now has:

  • Hadoop 2 Stack: Ambari adds support for installing, managing and monitoring a Hadoop 2 Stack.
  • NameNode HA: Ambari configures NameNode High Availability based on the QJM support built into HDFS2.
  • YARN: Ambari manages YARN Service lifecycle and automatically deploys the MapReduce2 framework.

A lot of people ask me: how do I become a data scientist? I think the short answer is: as with any technical role, it isn’t necessarily easy or quick, but if you’re smart, committed and willing to invest in learning and experimentation, then of course you can do it.

In a previous post, I described my view on “What is a data scientist?”: it’s a hybrid role that combines the “applied scientist” with the “data engineer”. …

‘The world is being digitized’ proclaimed Geoffrey Moore in his keynote at Hadoop Summit 2012 over a year ago. His belief is that we are moving away from an analog society, where we collect only casual recordings of events, to one that is digital, where everything is captured. It is our belief that Hadoop is one of the key technologies powering this shift to a digital society.

There is almost an expectation that we capture the pics, vids and conversations that run before us. …

We’ve been hosting a series of webinars focusing on how to make Apache Hadoop a viable enterprise platform that powers modern data architectures.

Implementing a modern data architecture with Hadoop means that it must deeply integrate with existing technologies, leverage existing skills and investments and provide key services. This guest post from David Smith, Vice President of Marketing and Community at Revolution Analytics, shares his perspective on the role of Data Scientists in a Big Data world.…
