The Hortonworks Blog

Actuate is a Hortonworks Technology Partner and founded and co-leads the BIRT open source project, which is used by more than 2.5 million developers around the globe and serves as the foundation of Actuate’s commercial offerings. Applications built with BIRT and BIRT iHub deliver more business and consumer insights to more people than all BI companies combined. 

The deployment of Big Data architectures has become more prevalent as organizations realize the power of what Big Data can bring to their businesses and to their profitability. …

Hadoop can be a great complement to existing data warehouse platforms, such as Teradata, as it naturally helps to address two key storage challenges:

The purpose of this article is to detail some of the key integration points and to show how data can be easily exchanged for enrichment between the two platforms.

As a data integrator who is familiar with RDBMS systems and is new to the Hadoop platform, I was looking for a simple way (i.e.…

Ever since I was a kid, I’ve used memorable movie quotes to help people understand a key point in a way that lightens the mood and generates some laughs. If you’re going to work hard, you gotta have fun, right???

“Don’t make me angry… you wouldn’t like me when I’m angry”

The big data market is rife with aspirational marketing misinformation, which among other things causes customer confusion, slows the path to value, and frankly, makes me a little angry.…

With the growing number of large-scale enterprise deployments of big data, certain limitations have become more apparent, bringing to light some weaknesses in this first phase of analytics infrastructures. Hadoop, clearly a very valuable tool for the collection of unstructured data, poses some challenges that need to be overcome for widespread, successful enterprise adoption.

In our upcoming webinar on Tuesday Feb 19 at 10 am PT, we will address these issues and highlight how to solve them using Hortonworks Data Platform and our partner Actian.…

The Call For Abstracts for Hadoop Summit San Jose closes this Friday (2/14). So far, we have received hundreds of amazing submissions from a wide range of contributors, but we want more.

This year is shaping up to be our biggest and best show and we are working hard to make sure the content is awesome.  In order to make sure content is king at this show, we have assembled some of the brightest in the Hadoop industry to serve as Track Chairs and they are assembling some rock star committees as well. …

Earlier this week Microsoft announced via their blog that a new version of Windows Azure HDInsight is available in public preview.

Microsoft recognizes the importance of the technical innovation in and around YARN, as well as Hortonworks' leadership in this area, and we have worked collaboratively to bring Hadoop 2.2 to Azure via our Hortonworks Data Platform 2.0 for Windows release.

Apache Hadoop YARN is the data operating system for Hadoop. It greatly expands the possible applications of this emerging technology by allowing multiple processing frameworks, such as streaming or graph processing, to plug in natively.…

This is the fourth in our series on modern data architectures across industry verticals. Others in the series are:

We’ve probably all heard the famous quote by John Wanamaker, the father of modern advertising: “Half the money I spend on advertising is wasted; the trouble is, I don’t know which half.”

Wanamaker would love Apache Hadoop for retail applications, because it diminishes (or eliminates) the dilemma he described.…

We cannot wait to see you at the Santa Clara Convention for the next few days! Hortonworks will be one of the sponsors at the conference and will be presenting in various sessions. If you’re going to be around, attend one (or all) of our sessions and remember to stop by Booth #811. We have a nice schedule lined up for you and we hope you can join us!

Attend our sessions

At this year’s Strata Santa Clara, Hortonworks will also participate in a number of presentations on all things data. Don’t miss any of them!…

We are excited to announce an expansion of our relationship with open source leader Red Hat to a deeper, more strategic alliance. The main goal is to help organizations adopt enterprise Apache Hadoop more quickly. This is a natural progression of our relationship with Red Hat because we are so closely aligned around a strategy of innovating in the open and applying enterprise rigor to open source software, thereby de-risking it for the enterprise.…

In this post, we will explore how to quickly and easily spin up our own VM with Vagrant and Apache Ambari. Vagrant is very popular with developers as it lets one mirror the production environment in a VM while staying with all the IDEs and tools in the comfort of the host OS.

If you’re just looking to get started with Hadoop in a VM, then you can simply download the Hortonworks Sandbox.…

This article originally appeared at Opensource.com and is reproduced here.

Apache Hadoop and related Apache Software Foundation projects have a rapidly growing feature set, high commit rates, and code contributions arriving from across the globe. However, the number of women developers, committers, and Project Management Committee (PMC) members in this vast and diversified ecosystem is very small. For the Hadoop project alone, only 5% out of 84 committers are women, and this has been the case for more than the past 2 years.…

This guest post is from Steve Ratay, Viewpoint Architect, Teradata Corporation.

Teradata’s Unified Data Architecture is a powerful combination of the Teradata Enterprise Data Warehouse, the Aster Discovery Platform, Apache Hadoop (via the Hortonworks Data Platform) and Teradata Enterprise Management tools in a single architecture. 

If you are a Teradata user managing an Enterprise Data Warehouse or Data Discovery platform, chances are that you are using Teradata Viewpoint, a monitoring and management platform for Teradata Systems. …

I recently sat down with Mahadev Konar and Jeff Sposetti to discuss Apache Ambari v1.4.1. Ambari 1.4.1 is a single framework to provision, manage and monitor clusters based on the Hadoop 2 stack, with YARN and NameNode HA on HDFS.

Mahadev is one of the original architects of Apache Hadoop, a co-founder of Hortonworks, and a committer on Apache Ambari and Apache ZooKeeper. Jeff is the Hortonworks product manager focused on Apache Ambari and Apache Falcon.…

In this post, we’ll walk through the process of deploying an Apache Hadoop 2 cluster on the EC2 cloud service offered by Amazon Web Services (AWS), using Hortonworks Data Platform.

Both EC2 and HDP offer many knobs and buttons to cater to your specific performance, security, cost, data size, data protection and other requirements. I will not discuss most of these options in this blog, as the goal is to walk through one particular path of deployment to get started.…

Xplenty is a Hortonworks Technology Partner offering Hadoop as a service. We invited Yaniv Mor, Co-founder and CEO of Xplenty, to be our guest blogger today, sharing his views on HDP and Hadoop.

We founded Xplenty to make Apache Hadoop easier. A lot easier. We believe Hadoop’s big data revolution should be available for companies of all sizes and intuitive for everyone to use. Whether it’s designing dataflows, setting up clusters, or managing and monitoring them, our platform as a service makes it happen, code-free.…
