The Hortonworks Blog

Posts categorized by : Data Analyst & Scientist

Three weeks ago, we announced availability of the technical preview of Hortonworks Data Platform (HDP) version 2.1 and since then we have had thousands of downloads of this preview.  We also promised delivery of GA bits on April 22nd  and we are delighted to deliver as stated. HDP 2.1, which includes countless new features across seven new components, is available today from our download page

YARN unlocks the Data Lake

YARN, the resource management layer of Hadoop 2 is delivering value as it has unlocked the data lake vision for many.…

The Apache Hive community has voted on and released version 0.13 today. This is a significant release that represents a major effort from over 70 members who worked diligently to close out over 1080 JIRA tickets.

Hive 0.13 also delivers the third and final phase of the Stinger Initiative, a broad community based initiative to drive the future of Apache Hive, delivering 100x performance improvements at petabyte scale with familiar SQL semantics.…

One of the key concerns in the financial industry today is the alarming increase in fraudulent activities.  It is estimated that over $12 billion is spent on fraud detection and prevention and that number is projected to increase significantly over the next few years. Customer data gets compromised and this leads to a decreased level of customer satisfaction and retention, which results in revenue declines for financial organizations.

Join Hortonworks, Skytree and Forrester Research for a Webinar on April 15, 8am PST/11am EST

As financial institutions continue to embrace the adoption of big data infrastructures like the Hortonworks Data Platform based on Hadoop, there is a wealth of information collected that can help with more sophisticated fraud detection. …

We are excited to announce that the Apache™ Tez community voted to release version 0.4 of the software.

Apache Tez is an alternative to MapReduce that provides a powerful framework for executing a complex topology of tasks for data access in Hadoop. Version 0.4 incorporates the feedback from extensive testing of Tez 0.3, released just last month.

This release is especially meaningful because it coincides with completion of the Stinger Initiative (a collaborative community effort involving 145 developers across 44 companies) and the upcoming release of Apache Hive 0.13.…

Today we are proud to announce that the formation of a terrific partnership with LucidWorks to bring search to the Hortonworks Data Platform. LucidWorks delivers an enterprise-grade search development platform built atop the power of Apache Solr.

Shared Vision and New Scenarios

Both LucidWorks and Hortonworks have a shared vision of innovating in open source and delivering it to customers in an enterprise grade platform.

As part of our continuing mission to build the a completely open, versatile enterprise data platform across many data processing scenarios then Solr offers a simple, yet powerful interface providing advanced search capabilities.…

If you’re excited to get started with the new features in Hortonworks Data Platform 2.1, then we’ve included 4 tutorials for you try out – Sandbox-style.

You can download the HDP 2.1 Technical Preview here, and then get stuck into these great tutorials.

Interactive Query with Apache Hive and Apache Tez

OK, so you’re not going to get huge performance out of a one-node VM, but you can try out Hive on Tez, and see the performance gains versus MapReduce, and also try out features such as Vectorized Query, and the host of new SQL features.…

The pace of innovation within the Apache Hadoop community is truly remarkable, enabling us to announce the availability of Hortonworks Data Platform 2.1, incorporating the very latest innovations from the Hadoop community in an integrated, tested, and completely open enterprise data platform.

Download HDP 2.1 Technical Preview Now

What’s In Hortonworks Data Platform 2.1?

The advancements in HDP 2.1 span every aspect of Enterprise Hadoop: from data management, data access, integration & governance, security and operations. …

We are excited to welcome Blackrock and Passport Capital as Hortonworks investors who today led a $100M round of funding with continued participation from all existing investors.

This latest round of funding will allow us to double-down on our founding strategy: to make open source Apache Hadoop a true enterprise data platform. To that end we are focused in two areas:…

1. Lead the innovation of Hadoop. In open source, for everyone.

We are delighted to host this is a guest blog from John Schitka at SAP.

Join us on March 12 to learn how SAP HANA and Hortonworks Data Platform combine to help you achieve Instant Insight and Infinite Scale – Register Here 

Big Data is changing our world – enabling previously impossible insights and transforming the way we do business, work with others, and live our lives. To be competitive you need to lever Big Data and the business value it brings.…

Elasticsearch’s engine integrates with Hortonworks Data Platform 2.0 and YARN to provide real-time search and access to information in Hadoop.

See it in action:  register for the Hortonworks and Elasticsearch webinar on March 5th 2014 at 10 am PST/1pm EST to see the demo and an outline for best practices when integrating Elasticsearch and HDP 2.0 to extract maximum insights from your data.  Click here to register for this exciting and informative webinar!…

Today, the Forrester WaveTM: Big Data Hadoop Solutions, Q1 2014 was published by Forrester Research and while not surprised, we are delighted that this leading analyst firm recognized us as a clear leader in the Hadoop market. We could not be prouder of our unwavering strategy and hard work that is propelling us to the forefront of this burgeoning Hadoop market.

Download and review the report here.

Forrester evaluated nine vendors across a range of criterion from strategy to product and market presence and we scored a very balanced report across all categories and are way “up and to the right”.…

This is the fifth in our series on modern data architectures across industry verticals. Others in the series are:

Consumers have never generated so much data on how they research, discuss and buy products. This new data is valuable for shaping and promoting a brand or product, but it doesn’t line up neatly to fit in pre-defined, tabular formats.…

It gives me great pleasure to announce that the Apache Hadoop community has voted to release Apache Hadoop 2.3.0!

hadoop-2.3.0 is the first release for the year 2014, and brings a number of enhancements to the core platform, in particular to HDFS.

With this release, there are two significant enhancements to HDFS:

  • Support for Heterogeneous Storage Hierarchy in HDFS (HDFS-2832)
  • In-memory Cache for data resident in HDFS via Datanodes (HDFS-4949)

With support for heterogeneous storage classes in HDFS, we now can take advantage of different storage types on the same Hadoop clusters.…

Hadoop Summit Europe in Amsterdam is approaching fast. From Falcons to Pigs, we have a menagerie of meetups covering all things Hadoop – all with fantastic speakers. This year, we’re also delighted to expand the discussion with meetups from Splunk, SAS and Revolution Analytics.

You can sign up for any and all of the meetups below and remember these are open to everyone to attend.

Tuesday, April 1st

At the Krasnapolsky Hotel, from 5pm onwards:

  • Reception and Cocktails.

Actuate is a Hortonworks Technology Partner and founded and co-leads the BIRT open source project, which is used by more than 2.5 million developers around the globe and serves as the foundation of Actuate’s commercial offerings. Applications built with BIRT and BIRT iHub deliver more business and consumer insights to more people than all BI companies combined. 

The deployment of Big Data architectures has become more prevalent as organizations realize the power of what Big Data can bring to their businesses and to their profitability. …

Go to page:12345

Thank you for subscribing!