The Hortonworks Blog

Posts categorized by : Visitor Type

Three weeks ago, we announced availability of the technical preview of Hortonworks Data Platform (HDP) version 2.1 and since then we have had thousands of downloads of this preview.  We also promised delivery of GA bits on April 22nd  and we are delighted to deliver as stated. HDP 2.1, which includes countless new features across seven new components, is available today from our download page

YARN unlocks the Data Lake

YARN, the resource management layer of Hadoop 2 is delivering value as it has unlocked the data lake vision for many.…

The Apache Hive community has voted on and released version 0.13 today. This is a significant release that represents a major effort from over 70 members who worked diligently to close out over 1080 JIRA tickets.

Hive 0.13 also delivers the third and final phase of the Stinger Initiative, a broad community based initiative to drive the future of Apache Hive, delivering 100x performance improvements at petabyte scale with familiar SQL semantics.…

The power of a well-crafted speech is indisputable, for words matter—they inspire to act. And so is the power of a well-designed Software Development Kit (SDK), for high-level abstractions and logical constructs in a programming language matter—they simplify to write code.

In 2007, when Chris Wensel, the author of Cascading Java API, was evaluating Hadoop, he had a couple of prescient insights. First, he observed that finding Java developers to write Enterprise Big Data applications in MapReduce will be difficult and convincing developers to write directly to the MapReduce API was a potential blocker.…

As enterprises build new applications with the data they cost effectively capture and process with Apache Hadoop it is important for the platform to facilitate the app dev processes. That’s why we are excited to announce that we’ve expanded our partnership with Concurrent, Inc. to simplify and accelerate application development on Hadoop.

There are two components to this expanded partnership.

The Internet of Things (IoT) is in its infancy. You can buy wireless bathroom scales to upload data to monitoring tools helping you manage your weight. You can buy a connected refrigerator that keeps track of the inventory to remind you what you need to buy. It’s fascinating to think about the future of possibilities. In a recent podcast on the SAP Future of Business with Game-Changers Radio, panelist Matt Healey (Analyst at Technology Business Research) commented that he wasn’t ready for the day when his scale and refrigerator talked.…

LOOK Innovative is a new consulting partner of Hortonworks specializing in business applications of Hadoop for retail vertical market.

LOOK Innovative concentrates on delivering the complete Omni-Channel digital experience to retailers, which is the evolution of multi-channel retailing. Omni-Channel is a seamless approach for the consumer through all available shopping channels, including mobile internet devices, computers, bricks-and-mortar, television, radio, direct mail, catalog and so on. It means that consumers make buying decisions based on information from many sources and may purchase through any of those sources – they might research online but buy at the local store and may research at the store but buy online.…

One of the key concerns in the financial industry today is the alarming increase in fraudulent activities.  It is estimated that over $12 billion is spent on fraud detection and prevention and that number is projected to increase significantly over the next few years. Customer data gets compromised and this leads to a decreased level of customer satisfaction and retention, which results in revenue declines for financial organizations.

Join Hortonworks, Skytree and Forrester Research for a Webinar on April 15, 8am PST/11am EST

As financial institutions continue to embrace the adoption of big data infrastructures like the Hortonworks Data Platform based on Hadoop, there is a wealth of information collected that can help with more sophisticated fraud detection. …

We are excited to announce that the Apache™ Tez community voted to release version 0.4 of the software.

Apache Tez is an alternative to MapReduce that provides a powerful framework for executing a complex topology of tasks for data access in Hadoop. Version 0.4 incorporates the feedback from extensive testing of Tez 0.3, released just last month.

This release is especially meaningful because it coincides with completion of the Stinger Initiative (a collaborative community effort involving 145 developers across 44 companies) and the upcoming release of Apache Hive 0.13.…

Securing any system requires you to implement layers of protection.  Access Control Lists (ACLs) are typically applied to data to restrict access to data to approved entities. Application of ACLs at every layer of access for data is critical to secure a system. The layers for hadoop are depicted in this diagram and in this post we will cover the lowest level of access… ACLs for HDFS.

This is part of the HDFS Developer Trail series.  …

Yesterday our partner Teradata announced a new capability called Teradata QueryGrid that further deepens the integration between the Teradata Data Warehouse and the Hortonworks Data Platform. This announcement is important because it delivers on the promise and the value of the Modern Data Architecture by demonstrating how the two technologies complement each other for the enterprise.

Teradata pioneered deeper integration with Apache Hadoop through integration with H-Catalog initially with Aster SQL-H and then the Data Warehouse and now they have taken it to the next level with Teradata QueryGrid.…

Today we are proud to announce that the formation of a terrific partnership with LucidWorks to bring search to the Hortonworks Data Platform. LucidWorks delivers an enterprise-grade search development platform built atop the power of Apache Solr.

Shared Vision and New Scenarios

Both LucidWorks and Hortonworks have a shared vision of innovating in open source and delivering it to customers in an enterprise grade platform.

As part of our continuing mission to build the a completely open, versatile enterprise data platform across many data processing scenarios then Solr offers a simple, yet powerful interface providing advanced search capabilities.…

If you’re excited to get started with the new features in Hortonworks Data Platform 2.1, then we’ve included 4 tutorials for you try out – Sandbox-style.

You can download the HDP 2.1 Technical Preview here, and then get stuck into these great tutorials.

Interactive Query with Apache Hive and Apache Tez

OK, so you’re not going to get huge performance out of a one-node VM, but you can try out Hive on Tez, and see the performance gains versus MapReduce, and also try out features such as Vectorized Query, and the host of new SQL features.…

The pace of innovation within the Apache Hadoop community is truly remarkable, enabling us to announce the availability of Hortonworks Data Platform 2.1, incorporating the very latest innovations from the Hadoop community in an integrated, tested, and completely open enterprise data platform.

Download HDP 2.1 Technical Preview Now

What’s In Hortonworks Data Platform 2.1?

The advancements in HDP 2.1 span every aspect of Enterprise Hadoop: from data management, data access, integration & governance, security and operations. …

There is no doubt that enterprises recognize how Big Data is crucial to monetizing their business.  The information contained in the volumes of data collected can offer key insights into product, customer and competitive trends.  There are a variety of sophisticated tools for Big Data analytics and processing but most big data implementations are based on rudimentary technologies like FTP based scripts for data collection and distribution.

Although FTP is a widely used protocol, there is an inherent lack of reliability in this approach.  …

Apache Falcon is a data governance engine that defines, schedules, and monitors data management policies. Falcon allows Hadoop administrators to centrally define their data pipelines, and then Falcon uses those definitions to auto-generate workflows in Apache Oozie.

InMobi is one of the largest Hadoop users in the world, and their team began the project 2 years ago. At the time, InMobi was processing billions of ad-server events in Hadoop every day.…

Go to page:12345...10...Last »

Thank you for subscribing!