The Hortonworks Blog

Hortonworks is committed to collaborate with ISVs and partners to onboard their applications to YARN and Hadoop. As part of the YARN Webinar Series, we have introduced different methods to help you integrate your applications to YARN: Native YARN integration, Slider and Tez. As part of this series, we now offer the opportunity to learn Scalding, with guest speaker from Twitter, who will talk about simplifying application development on Apache Hadoop and YARN.…

Novetta is a new Hortonworks Technology Partner and recently achieved HDP 2.1 Certification and YARN Ready status. In this guest blog, Jennifer Reed, director of product management at Novetta, talks about Novetta’s YARN Ready entity resolution and relationship dimension-building application.

The New Era of Analytics

Thomas Davenport, in his keynote at the Hadoop Summit San Jose 2014, said that the big data analytics has entered a new phase: From Analytics 2.0 to 3.0.…

Modern retailers collect data from a multitude of consumer engagement channels, including point of sale systems, the web, mobile applications, social media, and more. They hope to use this data to derive greater customer insights, promote increased brand engagement and loyalty, optimize pricing and promotions, streamline the supply chain, and enhance their business models.

Data from the retailer’s transactional systems has historically been stored in an enterprise data warehouse (EDW) or other database, but these traditional data repositories are not well suited for the newer, unstructured data types like log files, social media updates and information from in-store sensors.…

StackIQ, a Hortonworks technology partner, offers a comprehensive software suite that automates the deployment, provisioning, and management of Big Infrastructure. In his second guest blog, Anoop Rajendra (@anoop_r), a Senior Software Developer at StackIQ, gives instructions for using StackIQ Comand Line Interface (CLI) to deploy a Hortonworks Data Platform (HDP) cluster.

In a previous blog post, we discussed how StackIQ’s Cluster Manager automates the installation and configuration of an Apache Ambari server.…

Thanks to all who joined us on our Hortonworks/Voltage webinar, “Securing Hadoop: What are Your Options?” For those who couldn’t attend, we’re sorry we missed you. We’ve included a link to the webinar recording below, and please listen in!

On the webinar, Hortonworks’ Vinod Nair presented the recently-announced Apache Argus incubator: a central policy administration framework across security requirements for authentication, authorization, auditing and data protection. Sudeep Venkatesh, of Voltage Security, defined data-centric protection technologies that easily integrate with Hive, Sqoop, MapReduce and other Hadoop interfaces.…

Hortonworks and Informatica have teamed up to provide the data systems and tools making up the foundation of the modern data architecture. Today, Scott Hedrick, Director of Big Data Partnerships at Informatica, tells us more about the brand new Informatica Big Data Edition Trial Sandbox for Hortonworks

With the help of our friends at Hortonworks, the Informatica Big Data team has preinstalled a 60-day trial version of the Informatica Big Data Edition into the Hortonworks Sandbox.…

Speed, Scale, and SQL Semantics

Since its inception and graduation as a Top Level Project (TPL) from Apache Foundation Project (ASF) in September 2010, Apache Hive has been steadily improving—in speed, scale, and SQL semantics—to meet enterprise requirements for both interactive and batch queries at Hadoop scale.

It has become a defacto standard for SQL queries over petabytes of data stored in Hadoop. It is a compliant SQL engine that offers familiarity to developers over a comprehensive and familiar set of SQL semantics for Apache Hadoop.…

In this partner guest blog, Microsoft Principal Software Development Engineer Eric Hanson weighs in how Stinger.next will benefit HDInsight customers. Coming from someone who worked on Microsoft SQL Server for years and is a committer to Apache Hive, Eric explains that Stinger.next initiatives and capabilities are essential to take Hive to the next level.

Apache Hive is one of the most-used features of Microsoft’s cloud Hadoop service, Azure HDInsight. So our HDInsight customers of course will enjoy new capabilities that make Hive faster.…

What drives successful implementations of big data analytics projects? Hortonworks’ Director of Data Science,

Ofer Mendelevitch, Director of Data Science, Hortonworks

Ofer Mendelevitch, teams up with Zementis’ Founder and CEO Michael Zeller to discuss their learnings from working with dozens of companies from small cloud-based start-ups to some of the largest companies in the world.

Register here for the webinar on September 10 at 10am Pacific Time.

Hortonworks will present their approach to using Apache Hadoop for predictive models with big data, and the benefits of Hadoop to data scientists.…

Continuing our ecosystem momentum for the next generation of SQL in Hadoop, here to share his insights with us on the potential that Stinger.next holds for both the individual data worker as well as the data driven company alike is Dustin Smith, Product Marketing Manager at Tableau Software.

The work delivered over the last year as part of Stinger has made a tremendous impact for our customers who are using Tableau to analyze Hadoop data, and we are excited to see this momentum continue under the leadership of Hortonworks within the Apache Hive community. …

In case you missed it — earlier this week, Alan Gates and team provided some insights into Stinger.next roadmap around the delivery of Enterprise SQL and Hadoop Scale. We’re excited to continue the conversation and include some of our key partners around their excitement on this important initiative. Today’s guest blogger, Michael Hiskey, Chief Product Evangelist & Product Marketing, from MicroStrategy, provides some insight on the Stinger.next initiatives and how this will benefit MicroStrategy customers and the overall Big Data and Hadoop community.…

In this partner guest blog, John Haddad, senior director of product marketing at Infomatica, explains and enumerates how Stinger.next’s key innovations in Enterprise SQL at Hadoop scale will augment Informatica’s Big Data Edition integration with Hortonworks’ Modern Data Architecture.

Informatica is excited about the new innovations Hortonworks is including in the Stinger.next project such as Hive transactions, Hive-Spark integration, and sub-second queries. The Informatica Big Data Edition helps our customers take advantage of these new innovations without having to rebuild their data pipelines for Big Data analytics.…

Big data growth continues to be a major consumer of enterprise IT resources with no end in sight. To gain value out of this data, organizations are creating new analytic applications for their business users. This growth in data and applications impacts hardware, networking and software resource consumption in the data center. CIOs are looking for ways to future proof their data management infrastructure and to be more efficient about how they monitor and manage their Apache Hadoop clusters.…

Apache Ambari is an open operational framework to provision, manage and monitor Hadoop clusters. As Hadoop has grown from a single purpose (MapReduce) framework to an extensible multi-purpose compute platform, with Apache Hadoop YARN as its architectural center, Apache Ambari has marched hand-in-hand to meet the evolving operational needs of Enterprise Hadoop.

Enabling ecosystem integration has been a key thrust of recent innovations within the Apache Ambari community. Key developments including Stack Extensibility and Ambari Views allow Ambari to deploy and manage YARN enabled applications.…

In April of this year, Hortonworks, along with the broad Hadoop community delivered the final phase of the Stinger Initiative on schedule, completing the work to bring interactive SQL query to Apache Hive.  The original directive of Stinger was about advancing SQL capabilities at petabyte scale in pure open source. And over 13 months, 145 developers from 44 companies delivered exactly that, contributing over 390,000 lines of code to the Hive project alone.…

Go to page:12345...102030...Last »