The Hortonworks Blog

More from Lisa Sensmeier

Is a Lake Big Enough to House Your Ocean of Data?

Contrary to popular belief, Hadoop was not the elephant-in-the-china-shop that marauded and disrupted the data center. The real culprit is data and how it has exploded in volume. The past two or three years have seen a rise in the number of successful Hadoop projects in enterprises to tackle this explosion of big data. These large volumes of data, the emergence of the Hadoop technology and the need to store all the siloed data in one place have prompted the phenomenon called the Data Lake among enterprises.…

Our guest blogger today is Rob Rosen, Senior Director Partner Solutions at Platfora, describes how to help customers achieve strategic advantage through data discovery.

While many people have heard the notion of “known unknowns” and “unknown unknowns,” it may surprise you to discover that the concept was first popularized by a NASA scientist. In a presentation given at TEDx GeorgeMasonU, Dr. Kirk Borne described how he used the concept of “known unknowns” (things that we knew might exist, but hadn’t seen evidence of) and “unknown unknowns” (things that we could discover and knew nothing about, but would truly surprise us), and how they relate to the concept of Big Data.…

Our guest blog today is from Don Brown, COO and Founder of Rocana, Hortonworks Technology Partner, talks about our partnership, mainstream Hadoop adoption and the importance of global IT Operations management.

Our partnership with Hortonworks is another exciting step on the path to mainstream adoption of Hadoop as the critical platform for modern, global-scale IT Operations management. Hortonworks’ emphasis on a platform that scales with the demands of big data applications is a great fit for the IT Operations market and for customers looking for more reliable, extensible, analytics, and limitless solutions.…

Today’s guest blogger is from Hortonworks Technology Partner, WANdisco. Peter Scott, SVP of Business Development and OEM Sales at WANdisco, talks about how to easily migrate from one Hadoop distribution to Hortonworks Data Platform (HDP).

Migration between Hadoop versions and distributions can be difficult, often causing extended downtime and disruption, unless you use the right tools. DistCp (distributed copy) is a tool available from Apache™ Hadoop®  used for large inter/intra-cluster copying from Apache.…

Our guest blogger today comes from our partner Talend, who has been working with us for many years to help organizations transition from data chaos to a modern data architecture. In this blog, Talend’s Ashley Stirrup, CMO, talks about a helping organizations to support a dynamic data supply chain.

In order to remain viable in increasingly competitive markets, companies must create ever-more detailed models of the business that incorporate all data – regardless of source or volume.…

In this Hortonworks’ partner guest blog, Jorik Blaas, chief technical officer at SynerScope, explores a use case in a new class of exploratory analytics, using Apache Spark on YARN, HDP and SynerScope.

Preliminaries

SynerScope is a pioneering developer of fast, sense-making Big Data Analytics technology. Focusing on human-in-the-loop analytics, we excel at combining heterogeneous data sources to enable a new class of exploratory analytics. By leveraging the Hortonworks Data Platform (HDP) platform through Apache Spark on YARN, we are able to bring agile lock-in-free analytics at scale to our market.…

In this Hortonworks’ partner guest blog, Abhimanyu Aditya, Senior Product Manager and co-founder at Skytree, explains how Skytree APIs solve challenges facing data engineers, simplifies data preparation and data transformation, using Apache Spark on YARN with Hortonworks Data Platform (HDP).

Challenges Facing Data Engineers and Data Scientists

Machine learning as a technology can be challenging. It is difficult to create, understand and deploy machine learning models. Even before the modeling process can begin, the data needs to be prepared for machine learning and modern data scientists, developers, hackers, Ph.D.’s, analysts and domain experts spend a significant amount of time and effort doing this.…

On August 19th, Dr. Alexander Gray, CTO and Co-Founder, Skytree, and Cindy Maike, General Manager, Insurance at Hortonworks, will be joining Patricia Harman, Editor-in-Chief at Claims Magazine, for a Skytree webinar on “Driving profitability and lowering costs using Machine Learning on Hadoop.”

Register for the Webinar on August 19th at 10am Pacific/1pm Eastern time

In this blog, Alex and Cindy exchange perspectives on what machine learning means for insurers, and where opportunities are for its application.…

Bit Refinery is a Hortonworks Technical Partner and recently certified with HDP. Bit Refinery is a VMware© Cloud Infrastructure-as-a-Service (IaaS) provider featuring virtualization technology hosted within their fully redundant virtual data centers. Bit Refinery offers a hosted Hortonworks Sandbox providing an easy way to experience and learn Hadoop with ease. All the tutorials available from the Hortonworks Sandbox work just as if you were running a localized version of the Sandbox.…

Argyle Data is a Hortonworks Technology Partner and recently certified on the Hortonworks Data Platform (HDP), and was awarded the OPS Ready badge for their integration with Apache Ambari. Here, Dr. Ian Howells talks about how Argyle Data is helping customers detect fraud faster with their native Hadoop application.

We believe that the world is moving to a new generation of native Apache Hadoop applications. When you build your application from the ground up on Hadoop, it is critical to make it simple for any organization to provision, manage and monitor at scale.…

Waterline Data is a Hortonworks Technology Partner and recently earned HDP Certification and YARN Ready with their solution that automates the inventory of data assets in the data lake, enables data governance, and provides self-service to data engineers and data scientists to find and understand their data. Learn more by joining the upcoming webinar on May 6, download the Sandbox tutorial or joint whitepaper. Our guest blogger is Oliver Claude, CMO at Waterline Data.…

In this guest blog, Kumar Srivastava, senior director of product management at ClearStory Data, shares his thoughts on ClearStory’s integration with Hortonworks Data Platform (HDP)

We are excited to be working with and announcing ClearStory Data’s integration with Hortonworks Data Platform (HDP) during Strata + Hadoop World 2015. This partnership with Hortonworks is significant as it brings ClearStory’s business-ready, fast-cycle, scalable analysis on Hadoop Data Lakes and specifically on the Hortonworks Data Platform (HDP).…

Talend is a Hortonworks Certified Technology Partner, and our guest blogger today is Shawn James, director, big data business development, Talend. Shawn and Jim Walker, director of product marketing at Hortonworks, are our guest speakers in an upcoming webinar on Feb. 12th.

If you are a data scientist, MapReduce or Hadoop developer, you are in demand given the massive increase in data science-based projects. These projects are being driven by the private sector of course, but also by a public sector that is looking to tackle a new range of use cases using big data.…

This guest blog post is from Alyssa Jarrett, product marketing manager at Splice Machine. Splice Machine is a Hortonworks Certified Technology Partner and provides one of the only Hadoop RDBMS to power a new generation of real-time applications and operational analytics. With its recent Certification with HDP, Splice Machine offers a 10x price/performance improvement over traditional relational databases.

Built on top of the HDFS and Apache HBase components in the Hortonworks Data Platform (HDP), Splice Machine is delighted to announce that it has completed the required integration testing with HDP.…

VoltDB is a Certified Hortonworks Technology Partner and developers of an in-memory relational DBMS capable of supporting high volume OLTP and real-time analytics with Hortonworks Data Platform. Our guest blogger today is John Piekos, vice president of engineering at VoltDB.

It’s a common phrase here at VoltDB: Streaming Apps are Really Database Apps When You Use a Database that’s Fast Enough.

What does that mean?

We’re seeing a trend: developers are struggling to create interactive, real-time applications on fast streaming data.…