The Hortonworks Blog

Guest blogger David Hill, Business Development Director at Open Energi, explains the challenges of building a virtual power station, and why data is the fuel. Follow Open Energi on @openenergi

Open Energi is working with businesses in the UK to harness the flexible energy demand from their equipment and aggregate it to create a virtual power station. We’re turning the whole system on its head so that instead of energy supply adjusting to meet demand, our demand for energy adjusts to meet supply, in real time.…

Today’s guest blogger is from Hortonworks Technology Partner, WANdisco. Peter Scott, SVP of Business Development and OEM Sales at WANdisco, talks about how to easily migrate from one Hadoop distribution to Hortonworks Data Platform (HDP).

Migration between Hadoop versions and distributions can be difficult, often causing extended downtime and disruption, unless you use the right tools. DistCp (distributed copy) is a tool included with Apache™ Hadoop® and used for large inter- and intra-cluster copying.…

From Brian Burns, Regional Vice President-North Asia, Hortonworks

Big data is affecting nearly every industry in every geography worldwide. That is particularly true in regions that have heavy manufacturing and advanced digital economies, like Japan and the whole of the APAC region. The potential for harnessing this data to improve operations, find new business models and create new analytic applications is enormous, and Hortonworks is seeing tremendous interest and uptake in Open Enterprise Hadoop throughout the region.…

Our guest blogger today comes from our partner Talend, which has been working with us for many years to help organizations transition from data chaos to a modern data architecture. In this blog, Talend’s Ashley Stirrup, CMO, talks about helping organizations support a dynamic data supply chain.

In order to remain viable in increasingly competitive markets, companies must create ever-more detailed models of the business that incorporate all data – regardless of source or volume.…

Are you still learning about the Data Lake? Wondering how it can help your organization manage and leverage massive amounts of data? On September 8th, VHA, the largest member-owned health care company delivering supply chain management services and clinical services to its members, will share their experience and explain how they simplified data management and enabled faster data discovery with Hadoop and data virtualization.

At VHA, product, supplier and member information, among other data, was siloed across multiple sources.…

This blog is jointly submitted by Alexander Gray, Ph.D., chief technology officer at Skytree, a Hortonworks Technology Partner, and Eric Thorsen, general manager of consumer products and retail at Hortonworks.

As consumers increasingly reveal their shopping habits online, retailers can access social media, purchase history, consumer demand and market trends to better understand their customers, maximize spending and encourage repeat purchases. Retailers are considered early adopters of big data technology, integrating it into every imaginable business process to achieve a deeper understanding of consumers and associated buying trends.…

Hortonworks has redesigned its certification program by creating hands-on, performance-based exams. Our new exams consist of tasks that candidates perform on a live HDP cluster. We are the first Hadoop provider to exclusively offer hands-on certification exams! The goal is to distinguish our exams from multiple-choice certifications. This will provide a measure of skills recognized in the Hadoop industry as meaningful and relevant to real-world tasks that a candidate would perform on the job.…

Today, Tech Mahindra announced that its in-house analytics platform, Tech Mahindra Analytics Platform (TAP), has joined the family of Hortonworks certified partners and products. As part of Hortonworks’ continued focus on the expansion of the Apache™ Hadoop® ecosystem, Tech Mahindra recently completed extensive certification testing on the Hortonworks Data Platform (HDP). Not only is Tech Mahindra a certified HDP Ready partner, it is also certified as HDP YARN Ready and as an HDP Systems Integrator Gold partner.…

Hortonworks subscribers across all major industries use Hortonworks Data Platform (HDP) to power advanced analytics applications for data discovery and predictive analytics. Learn how the leading Fortune 100 Manufacturers are using Hadoop to accelerate innovation and transform business models.

I am the GM for Manufacturing Solutions at Hortonworks, and I have been invited to present “Turning Big Data into Big Opportunity in Manufacturing” tomorrow, August 26th, at the 2015 Manufacturing CIO Summit.…

As more organisations start to run big data projects, they are uncovering unique big data challenges specific to Europe. Join this weekly European Big Data Series, where we will explore the latest trends, best practices and challenges of making big data projects successful in Europe.

In my last blog post, I told you how excited I am about the huge opportunity ahead for us at Hortonworks. In addition to continued innovation in HDP, there is a huge opportunity for the industry being driven by the Internet of Things (IoT). I think it’s broader than the Internet of Things, and is really the Internet of Everything and ANYTHING.

Whether it is in moving metal (like cars or jet engines), wearable technology or even simple things like our modern refrigerator, sensors and the data they create are the next big thing.…

Today, Ernst & Young LLP announced a strategic business relationship with Hortonworks, Inc.® to provide new data management offerings that leverage and extend Hortonworks Data Platform (HDP™) together with EY’s data and information management services. EY has been building expertise and solutions around Hadoop to help data-driven organizations leverage newly available data (structured and unstructured) to gain better insights, increase operational efficiency, improve profitability and reduce costs, ultimately with the goal of obtaining a competitive edge in the marketplace.…

In this Hortonworks partner guest blog, Jorik Blaas, chief technical officer at SynerScope, explores a use case in a new class of exploratory analytics, using Apache Spark on YARN, HDP and SynerScope.

Preliminaries

SynerScope is a pioneering developer of fast, sense-making Big Data Analytics technology. Focusing on human-in-the-loop analytics, we excel at combining heterogeneous data sources to enable a new class of exploratory analytics. By leveraging the Hortonworks Data Platform (HDP) through Apache Spark on YARN, we are able to bring agile, lock-in-free analytics at scale to our market.…

In this Hortonworks partner guest blog, Abhimanyu Aditya, Senior Product Manager and co-founder at Skytree, explains how Skytree APIs solve challenges facing data engineers and simplify data preparation and data transformation, using Apache Spark on YARN with Hortonworks Data Platform (HDP).

Challenges Facing Data Engineers and Data Scientists

Machine learning as a technology can be challenging. It is difficult to create, understand and deploy machine learning models. Even before the modeling process can begin, the data needs to be prepared for machine learning, and modern data scientists, developers, hackers, Ph.D.s, analysts and domain experts spend a significant amount of time and effort doing this.…

Every day, more and more new devices (smartphones, sensors, wearables, tablets, home appliances) connect to one another by joining the “Internet of Things.” Cisco predicts that by 2020 there will be 50 billion devices connected to the Internet of Things. Naturally, they will all emit streams of data at short intervals. These data streams will have to be stored, processed and analyzed in real time.

Apache Storm is a scalable, fault-tolerant, real-time distributed processing engine that allows you to handle massive streams of data in real time, in parallel, and at scale.…