The Hortonworks Blog

Hortonworks is a huge supporter of the Apache Software Foundation (ASF) and fully embrace the processes and procedures through the only 100% open source Hadoop platform HDP. As Forrester VP Mike Gualtieri said in the Forrester Wave “Hortonworks lives and loves open source.” And that will be fully on display at the inaugural at Apache: Big Data Europe 2015 event this year in Budapest, Hungary.

The event will be held 28-30 September at the Corinthia Hotel and Hortonworks will be contributing in a big way as a Diamond Sponsor.…

On September 22nd at 10:00 am PST, Vincent Lam, Director of Product Marketing at Protegrity, and Syed Mahmood, Sr. Product Marketing Manager at Hortonworks, will be talking about how to secure sensitive data in Hadoop Data Lakes.

Register Now

In this blog, they provide answers to some of the most frequently asked questions they have heard on the topic.

  • What’s the best approach for the security of Hadoop Data Lakes?
  • As enterprises continue to harness the power of Hadoop to store large amounts of data, security becomes an even more important part of the ecosystem.…

    Over multiple conversations and espressos, Steven Witt, Senior Director of Industry Solutions at Hortonworks, and I have been exploring the diverse challenges associated with collecting, conducting and curating data flows from the well site.

    Steven recently joined Hortonworks when we acquired Onyara. Steven was the Onyara CEO and co-founder. This is the first in a series summarizing our conversations, focused on how Hortonworks DataFlow collects data from the field in upstream oil and gas operations, then conducts that through to the data center and back in order to make critical decisions related to drilling and production.…

    Symantec helps consumers and organizations secure and manage their information-driven world by protecting digital information and online transactions.

    The Symantec Cloud Platform team turned to Hortonworks to ingest an enormous volume of security logs, analyze that security metadata and then use that insight to protect its customers. Symantec now analyzes threat data much more quickly because it optimized its data architecture using the storage and processing power of HDP—for both historical and real-time analysis.…

    Today we are thrilled to officially open our new international headquarters in the heart of London. This new office is a substantial increase and upgrade to the space we had been occupying for more than a year, and was much needed in order to accommodate our continued growth and success in international markets. We’ve reached a very exciting stage in our growth. The move to new, larger premises in the City of London will allow us to better serve our customers, partners and the Hadoop Community and up the tempo of our international commercial activities as a result.…

    We are excited to announce the general availability of Hortonworks Sandbox with HDP 2.3 on Microsoft Azure Gallery. Hortonworks Sandbox is already a very popular environment for developers, data scientists and administrators to learn and experiment with the latest innovations in Hortonworks Data Platform.

    The hundreds of innovations span across Apache Hadoop, Kafka, Storm, Spark, Hive, Pig, YARN, Ambari, Falcon, Ranger and other components that make up HDP platform.…

    Guest blogger David Hill, Business Development Director at Open Energi, explains the challenges of building a virtual power station, and why data is the fuel. Follow Open Energi on @openenergi

    Open Energi is working with businesses in the UK to harness the flexible energy demand from their equipment and aggregating it to create a virtual power station. We’re turning the whole system on its head so that instead of energy supply adjusting to meet demand, our demand for energy adjusts to meet supply – in real-time.…

    Today’s guest blogger is from Hortonworks Technology Partner, WANdisco. Peter Scott, SVP of Business Development and OEM Sales at WANdisco, talks about how to easily migrate from one Hadoop distribution to Hortonworks Data Platform (HDP).

    Migration between Hadoop versions and distributions can be difficult, often causing extended downtime and disruption, unless you use the right tools. DistCp (distributed copy) is a tool available from Apache™ Hadoop®  used for large inter/intra-cluster copying from Apache.…

    Our guest blogger today comes from our partner Talend, who has been working with us for many years to help organizations transition from data chaos to a modern data architecture. In this blog, Talend’s Ashley Stirrup, CMO, talks about a helping organizations to support a dynamic data supply chain.

    In order to remain viable in increasingly competitive markets, companies must create ever-more detailed models of the business that incorporate all data – regardless of source or volume.…

    Are you still learning about the Data Lake? Wondering how it can help your organization manage and leverage massive amounts of data? On September 8th, VHA, the largest member-owned health care company delivering supply chain management services and clinical services to its members, will share their experience and explain how they simplified data management and enabled faster data discovery with Hadoop and data virtualization.

    Register Now

    At VHA, product, supplier and member information, among other data, was siloed across multiple sources.…

    This blog is jointly submitted by Alexander Gray, Ph.D., is chief technology officer, Skytree, a Hortonworks Technology Partner, and Eric Thorsen, general manager, consumer products and retail, Hortonworks.

    As consumers increasingly reveal their shopping habits online, retailers can access social media, purchase history, consumer demand and market trends to better understand their customers, maximize spending and encourage repeat purchases. Retailers are considered early adopters of big data technology, integrating it into every imaginable business process to achieve a deeper understanding of consumers and associated buying trends.…

    Hortonworks has redesigned its certification program by creating hands-on, performance-based exams. Our new exams consist of tasks that candidates perform on a live HDP cluster. We are the first Hadoop provider to exclusively offer hands-on certification exams! The goal is to distinguish our exams from multiple-choice certifications. This will provide a measure of skills recognized in the Hadoop industry as meaningful and relevant to real-world tasks that a candidate would perform on the job.…

    Today, Tech Mahindra announced that their in-house developed analytics platform Tech Mahindra Analytics Platform (TAP) has joined the family of Hortonworks certified partners and products. As part of Hortonworks’ continued focus on the expansion of the Apache™ Hadoop® ecosystem, Tech Mahindra recently completed extensive certification testing on the Hortonworks Data Platform (HDP). Not only is Tech Mahindra a certified HDP Ready partner, they are also certified as HDP YARN Ready and as an HDP Systems Integrator Gold partner.…

    Hortonworks subscribers across all major industries use Hortonworks Data Platform (HDP) to power advanced analytics applications for data discovery and predictive analytics. Learn how the leading Fortune 100 Manufacturers are using Hadoop to accelerate innovation and transform business models.

    Attend The Event

    I am the GM for Manufacturing Solutions at Hortonworks, and tomorrow August 26th I have been invited to present “Turning Big Data into Big Opportunity in Manufacturing” at the 2015 Manufacturing CIO Summit.…

    In my last blog post, I told you how excited I am about the huge opportunity ahead for us at Hortonworks. In addition to continued innovation in HDP, there is a huge opportunity for the industry being driven by the Internet of Things (IoT). I think it’s broader than the Internet of Things, and is really the Internet of Everything and ANYTHING.

    Whether it is in moving metal (like cars or jet engines), wearable technology or even simple things like our modern refrigerator, sensors and the data they create are the next big thing.…