cta

Get Started

cloud

Ready to Get Started?

Download sandbox

How can we help you?

closeClose button

The Hortonworks Blog

I had the pleasure to speak at Spark Summit in New York today about accelerating the adoption of Spark by mainstream enterprises. I had to admit at the beginning of my talk that I’m an “open source addict” — over the past 12 years I’ve been blessed to have called JBoss, Red Hat, SpringSource, and Hortonworks […]

Attunity is an ISV partner of Hortonworks focused provides data optimization and data integration software that helps Hortonworks customers address exploding data growth and efficiently manage the performance of BI and data warehouse systems. As our guest blogger today, Carole Gunst, Marketing Director at Attunity, introduces the findings of a recent report on Data Lake Adoption, the […]

Welcome back to my blogging adventure.  If you’ve been reading along, you’re aware of the lightbulb moments from my article, “echo: hello world”, that allowed me to discover the benefits of an analytic approach to cybersecurity.  Next I gave a little slice in the life of our intrepid SOC analyst in, “Cybersecurity: the end of […]

This is another great European customer guest blog post  authored by Joan Viladrosa, Tech Lead & Senior Big Data Engineer at Billy Mobile. You can hear more about their solution by joining our live webcast February 23rd. Register here.   About Billy Mobile As a mobile ad exchange with a large marketplace of direct publishers […]

We started Hortonworks Community Connection at the end of 2015, and there is some amazing content that any data developer or data administrator should read and bookmark. I will publish this blog weekly and highlight the top technical articles that are on HCC based on community activity and votes.  Top 3 articles on the site:  Sample […]

Register now for the February 25th Webinar at 10am PST/1pm EST. Data is a natural resource for insurance companies. It is acquired, exchanged, and analyzed on an unprecedented scale. Insurance brokers around the world now rely on data obtained from: Mobile devices Wearable devices Telematics Clickstream Social media Claims notes and diaries Call center recordings […]

Hortonworks has achieved quite a bit of success with online dating. Personally, I haven’t just yet, but hey it warms my heart to think about all those that we’ve helped bring together. Valentine’s Day is upon us and so I wanted to launch this Cupid’s arrow with a missive about how Hortonworks Data Platform (HDP) […]

This year’s Insurance Analytics USA Summit has an exciting new format with presentations and panels that focus on using data to its full potential, creating a data-conscious culture, and applying innovative modeling techniques. Sessions include “The Future of Insurance: Using Analytics to Take Advantage of the Data-Driven Age of Insurance” and “New and Big Data: […]

Big Data and Apache™ Hadoop® are driving tectonic shifts in enterprise data management (EDM) within the financial services industry. Open Enterprise Hadoop and the vendor ecosystem growing up around it are consolidating and standardizing data architectures at the country’s largest banks—transforming expensive, inflexible, and proprietary data landscapes into economic, agile, open source data environments. Regulatory […]

People have been asking us – Is Google Cloud Dataflow the same thing as Hortonworks DataFlow (HDF)? So we thought we’d take the opportunity to share with you how we see these two products work together. Both have the word dataflow in their name, and both systems are rooted in the premise of dataflow programming, […]

We are already more than a month into 2016 and it’s anything but business as usual in Oil and Gas. Current markets are making companies rethink every aspect of their business model, foundational cost structure, and strategy for delivering value to customers and shareholders. The same thorough scrutiny should be applied to traditional enterprise software […]

Apache Storm is the scalable, fault-tolerant realtime distributed processing engine that allows you to handle massive streams of data in realtime, in parallel, and at scale. Windowing computations is one of the most common use cases in stream processing. Support for windowing computations is a must for deriving actionable insights from real time data streams. […]

 

Hadoop All Grown Up

It’s amazing the growth Apache Hadoop and the extended ecosystem has had in the last 10 years. I read through Owen’s “Ten Years of Herding Elephants” blog and downloaded the early docker image of his first patch.  It reminded me of the days it took me to do my first Hadoop install and the effort […]

This year’s Insurance Canada Technology Conference will focus on the impact of new technologies in the insurance industry. Key topics include telematics, analytics, the Internet of Things (IoT), and how these capabilities enable insurance companies to improve underwriting and reduce risk. A recent article at Strategy Meets Action identified digital transformation in the insurance industry […]

Author: Michael Bironneau, Data Scientist, Open Energi At Open Energi, we think of our service as an automated, virtual power station. Whenever the electric grid experiences sudden, unforeseen surges in supply or demand, assets under the control of our Dynamic Demand algorithm automatically pick up the slack – just like a power station would but cheaper […]