Get fresh updates from Hortonworks by email

Once a month, receive latest insights, trends, analytics, offering information and knowledge of the Big Data.

cta

Get Started

cloud

Ready to Get Started?

Download sandbox

How can we help you?

closeClose button
August 21, 2017 | Matt Spillar | Hortonworks Case Study

Data Has Sparked a Retail Revolution

August 18, 2017 | Tom Hastain | Hortonworks Case Study

Worldpay: Influencing Open Source for Enterprise Readiness via Hortonworks Support

August 17, 2017 | Syed Mahmood

What is a Data Science Workbench and Why Do Data Scientists Need One?

Viewing posts by: Robert Hryniewicz« Back to all

X
FILTERS
ALL
TECHNICAL
BUSINESS

All Topics















All Channels











CLEAR FILTERS

Recently Raj Verma (President & COO of Hortonworks) spoke to Jim McHugh from Nvidia at the DataWorks Summit keynote in San Jose (video). Jim began by talking about how parallel processing that is used in gaming is also essential to Deep Learning*. And the lifeblood of Deep Learning is data. With its insatiable desire for […]

This blog was co-authored by: George Vetticaden, Sriharsha Chintalapani, Jungtaek Lim, Sanket Shah Last week, in Part 3 of this blog series, we announced the GA of HDF 3.0 and let the cat out of the bag by introducing the new open source component called  Streaming Analytics Manager (SAM), an exciting new technology that helps developers, […]

Apache Spark 2.1 Improves in Structured Streaming and Machine Learning. Structured Streaming: Kafka .10 support, Metrics & Stability improvements Machine Learning: SparkR Improvements including new ML algorithms for LDA, Random forests, GMM, etc. The recent release of Hortonworks Data Platform 2.6 (“HDP 2.6”) includes Apache Spark 2.1. And Hortonworks Data Cloud (“HDCloud”) for AWS gives […]

Destination Autonomous The march towards autonomous vehicles continues to accelerate. While expert opinion differs on the specific timing and use cases that will emerge first, few deny that self-driving cars are in our future. Not surprisingly, when reviewing Big Data strategies with my automotive clients, discussions on data management strategies for autonomous driving research inevitably […]

Open Energi is a UK clean tech company working with businesses to intelligently optimize their electricity demand and to deliver both revenue and cost savings. The company pioneers in its market in terms of services, real-time analytics, machine learning, and Internet of Things (IoT). Through Hortonworks Data Platform (HDP) and DataFlow (HDF), Open Energi developed […]

You have heard about Big Data for a long time, and how companies that use Big Data as part of their business decision making process experience significantly higher profitability than their competition. Now that your company is ready to embark on its first Apache Hadoop® journey there are important lessons to be learned. Read on […]

Apache Spark has ignited an explosion of data exploration on very large data sets. Spark played a big role in making general purpose distributed compute accessible. Anyone with some level of skill in Python, Scala, Java, and now R, can just sit down and start exploring data at scale. It also democratized Data Science by […]

How RoboAdvisors work..

“Millennials want “finance at their fingertips”..they want to be able to email and text the financial advisors and talk to them on a real-time basis,”– Greg Fleming, Ex-Morgan Stanley Executive The first post in this series on Robo-advisors,  touched on the fact that Wealth Management has been an area largely untouched by automation as far as […]

Guest author: Jeff Kelly, Data Strategist, Pivotal The phrase “digital transformation” gets bandied about a lot these days, but what exactly does it mean? When you strip away the hyperbole, I believe digital transformation is the process by which enterprises evolve from using traditional information technology to merely support existing business models to adopting modern […]

Provenance, Lineage & Chain of Custody The models of Provenance, Lineage and Chain of Custody are used in fine art to determine when a piece was created, the sequence of locations where it was held, how it was touched along the way, and who has owned it since creation, all with the purpose of authenticating the piece. […]

People often think about cloud architecture in simplistic terms: you’re either public, private, or hybrid. (In fact, there’s even confusion about the meaning of the term “hybrid” itself—this video helps clear it up: In the real world, of course, virtually every implementation is hybrid—no company puts 100% of its IT environment into one single cloud. […]

The 100% open source and community driven innovation of Apache Hive 2.0 and LLAP (Long Last and Process) truly brings agile analytics to the next level. It enables customers to perform sub-second interactive queries without the need for additional SQL-based analytical tools, enabling rapid analytical iterations and providing significant time-to-value. TRY HIVE LLAP TODAY Read about […]

The Financial regulators are driving a Data Evolution Traditionally technology moves fast, regulators react slow. When technology leaps forward, it enables financial firms to change the nature of their business – often into un-regulated territory; Regulators react to pass regulation to catch up. This model can work in slow moving markets, but in todays interconnected […]

With the release of Hortonworks 2.5 Sandbox several new exciting features have been added to Apache Spark and Apache Zeppelin. Apache Spark Updates One of the most powerful new Hortonworks 2.5 Sandbox features is the ability to run two versions of Spark alongside in the same environment: a Generally Available (GA) Spark 1.6.2 and a […]

It’s never been easier to get started with Apache Hadoop. The Hortonworks Sandbox combines 100% open-source Apache Hadoop and its data access engines (Apache Spark, Apache Hive, Apache HBase, Apache Solr, Apache Pig) with enterprise-grade Operations (Apache Ambari), Security (Apache Ranger and Apache Knox) and Governance (Apache Atlas).  The Sandbox also provides tools for devOps, […]