September 19, 2017 | Simon Ball | Announcements

Hortonworks Cybersecurity Platform – Big Data Cybersecurity Solution

September 18, 2017 | Matt Spillar | Hortonworks Case Study

Lloyds Banking Group Brings Home Data Accolade

September 18, 2017 | Vinod Kumar Vavilapalli | From the Dev Team

Engineering @ Hortonworks – The Matrix

Viewing posts by: Guest Author

This guest blog is from Denodo, a leader in data virtualization and a Hortonworks partner for many years. Denodo helps customers with data in multiple, heterogeneous sources integrate it quickly, easily and cost-effectively to derive business insights and shift their strategy to become more data-driven. In addition to […]

This blog has contributions from Mingliang Liu and Rajesh Balamohan. Late last year, we provided a brief history of Apache Hadoop support for Amazon S3. Our first focus of work was speeding up the read of S3-hosted data acting as a query input. That was followed by the write pipeline, as well as scaling and […]

On June 7, we hosted the Next Gen Data Analytics Powered by Cloud webinar with speakers Jeff Sposetti of Hortonworks and Karthik Krishnan of Amazon Web Services. The webinar provided an overview of how your organization can achieve the benefits of data analytics with the cloud, how to use the AWS marketplace for ease of deployment, […]

This guest blog is from Fuzzy Logix, a new partner of Hortonworks. Michael Upchurch, Co-founder & COO at Fuzzy Logix, describes the challenges the manufacturing industry faces, along with the use cases and benefits of analyzing data in place with solutions from Fuzzy Logix and Hortonworks. Manufacturers are at the forefront of re-imagining how Big Data can redefine industry […]

Recently, Shaun Connolly (of Hortonworks) and Tony Baer (of Ovum) presented “Get Started with Big Data in the Cloud”. During this webinar, they discussed the opportunity to take advantage of the cloud for big data workloads. As we see an increase in data analytics in the cloud, we are also seeing an increase in data […]

Apache Spark 2.1 improves Structured Streaming and machine learning. Structured Streaming: Kafka 0.10 support, metrics and stability improvements. Machine Learning: SparkR improvements, including new ML algorithms for LDA, random forests, GMM, etc. The recent release of Hortonworks Data Platform 2.6 (“HDP 2.6”) includes Apache Spark 2.1. And Hortonworks Data Cloud (“HDCloud”) for AWS gives […]

Last week, we hosted the Get Started with Big Data in the Cloud ASAP webinar with speakers Shaun Connolly of Hortonworks and Tony Baer of Ovum. The webinar provided an informative overview of the challenges enterprises face with the overwhelming number of choices available in the cloud. It covered how businesses can get over the […]

Time is running out to secure your spot at DataWorks Summit/Hadoop Summit. With over 170 sessions featuring top organizations using open source technologies to leverage their data and drive predictive analytics, distributed deep-learning, and artificial intelligence initiatives, you don’t want to miss the industry’s premier event. Join us June 13 – 15 in San Jose and save […]

Comcast. Verizon. Netflix. Walgreens. PayPal. Ford Motor Company. Wells Fargo. LinkedIn. Uber. These and similar organizations will be at this year’s DataWorks Summit/Hadoop Summit. Open source experts will present over 170 sessions. They’ll share how they use open source technologies to leverage their data and drive predictive analytics, distributed deep-learning and artificial intelligence initiatives. There’s still time for you to […]

BMW Group (BMW) is a German luxury vehicle, motorcycle, and engine manufacturing company founded in 1916. It is one of the best-selling luxury automakers in the world and is leveraging deep learning with HDP to save on manufacturing and costs. Three weeks ago, at the DataWorks Summit in Munich, we announced the Data Hero winners for the […]

In 2016, we published the second version, v1.0.1, of the Spark HBase Connector (SHC). In this blog, we will go through the major features we have implemented this year. Support Phoenix coder: SHC can be used to write data out to an HBase cluster for further downstream processing. It supports Avro serialization for input and output data […]

R is one of the primary programming languages for data science, with more than 10,000 packages. R is open source software that is widely taught in colleges and universities as part of statistics and computer science curricula. R uses the data frame as its API, which makes data manipulation convenient. R has powerful visualization infrastructure, […]

Thank you for reading our Data Lake 3.0 series! In part 1 of the series, we introduced what a Data Lake 3.0 is. In part 2 of the series, we talked about how a multi-colored YARN will play a critical role in building a successful Data Lake 3.0. In part 3 of the series, we […]

The first post (https://hortonworks.com/blog/european-banking-regulation-evolves-mar-mifid-ii-13/) in this three-part series explored the evolution of capital markets regulation in the European financial markets over the last 15 years. We covered the important aspects of MAR (Market Abuse Regulation) and MiFID II. In this second blog post, we will discuss the business and technology requirements that drive these implementations to an […]

As I sat in the audience of the 2017 Autonomous Vehicle Silicon Valley conference this week in Santa Clara, listening to luminaries present their visions for an Autonomous Vehicle future, I was struck by the change sweeping across the automotive industry. This was definitely not the same industry I have worked within over the last […]