Get fresh updates from Hortonworks by email

Once a month, receive latest insights, trends, analytics information and knowledge of Big Data.

cta

Get Started

cloud

Ready to Get Started?

Download sandbox

How can we help you?

closeClose button
November 13, 2017 | Matt Spillar | Hortonworks Case Study

How Nissan is Harnessing Big Data to Provide Value to Customers

November 10, 2017 | Syed Mahmood | Announcements

Certification of IBM Data Science Experience (DSX) on HDP is a Win-Win for Customers

November 9, 2017 | Will Xu | Hadoop Ecosystem

Ambari Kerberos support for HBase Part 1

Viewing posts by: Shelby Khan« Back to all

X
FILTERS
ALL
TECHNICAL
BUSINESS

All Topics















All Channels











CLEAR FILTERS

In September, we rounded out the 2017 DataWorks Summit events in Sydney, Australia. Each event boasts different sessions, keynotes, sponsors, etc. so you can be sure to get a new experience and the latest updates from the community wherever you go. Aside from the networking opportunities, the best part of DataWorks Summit is the session […]

At Hortonworks we are constantly striving to achieve high quality releases. HDP/HDF releases are deployed by thousands of enterprises and are used in business critical environments to crunch several petabytes of data every single day. So maintaining the highest standards of quality and investing in an infrastructure to support the repeatable standards of quality is […]

This guest blog is from our partner Denodo who is a leader in data virtualization and a Hortonworks partner for many years. Denodo helps customers who have data from multiple, heterogeneous sources to quickly, easily and cost-effectively integrate it to derive business insights and positively change their strategy to become more data-driven. In addition to […]

This blog has contributions from Mingliang Liu and Rajesh Balamohan. Late last year, we provided a brief history of Apache Hadoop support for Amazon S3. Our first focus of work was speeding up the read of S3-hosted data acting as a query input. That was followed by the write pipeline, as well as scaling and […]

On June 7, we hosted the Next Gen Data Analytics Powered by Cloud webinar with speakers from Hortonworks, Jeff Sposetti  and Amazon Web Services, Karthik Krishnan. The webinar provided an overview on how your organization can achieve the benefits of data analytics with the cloud, how to use the AWS marketplace for ease of deployment, […]

This guest blog is from Fuzzy Logix, a new partner of Hortonworks. Michael Upchurch, Co-founder & COO at Fuzzy Logix, describes the challenges the Manufacturing industry is faced with, use cases and benefits of analyzing data in place with solutions from Fuzzy Logix and Hortonworks. Manufacturers are on the forefront of re-imagining how Big Data can redefine industry […]

Recently Shaun Connolly (of Hortonworks) and Tony Baer (of Ovum) presented “Get Started with Big Data in the Cloud”.  During this webinar, they discussed the opportunity to take advantage of the cloud for big data workloads. As we see an increase in data analytics in the cloud, we are also seeing an increase in data […]

Apache Spark 2.1 Improves in Structured Streaming and Machine Learning. Structured Streaming: Kafka .10 support, Metrics & Stability improvements Machine Learning: SparkR Improvements including new ML algorithms for LDA, Random forests, GMM, etc. The recent release of Hortonworks Data Platform 2.6 (“HDP 2.6”) includes Apache Spark 2.1. And Hortonworks Data Cloud (“HDCloud”) for AWS gives […]

Last week, we hosted  Get Started with Big Data in the Cloud ASAP webinar with speakers from Hortonworks, Shaun Connolly and Ovum, Tony Baer. The webinar provided a very informative overview around the challenges enterprises are facing with the overwhelming number of choices available in the cloud. It covered how businesses can get over the […]

Time is running out to secure your spot at DataWorks Summit/Hadoop Summit. With over 170 sessions featuring top organization using open source technologies to leverage their data, drive predictive analytics, distributed deep-learning, and artificial intelligence initiatives, you don’t want to miss the industry’s premier event. Join us June 13 – 15 in San Jose and save […]

Comcast. Verizon. Netflix. Walgreens. PayPal. Ford Motor Company. Wells Fargo. LinkedIn. Uber. These and other organizations like these will be at this year’s DataWorks Summit/Hadoop Summit. Open source experts will present over 170 sessions. They’ll share how they use open source technologies to leverage their data, drive predictive analytics, distributed deep-learning and artificial intelligence initiatives. There’s still time for you to […]

BMW Group (BMW) is a German luxury vehicle, motorcycle, and engine manufacturing company founded in 1916. It is one of the best-selling luxury automakers in the world and is leveraging deep learning with HDP to save on manufacturing and costs. Three weeks ago, at the DataWorks Summit in Munich, we announced the Data Hero winners for the […]

In 2016, we published the second version v1.0.1 of Spark HBase Connector (SHC). In this blog, we will go through the major features we have implemented this year. Support Phoenix coder SHC can be used to write data out to HBase cluster for further downstream processing. It supports Avro serialization for input and output data […]

R is one of the primary programming languages for data science with more than 10,000 packages. R is an open source software that is widely taught in colleges and universities as part of statistics and computer science curriculum. R uses data frame as the API which makes data manipulation convenient. R has powerful visualization infrastructure, […]

Thank you for reading our Data Lake 3.0 series! In part 1 of the series, we introduced what a Data Lake 3.0 is. In part 2 of the series, we talked about how a multi-colored YARN will play a critical role in building a successful Data Lake 3.0. In part 3 of the series, we […]