
Viewing posts by: Syed Mahmood


Data science is inherently an exploratory and creative process because there is usually neither a definitive answer to the problem at hand nor a well-defined approach to reaching one. Data scientists research problems, explore data, visualize patterns across data and use their experience and judgment to choose parameters and processes that may be relevant to […]

Apache Spark has been Open Source’s new kid on the block. Companies are using Spark to develop sophisticated models that enable them to discover new opportunities or avoid risk. But what does the future, or at least the near future, hold for Spark? In this blog we have outlined five trends we see in […]

As enterprises around the world bring more of their sensitive data into Hadoop data lakes, democratizing access to data without sacrificing strong security principles becomes paramount. According to a recent research report by Securosis, “Hadoop has (mostly) reached security parity with the relational platforms of old, and that’s saying a […]

“Water, water everywhere, / Nor any drop to drink.” These lines from “The Rime of the Ancient Mariner” by Samuel Taylor Coleridge also accurately describe companies that are trying to transform themselves into data-driven companies. These organizations have astronomical volumes of raw data at their disposal, but how do they find that proverbial […]

Public Preview – Apache Atlas and Apache Ranger are now integrated to drive dynamic classification-based security. Why are governance and security better together? How do you keep track of the large number of diverse data objects (think a hundred thousand data entities) in your data lake, a number that continues to increase every day? Now that Apache Hadoop […]