cta

Get Started

cloud

Ready to Get Started?

Download sandbox

How can we help you?

closeClose button
May 24, 2017 | Carter Shanklin | Hadoop Ecosystem

How to connect Tableau to Druid

May 24, 2017 | Tom Hastain | Hortonworks Case Study

CenterPoint Energy: Business Value from Large, Complex Data

May 23, 2017 | Tom Hastain | Hortonworks Case Study

Clearsense: Maximum Healthcare Transformation, Minimal Investment

Viewing posts by: Jitendra Pandey« Back to all

X
FILTERS
ALL
TECHNICAL
BUSINESS

All Topics















All Channels











CLEAR FILTERS

Since its first deployment at Yahoo in 2006, HDFS has established itself as the defacto scalable, reliable and robust file system for Big Data. It has addressed several fundamental problems of distributed storage at unparalleled scales and with enterprise grade robustness. As more and more enterprises adopt Apache Hadoop, it is becoming a unified central […]

We reached a significant milestone in HDFS: the Namenode HA branch was merged into the trunk. With this merge, HDFS trunk now supports HOT failover. Significant enhancements were completed to make HOT Failover work: Configuration changes for HA Notion of active and standby states were added to the Namenode Client-side redirection Standby processing journal from […]

Hadoop RPC is the primary communication mechanism between the nodes in an Apache Hadoop cluster. Maintaining wire compatibility, as new features are added to Apache Hadoop, has been a significant challenge with the current RPC architecture. In this blog, I highlight the architectural improvement in Hadoop RPC and how it enables wire compatibility and rolling […]

Apache Hadoop is equipped with a robust and scalable security infrastructure. It is being used at some of the biggest cluster installations in the world, where hundreds of terabytes of sensitive and critical data are processed every day. Owen O’Malley provided a nice overview of Apache Hadoop security in his blog Motivations for Apache Hadoop Security. […]