We take pride in producing valuable technical blogs and sharing it with a wider audience. Of all the blogs published in 2014 on our website, the following were most popular:
Gopal Vijayaraghavan and Oleg Zhurakousky demonstrate improved Apache Spark, which with the help of the pluggable execution context.
Tim Hall outlines six months of innovation and new features across Apache Hadoop and its related projects.
Arun Murthy explains YARN’s extended capabilities for resource and workload management for long-running services.
Ofer Mendelevitch and Beau Plath illustrate how to build predictive models using Apache Hadoop and Data Science’s Machine Learning Algorithms.
Using Apache Hadoop YARN’s extensible capabilities and multiple workloads resource management, Sidharta Seethana explains how to enable PaaS.
Devaraj Das et al., discuss an integrated architecture for closed-loop operational and analytical processing.
To put your memory into its right place in the storage hierarchy for efficient queries, Julian Hyde proposes a solution for a new kind of data set: Discardable, In-Memory, Materialized Query (DIMMQ).
For heterogeneous storage support in HDFS, Arpit Agrawal explores scenarios that aim to achieve this capability.
Carter Shanklin shares the Stinger.next initiative that delivers batch and interactive SQL query workloads in a single engine.
What it takes to build a thriving Enterprise ecosystem with your partners and why key initiatives—partner, certify, engineer, and resell—are crucial for the ecosystem’s success, explains John Kreisa
Happy New Year!