cta

Get Started

cloud

Ready to Get Started?

Download sandbox

How can we help you?

closeClose button
May 25, 2017 | Simon Ball

Apache Metron Insight #1: Why real-time enrichment matters

May 25, 2017 | Kevin Jordan | Hortonworks Case Study

TMW Systems Drives Transportation Businesses Out of the Dark with Big Data

May 24, 2017 | Carter Shanklin | Hadoop Ecosystem

How to connect Tableau to Druid

Viewing posts by: Sam Shah« Back to all

X
FILTERS
ALL
TECHNICAL
BUSINESS

All Topics















All Channels











CLEAR FILTERS

If Pig is the “duct tape for big data“, then DataFu is the WD-40. Or something. No, seriously, DataFu is a collection of Pig UDFs for data analysis on Hadoop. DataFu includes routines for common statistics tasks (e.g., median, variance), PageRank, set operations, and bag operations. It’s helpful to understand the history of the library. […]