6:00 PM- 6:30 PM: drinks, mingling
6:30 PM – 8:30PM: Hands-on: Data Science at Scale with HAWQ and MADlib and Hadoop
In this Meetup we’ll learn about Apache HAWQ, the elastic, parallel processing query engine that operates on all your data directly within Hadoop. We’ll also learn about Apache MADlib, the big data machine-learning library that provides commonly used data science algorithms capable of leveraging the parallel processing capabilities of HAWQ.
The main part of this event will be a guided hands-on where we use Apache Zeppelin as the notebook to perform a data science investigation of our data in Hadoop by invoking MADlib functions in Python, R, and directly with SQL.
Feel free to come watch the extended demonstration. If you want to play-along with your own sandbox, please bring a system that meets these minimum requirements. The software will be distributed by a USB drive:
· VirtualBox 4.2 or later, or VMWare 5.0 or later installed Pre-downloaded Sandbox VM with HAWQ
· 15 GBs free disk space
This meetup will be at new location @ WEWORK MARKET ST.
1601 Market Street Philadelphia PA 19103 (19th floor)
About our sponsor:
WeWork is a community for creators. We transform buildings into
beautiful, collaborative workspaces and provide the infrastructure, services,
events and technology so our members can focus on doing what they love.
WeWork currently has 111 locations in 29 cities across the world with over
70,000 members. Book a tour at wework.com now!