Hadoop is Transforming Retail

Use Apache Hadoop to Sell More in Stores & Online

We’ve probably all heard the famous quote by John Wanamaker, the father of modern advertising: “Half the money I spend on advertising is wasted; the trouble is, I don’t know which half.”

Zulily is an online retailer using Hortonworks Data Platform to personalize its online product offerings

Wanamaker would love Apache Hadoop, because it diminishes (or eliminates) the dilemma he described.

When Hadoop is integrated with modern retail operations, it dramatically reduces the cost of capturing, ingesting, storing and analyzing data.

Now Wanamaker wannabes can analyze enough empirical retail data to draw statistically sound conclusions, rather than rolling the dice with customer panels, in-store surveys or focus groups to guess what drives sales.

The following reference architecture diagram represents a combination of approaches that we see our retail customers adopt, whether they sell automobiles, ladders, shirts or shoes.

[Figure: Retail reference architecture]

The following are a few of the most common ways that retailers use Hadoop.

Build a 360° View of the Customer

Retailers interact with customers across multiple channels, yet customer interaction and purchase data are often isolated in data silos. Few retailers can accurately correlate eventual customer purchases with marketing campaigns and online browsing behavior.

Apache Hadoop gives retailers a single view of customer behavior. It lets them store data longer and identify phases of the customer lifecycle. Better customer analytics can help increase sales, reduce inventory expenses and retain the best customers.
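As a rough illustration of what a single customer view means in practice, the sketch below merges records from three separate silos into one profile per customer. It is plain Python rather than a Hadoop job, and the record layouts, field names (`customer_id`, `campaign`, `page`, `sku`) and sample values are all hypothetical; on a cluster the same join would typically be expressed in a tool such as Hive or Pig over much larger data sets.

```python
from collections import defaultdict

# Hypothetical records from three separate silos, keyed by a shared customer ID.
campaign_emails = [
    {"customer_id": "c1", "campaign": "spring_sale"},
    {"customer_id": "c2", "campaign": "spring_sale"},
]
web_sessions = [
    {"customer_id": "c1", "page": "/shoes"},
    {"customer_id": "c1", "page": "/shoes/running"},
    {"customer_id": "c3", "page": "/ladders"},
]
store_purchases = [
    {"customer_id": "c1", "sku": "running-shoe", "amount": 89.0},
    {"customer_id": "c3", "sku": "ladder", "amount": 120.0},
]


def build_customer_view(emails, sessions, purchases):
    """Merge per-channel records into one profile per customer."""
    view = defaultdict(lambda: {"campaigns": [], "pages": [], "purchases": []})
    for row in emails:
        view[row["customer_id"]]["campaigns"].append(row["campaign"])
    for row in sessions:
        view[row["customer_id"]]["pages"].append(row["page"])
    for row in purchases:
        view[row["customer_id"]]["purchases"].append((row["sku"], row["amount"]))
    return dict(view)


customer_view = build_customer_view(campaign_emails, web_sessions, store_purchases)

# Customers who both received a campaign and made a purchase.
converted = sorted(
    cid for cid, profile in customer_view.items()
    if profile["campaigns"] and profile["purchases"]
)
print(converted)
```

Once the channels share one profile, questions such as "which campaign recipients went on to buy?" become a simple filter instead of a cross-system reconciliation.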

Analyze Brand Sentiment

Enterprises lack a reliable way to track their brand health. It is difficult to analyze how advertising, competitor moves, product launches or news stories affect the brand. Internal brand studies can be slow, expensive and flawed.

Apache Hadoop enables quick, unbiased snapshots of brand opinions expressed in social media. Retailers can analyze sentiment from Twitter, Facebook, LinkedIn or industry-specific social media streams. With better understanding of customer perceptions, they can align their communications, products and promotions with those perceptions.

Localize & Personalize Promotions

Retailers that can geo-locate their mobile customers can deliver localized and personalized promotions. This requires combining historical data with real-time streaming data.

Apache Hadoop brings the data together to inexpensively localize and personalize promotions delivered to mobile devices. Retailers can develop mobile apps to notify customers about local events and sales that align with their preferences and geographic location (even down to a particular section in a specific store).
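To make the idea concrete, here is a minimal sketch of matching a shopper's current location (the real-time input) against stored category preferences (the historical input). The promotion list, coordinates, store names and the 1 km default radius are assumptions made up for the example, and the great-circle distance uses the standard haversine formula.

```python
import math


def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance between two points, in kilometres."""
    earth_radius_km = 6371.0
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = math.radians(lat2 - lat1)
    dlmb = math.radians(lon2 - lon1)
    a = (math.sin(dphi / 2) ** 2
         + math.cos(phi1) * math.cos(phi2) * math.sin(dlmb / 2) ** 2)
    return 2 * earth_radius_km * math.asin(math.sqrt(a))


# Hypothetical active promotions, each tied to a store location.
promotions = [
    {"store": "downtown", "lat": 40.7508, "lon": -73.9890, "category": "shoes"},
    {"store": "downtown", "lat": 40.7508, "lon": -73.9890, "category": "ladders"},
    {"store": "suburb", "lat": 41.0000, "lon": -74.3000, "category": "shoes"},
]


def select_promotions(lat, lon, preferences, promos, radius_km=1.0):
    """Keep promotions that match the shopper's preferences and are nearby."""
    return [
        promo for promo in promos
        if promo["category"] in preferences
        and haversine_km(lat, lon, promo["lat"], promo["lon"]) <= radius_km
    ]


# Shopper location would come from a mobile app; preferences from purchase history.
nearby = select_promotions(40.7510, -73.9885, {"shoes"}, promotions)
print(nearby)
```

Locating a shopper within a particular section of a store would need finer-grained positioning than GPS coordinates, such as the beacon approach described in the next paragraph.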

In time for the 2013 holiday shopping season, Macy’s launched a test of Apple’s iBeacon technology in two flagship stores. An article about the test described how, “down the road, Macy’s might also ping shoppers on a department-by-department basis, possibly telling them about sneaker sales when they’re in the shoe section, or even recommending nearby products.”

Optimize Websites

Online shoppers leave billions of clickstream data trails. Clickstream data can tell retailers which web pages customers visit and what they buy (or don’t buy) on their site. But at scale, the huge volume of unstructured web logs is difficult to ingest, store, refine and analyze for insight. Storing web log data in relational databases is often too expensive.

Apache Hadoop can store all web logs, for years, at a low cost. Web retailers use information in that data to understand user paths, do basket analysis, run A/B tests and prioritize site updates. This improves online conversions and increases revenue.
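Basket analysis, one of the uses mentioned above, can be sketched in a few lines. The example below counts how often each pair of products appears in the same order and reports that as a share of all orders (the pair's "support"). The baskets are invented sample data, and this is plain Python meant to show the logic only; at web-log scale the same counting would be distributed across the cluster.

```python
from collections import Counter
from itertools import combinations

# Hypothetical completed orders reconstructed from clickstream data.
baskets = [
    {"shirt", "shoes"},
    {"shirt", "shoes", "belt"},
    {"ladder"},
    {"shirt", "belt"},
]


def pair_support(orders):
    """Return the fraction of orders that contain each product pair."""
    counts = Counter()
    for basket in orders:
        # Sort so that each pair is always counted under the same key.
        for pair in combinations(sorted(basket), 2):
            counts[pair] += 1
    total = len(orders)
    return {pair: n / total for pair, n in counts.items()}


support = pair_support(baskets)
for pair, share in sorted(support.items(), key=lambda item: -item[1]):
    print(pair, share)
```

Pairs with high support are candidates for cross-promotion or for placement near each other on the site.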

Optimize Store Layouts

In-store layout and product placement affect sales. Retailers often hire extra staff to make up for a sub-optimal layout (e.g. “Are you finding what you need?”). Brick-and-mortar stores lack “pre-cash register” data about what in-store shoppers do before they buy. In-store sensors, RFID tags and QR codes can fill that data gap, but they generate a lot of data.

Apache Hadoop can store that huge volume of unstructured sensor and location data. Once analyzed, the resulting intelligence allows retailers to reduce costs while improving in-store customer satisfaction. This improves same-store sales and customer loyalty.

Hortonworks Partners with Industry Certifications in Retail

SoftNet Solutions
