USING DATA TO FIND THE OPTIMAL MIX OF RETAIL LOCATIONS AND - PowerPoint PPT Presentation

USING DATA TO FIND THE OPTIMAL MIX OF RETAIL LOCATIONS AND RESOURCES

INTRODUCTION Education • BS CS, Georgia Tech 2009 – Theory and Machine Learning • MS CS, Georgia Tech 2011 – Heavy Tail Network Analysis Work • Institute of Nuclear Power Operations (2010-16) • Build, deploy, maintain a model that predicted nuclear power station performance along 13 key functional areas • North Highland (2016-) • ETL, BI, Advanced Analytics for Fortune 100 retailer 2 Proprietary & Confidential

1. Data & Analytics outside academia 2. Case Study: Reassigning territories for district managers 3. Q&A 3 Proprietary & Confidential

WORKING WITH CLIENTS • Problems are never stated formally • “Interesting” problems can be few and far between • But they can build your personal brand 4 Proprietary & Confidential

REMAPPING TERRITORIES – PROBLEM DESCRIPTION • Minimizing travel time for regional managers can reduce incurred travel costs and boost morale • Aligning districts to strategic goals can help ensure a variety of goals: o A level playing field where top talent can be evaluated evenly o Specialized focus for individual district owners o No one regional leader becomes overburdened compared to the others 5 Proprietary & Confidential

AVAILABLE DATA • Store Metadata – geocoding, age, size, store annual sales category, etc. • Sales Data – department, class, subclass, SKU grain data anywhere from monthly roll-ups to individual transactions • Inventory Data • Online Transactions • Current Territory 6 Proprietary & Confidential

(ABBREVIATED) TOOLBOX OF TECHNIQUES Technique 1: k-means • Unsupervised Learning • Identifies a number of means around the map and builds clusters with equal variance inside them • Very much a black box-hard to specify, and requires a lot of tuning • Use if: You want to explore your data, equal size isn’t as important Technique 2: Integer programming • Can specify exactly what you want, but rules are rigid • Computationally impossible for large datasets—constraints have to be relaxed • Use if: You have little data Technique 3: Network construction • Randomized (or can be non-random) algorithm to build out a network ‘greedily’ • Easy to specify and tune parameters as you go • Use if: Iteration is OK, exact solutions aren’t required 7 Proprietary & Confidential

PULLING IT TOGETHER • SQL • Python o Pandas • High performance data management/manipulation, SQL-like interface o Numpy • N-dimensional arrays, math libraries o Scikit-learn • Huge number of supervised and unsupervised ML algorithms prewritten o Networkx • Network/Graph analysis library • Brute force 8 Proprietary & Confidential

Low Spatial Weighting Medium Spatial Weighting High Spatial Weighting 9 Proprietary & Confidential

NETWORKX https://networkx.github.io/ • Graph data structure with huge library of built-ins o Graph Operations • Edge/Node maintenance, weighting, node attributes, etc. o Graph Algorithms • Connectivity, Neighborhoods, k-core, max-flow, matching, bipartite, approximation algorithms, and on and on… o Linear algebra library that takes graph objects • Eigenvalue spectrums, laplacians, PageRank o Generators • Random graph generators (e.g. random normal, Erdős–Rényi, power law) • Canonical graphs (Karate club, Florentine families graph) o Visualization Tools 10 Proprietary & Confidential

GREEDY ALGORITHM OVERVIEW • Load data • Using networkx, build an approximately-planar graph based on district mean locations o Find the norm of the district centers, pick n-closest • Set parameters for “optimizer” • Loop: o Pick manager with lowest score, assign them a random district that’s a neighbor as long as constraints are met • If that manager has no districts, pick a random district to add. o Simulated annealing—jostle where districts are in an attempt to avoid local minima, cooling over time • Once all districts are assigned, score districts and reshuffle them to minimize variance 11 Proprietary & Confidential

LOAD DATA 12 Proprietary & Confidential

BUILD GRAPH 13 Proprietary & Confidential

PARAMETERS AND CONTROLS 14 Proprietary & Confidential

ITERATE AND BE GREEDY • Pick a random manager from the ones that have approximately the lowest score • Get a list of possible districts they could have, and randomly pick one of those • Verify all the constraints (lots of IFs) are met • Perform some simulated annealing along the way—some random chance to jostle districts from one manager to another adjacent manager occasionally to avoid local minima • If all districts are assigned, still grab a local district if it improves your score more than it decreases your neighbor’s score 15 Proprietary & Confidential

RESULTS 16 Proprietary & Confidential

WHY DO IT THIS WAY? • Explainable o Client has minimal experience and trust of advanced analytics, a simple algorithm makes it easier to get buy-in • Repeatable, with little variation o Similar but not identical results allow fine-tuning / re-running to smooth out client concerns • Very easy to tweak in live sessions o Simple code, simple algorithms mean you can modify on-the-fly in response to questions • In this case, all solutions are approximations o There’s no right answer 17 Proprietary & Confidential

SOME OTHER PROJECTS Advanced Analytics Toolkit Are there natural clusters and needs of customers/employees? Predictive/Explanatory modeling Based on forecasted vs. Which actual sales, what stores customers/employees are under-performing? are likely to churn? Behavioral segmentation Where should the next Why? store be located? Survey segmentation and projection Among elderly Which customers are population, who is likely likely to click/convert to need assisted living? Forecasting Key Business How do we create Questions Pricing analytics Which patients are robust tests of content likely to be readmitted? customers are most Why? likely to respond to? Design of Experiments (A/B and MVT) Can we use predictive Who are most likely Text/VOC analytics maintenance to social influencers? minimize production impacts? Social influence propensity What is the next best action/offer for each customer? 18 Proprietary & Confidential

THANK CHARLIE MORN YOU Sr. Data Analyst North Highland www.northhighland.com charlie.morn@northhighland.com 19 Proprietary & Confidential

QUICK OIL CHANGE CHAIN Transactions (Dates/mileage) Acxiom Vehicle(s) PROBLEM (demos/hobbies (Year/Make/Model) /census) Converters / Our client has a large base of customers that are Non converters “oil-only” and have never used them for mechanical services (e.g., belts, brakes, hoses) Store Invoice distance details SOLUTION Coupon behaviors Created 350+ 1 st party variables Develop a predictive model used to target customers most likely to convert so they can receive a differentiated experience on their next visit. PREDICTIVE MODEL PERFORMANCE Perform deep data-mining of prevailing customer 10.0% behaviors to identify ones that tend to lead to conversion 9.4% Target these customers with and just as important, ones that might turn off customers 9.0% aggressive conversion offer (e.g., “over-selling”) 8.0% 7.8% 7.0% A sound byte from the modeling process is that 6.5% 6.0% air filter replacement recommendations tend to turn Next visit conversion rate 5.4% customers off and reduce their chance of mechanical 5.0% 4.9% conversion by 25%. 4.1% 4.0% 3.7% 3.3% 3.0% 2.9% 2.1% 2.0% RESULTS 1.0% Paid back initial investment at two month mark 0.0% (based on net EBIT) 1 2 3 4 5 6 7 8 9 10 Model decile At three months (mid-October 2016), converted 1,377 customers for a total of $350k net NEW mechanical Theory matches reality revenue. Decile 1 – Most likely to convert >> highest next visit conversion (9.4%) Decile 10 – Least likely to convert >> lowest next visit conversion (2.1%) 20 Proprietary & Confidential

USING DATA TO FIND THE OPTIMAL MIX OF RETAIL LOCATIONS AND - PowerPoint PPT Presentation

USING DATA TO FIND THE OPTIMAL MIX OF RETAIL LOCATIONS AND RESOURCES INTRODUCTION Education BS CS, Georgia Tech 2009 Theory and Machine Learning MS CS, Georgia Tech 2011 Heavy Tail Network Analysis Work Institute of

ACI Mix Design Updated Version CIVL 3137 1 ACI Mix Design So-called mix design methods

Hampton Roads Retail Hampton Roads Retail Hampton Roads Retail Hampton Roads Retail Southside

Warm Mix Asphalt Warm Mix Asphalt (WMA 101) (WMA 101) What Is Warm Mix Asphalt ? What Is Warm

Mix-Nets Lecture 19 Some tools for electronic-voting (and other things) Mix-Nets Mix-Nets

Linearly-Homomorphic Signatures and Scalable Mix-Nets Chlo Hbant, Duong Hieu Phan and David

RAKNOR READY MIX CONCRETE T +971 7 2668351 F +971 7 2668910 E raknor@emirates.net.ae RAKNOR

and ERCC RNA Spike-in control Control IL2 treated Control IL2 treated Control IL2 treated

MEDIA KIT COVERAGE LOCATIONS English Language Locations Chinese Language Locations Melbourne

A to Z of Filming Locations at Bowood A Arboretum Archways A to Z of Filming Locations at

MAINLAND NAVIGATING THE WORLD OF RETAIL NAVIGATING THE WORLD OF RETAIL METRO VANCOUVER RETAIL

THE INDEX OF RETAIL PRICES REVISION OF THE INDEX OF RETAIL PRICES INDEX OF RETAIL PRICES The

AFR RETAIL SUMMIT AFR RETAIL SUMMIT PARALLELS GFC TO NOW BROADER BEHAVIOURAL SHIFTS LONG TERM

Preserve Rural Post Offices May 9, 2012 Current Retail Locations 3 Alternate Access

Harbour Ready Mix Plant Ready Mix Truck Parking Proposal General Background Lafarge Harbour

Asphalt Mix Volumetrics Mix Volumetrics Aggregate Particle Bulk Volume (V G ) (M G ,V G ) Water

Superpave TM Mix Design Marshall Mix Design 1. Select suitable aggregates 2. Select a suitable

The term structure of interest rates Financial markets and recession expectations 2018-2019

'()+,)&-.),& 4E&+;.+F,&;.2&6(&)E6,&G+,().=:(&

Adversarial Search R&N 5.15.5 Jacques Fleuriot University of Edinburgh, School of

while Loops Introducing: while Loops General form of a while loop statement: while [boolean

| 1 New gTLD Subsequent Procedures PDP Working Group ICANN65 Monday, 24 June 2019 13:30 to

Water ers o s of t the U he U.S. S. Presented By: Anjali Karve , M.S. Certified Ecologist

Law Law of the of the Sea i Sea in n Historical Perspective Historical Perspective R. R. P.

brief you on maritime security and surveillance cooperation in the Baltic. First of all I want to

USING DATA TO FIND THE OPTIMAL MIX OF RETAIL LOCATIONS AND - PowerPoint PPT Presentation

USING DATA TO FIND THE OPTIMAL MIX OF RETAIL LOCATIONS AND RESOURCES INTRODUCTION Education BS CS, Georgia Tech 2009 Theory and Machine Learning MS CS, Georgia Tech 2011 Heavy Tail Network Analysis Work Institute of

ACI Mix Design Updated Version CIVL 3137 1 ACI Mix Design So-called mix design methods

Hampton Roads Retail Hampton Roads Retail Hampton Roads Retail Hampton Roads Retail Southside

Warm Mix Asphalt Warm Mix Asphalt (WMA 101) (WMA 101) What Is Warm Mix Asphalt ? What Is Warm

Mix-Nets Lecture 19 Some tools for electronic-voting (and other things) Mix-Nets Mix-Nets

Linearly-Homomorphic Signatures and Scalable Mix-Nets Chlo Hbant, Duong Hieu Phan and David

RAKNOR READY MIX CONCRETE T +971 7 2668351 F +971 7 2668910 E raknor@emirates.net.ae RAKNOR

and ERCC RNA Spike-in control Control IL2 treated Control IL2 treated Control IL2 treated

MEDIA KIT COVERAGE LOCATIONS English Language Locations Chinese Language Locations Melbourne

A to Z of Filming Locations at Bowood A Arboretum Archways A to Z of Filming Locations at

MAINLAND NAVIGATING THE WORLD OF RETAIL NAVIGATING THE WORLD OF RETAIL METRO VANCOUVER RETAIL

THE INDEX OF RETAIL PRICES REVISION OF THE INDEX OF RETAIL PRICES INDEX OF RETAIL PRICES The

AFR RETAIL SUMMIT AFR RETAIL SUMMIT PARALLELS GFC TO NOW BROADER BEHAVIOURAL SHIFTS LONG TERM

Preserve Rural Post Offices May 9, 2012 Current Retail Locations 3 Alternate Access

Harbour Ready Mix Plant Ready Mix Truck Parking Proposal General Background Lafarge Harbour

Asphalt Mix Volumetrics Mix Volumetrics Aggregate Particle Bulk Volume (V G ) (M G ,V G ) Water

Superpave TM Mix Design Marshall Mix Design 1. Select suitable aggregates 2. Select a suitable

The term structure of interest rates Financial markets and recession expectations 2018-2019

'()*+*,)&amp;-.)*,&amp; 4E*&amp;+*;.+F,&amp;;.2*&amp;6(&amp;)E6,&amp;G+*,*().=:(&amp;

Adversarial Search R&amp;N 5.15.5 Jacques Fleuriot University of Edinburgh, School of

while Loops Introducing: while Loops General form of a while loop statement: while [boolean

| 1 New gTLD Subsequent Procedures PDP Working Group ICANN65 Monday, 24 June 2019 13:30 to

Water ers o s of t the U he U.S. S. Presented By: Anjali Karve , M.S. Certified Ecologist

Law Law of the of the Sea i Sea in n Historical Perspective Historical Perspective R. R. P.

brief you on maritime security and surveillance cooperation in the Baltic. First of all I want to

'()+,)&-.),& 4E&+;.+F,&;.2&6(&)E6,&G+,().=:(&

Adversarial Search R&N 5.15.5 Jacques Fleuriot University of Edinburgh, School of