predicting piccollage users first purchase
play

Predicting PicCollage users first purchase for targeted promotions - PowerPoint PPT Presentation

Predicting PicCollage users first purchase for targeted promotions Team 2 Reggie Escobar . Eduardo Salazar Uni Ang . Lynn Pan Founded in 2011 100m $2.3m installs Seed funding (2013) Cardinal Blue, Inc. In-app purchases - backgrounds;


  1. Predicting PicCollage users’ first purchase for targeted promotions Team 2 Reggie Escobar . Eduardo Salazar Uni Ang . Lynn Pan

  2. Founded in 2011 100m $2.3m installs Seed funding (2013) Cardinal Blue, Inc. In-app purchases - backgrounds; stickers & watermark removal

  3. About PicCollage watermark Sticker Background

  4. Business Goal & Humanistic Evaluation : Stakeholder PicCollage Target users likely to make a first purchase Business Goal Send personalized promotions Limited user info data hinders Problem : user specific targeted promotions

  5. Data Mining Goal Ranking the user’s with high probability of making a first purchase when they create their first collage Supervised . Forward-looking Categorical: Binary for first purchase (Y/N)

  6. Data Source - One Month New user (2017/9) data from firebase - Data size: 38,740,087 session - Structure: User info + Events info by session First open First Collage Save First Purchase - Variables: User info + User behavior - Create_Collage: Empty / Grid / Remix Background pick : search / URL / library Remix_category Doodle per added First open time Add Photos: type & avg number Sum of Frame try Continent / Country + Add photo from web Sum of Clip Device category Per Collage: Sticker / ... Avg Collage in Library Login Font type : 10 type Num of sticker preview create_collage_empty Share Collage : type + number Export collage : sticker / background/ …..

  7. Data Description and Preparation Filter events before Extract Filter Create derived Missing first purchase / Data By User variables from events value First collage save Country ↑ Sample Over-sampling device language # record % purchase # record % purchase Training data 10,000 28% 9344 50% Validation 11,405 28% 8202 28% data Test data 11,405 28% 8202 28%

  8. Methods & Performance Evaluation ● Task : Ranking ● Benchmark : naive (all class “0”) ● Method ○ Naive Bayes (Binned variables) ○ Classification tree (single) ○ Random Forest ○ Boosted Tree ○ Logistic Regression ● Performance measure ○ Lift Chart ○ Decile lift chart ○ Sensitivity ○ Specificity

  9. Method : Random Forest Non-oversample

  10. Method : Single Tree oversample / Full Tree / terminal 934

  11. Method : Random Forest oversampling

  12. Method : Boosted Tree oversampling

  13. Method : Logistic Regression oversampling Variables selection—Stepwise Num_events remix_cat_Congrats Create_collage_empty remix_cat_Just_for_Fun Num_background_try remix_cat_Labor_Day_Weekend Num_frame_try font_Roboto_BlackItalic Avg_of_image_export Create_collage_grid Avg_photo_facebook Login remix_cat_Back_to_School

  14. Performance Evaluation — Boosted tree — Random Forest — Single Tree — Random Forest (non-oversample) — Logistic Regression — Benchmark ● Boosted Tree and Random Forest are top two best model.

  15. Recommendations • How to use this model for marketing promotion? Offering bundles/discount to users that have a high probability of making a first purchase. • Model recommendation – Due to the unbalanced dataset and ranking goal, we suggest to adopt over-sampling • Date recommendation – The data we are using now is missing the October purchase. – Collect events data per user for their 30 days full history. • Variables recommendation – Getting user information might help to predict first purchase earlier.

Recommend


More recommend