collaborative search
play

Collaborative Search Haystack 28.10.2019 Sadat Anwar and Matthieu - PowerPoint PPT Presentation

Collaborative Search Haystack 28.10.2019 Sadat Anwar and Matthieu Pons We are... Matthieu Pons Backend engineer at reBuy Sadat Anwar Backend Search Engineer (ex- reBuy) Search engineer Delivery Hero 2 1. Context


  1. Collaborative Search Haystack 28.10.2019 Sadat Anwar and Matthieu Pons

  2. We are... Matthieu Pons ● Backend engineer at reBuy ○ Sadat Anwar ● Backend Search Engineer (ex- reBuy) ○ Search engineer Delivery Hero ○ 2

  3. 1. Context 2. Sequential Rules Method 3. Applications

  4. reBuy re-commerce shop ● Media and consumer ● electronics Volatile availability ● 4 4

  5. Context Replace our 3 rd party ● provider Very little product reviews ● Anonymous user sessions ● Difgerent type of ● recommendations 5 5

  6. Context Product User First tried Product/User embeddings ● Satisfying results ○ Problems with implicit and sparse signals ○ ... Moved on to Prod2Vec with gensim ● Worked much better ○ No more problem with implicit signals ○ 6 6

  7. Problem Prod2Vec is “too good” for ● our add-to-cart sessions Comes at a cost (time, GPUs, ● ops) Sequence based is the right ● track 7 7

  8. Sequential Rules Method - Intuition 1 st order Markov chain: A -> R -> F -> C -> G -> E -> H A -> C -> H -> G -> E -> D -> I p(G|C) = .5 p(H|C) = .5 8 8

  9. Sequential Rules Method - Intuition Simple Co-occurrences: A -> R -> F -> C -> G -> E -> H A -> C -> H -> G -> E -> D -> I C : [A, H, G, E, R, D, I] 9 9

  10. Sequential Rules Method - Intuition 10

  11. Sequential Rules Method - Intuition Sequential Rules: A -> R -> F -> C -> G -> E -> H 5 4 3 A -> C -> H -> G -> E -> D -> I 5 4 3 2 1 C: [G: 9, H: 8, E: 7, D: 2, I: 1] 11 11

  12. Sequential Rules - results at rebuy Hit Rate on add to cart sessions: HR @5 HR @10 Provider .230 .359 Prod2Vec .167 .241 Sequential Rules .245 .372 12 12

  13. Sequential Rules - improvements Using a SR on itself: A: [R, F,..., C, Q] B: [G, H,..., C, E] Recommendations for one user Original recommendations for D D: [M,..., C, K, N] session D: [M, C, R,..., G] D re-arranged, based on session 13 13

  14. Sequential Rules - improvements Results of using a SR on itself: HR @5 HR @10 Provider .230 .359 Prod2Vec .167 .210 SR - simple .245 .372 SR - many2many .257 .416 14 14

  15. Sequential Rules - pros and cons Pros: Explainable ● Only one pass through data 0(n) ● Fast training (~10 min. for 10M sessions) ○ Can be written as a map-reduce ● Versatile ● 15 15

  16. Sequential Rules - pros and cons Cons: A -> R -> F -> C -> Q Overfit on small datasets ● A -> G -> H -> C -> E No seasonality ● A -> L -> M -> C -> S Not good at replaying sequences ● A: [C: 9, ...] 16 16

  17. Sequential Rules - original research M. Ludewige, D. Jannach: Evaluation of Session-based Recommendation Algorithms , Oct.2018, https://arxiv.org/pdf/1803.09587.pdf I.Kamehkhosh, M. Ludewige, D. Jannach: A Comparison of Frequent Pattern Techniques and a Deep Learning Method for Session-Based Recommendation , 2017, http://ceur-ws.org/Vol-1922/paper10.pdf 17 17

  18. SR method - original research M. Ludewige, D. Jannach: Evaluation of Session-based Recommendation Algorithms, 5.1.1 Table 3 18

  19. Search Applications

  20. Search Applications 1. (Collaborative) Spell Check 2. Similar Search 3. Hybrid Search and Recommendation 20 20

  21. Status Quo Bad query and results ● Ambiguous situation ● Unsatisfying customer ● experience 21

  22. Status Quo Bad query and results ● Incorrect spellcheck ● Unsatisfying customer ● experience 22

  23. Collaborative Spell Check Solr/Elasticsearch only do text spell check ● min-prefix is too constrictive ● mintendo -> nintendo ○ Word2Vec clustered spelling errors ● 23 23

  24. Collaborative Spell Check 24

  25. How to use SR for search huwai p20 pro -> angebot handy -> handy angebot -> huawei -> huawei p20 pro 5 4 3 2 huwai p20 pro -> huwai -> huawei 5 4 Would give huwai p20 pro -> [huawei: 7, ...] 25

  26. Collaborative Spell Check Applied on enough ● sessions Works on long queries ● Correct error at first ● position 26

  27. Status Quo Correct query and results ● Disappointing from a ● customer point of view Dead-end ● 27

  28. Similar searches 28 28

  29. 29

  30. 30

  31. Similar Search Very little data preprocessing ● needed Increase user engagement ● How to difgerentiate between ● spellcheck and similar search? 31

  32. Status Quo Correct query and results ● Wasted real estate ● Looks broken ● Unsatisfying customer ● experience 32

  33. Hybrid Search and Recommendation Nash bridges | louis de funes | p#1735350 | p#3112824 | p#1492633 | p#1492632 | p#7238 | p#9177463 | p#1166173 | Rabbi jakob | rabbi jacob | p#7483 | james bond Consider not only search terms, but also products ● Boost products that customers interact with ● (Learn-to-Rank?) 33

  34. Hybrid Search and Recommendation Recommend related products when page is not full 34

  35. Hybrid Search and Recommendation Boost products on ambiguous queries 35

  36. Hybrid Search and Recommendation Recommend products on bad queries 36

  37. Q & A 37

  38. Thank You!

  39. Hybrid Search and Recommendation 39

Recommend


More recommend