understanding e commerce fraud from autonomous chat with
play

Understanding E-commerce Fraud from Autonomous Chat with - PowerPoint PPT Presentation

Into the Deep Web: Understanding E-commerce Fraud from Autonomous Chat with Cybercriminals Peng Wang , Xiaojing Liao, Yue Qin, XiaoFeng Wang Indiana University Bloomington February 26, 2020 E-commerce fraud online fraudsters February 26,


  1. Into the Deep Web: Understanding E-commerce Fraud from Autonomous Chat with Cybercriminals Peng Wang , Xiaojing Liao, Yue Qin, XiaoFeng Wang Indiana University Bloomington

  2. February 26, 2020 E-commerce fraud online fraudsters

  3. February 26, 2020 Crowdsourcing in e-commerce fraud Crowdsourcing

  4. February 26, 2020 Crowdsourcing via IM Crowdsourcing via Instant Messaging (IM)

  5. February 26, 2020 Bonus hunting $$ $$ + $$ Small-time Bonus workers hunters + E-commerce platforms

  6. February 26, 2020 Fraud account trading Account trading storefronts $$ Type1-----$0.5 Account Type2-----$0.8 merchants Type3-----$1.5 Type4-----$4.5 $$ $$ E-commerce platforms Small-time workers

  7. February 26, 2020 SIM farming SIM Sources: - VoIP cards - … $$ $$ $$ Carriers SIM farmers Account SIM farms merchants (websites or software)

  8. February 26, 2020 E-commerce fraud ecosystem SIM farmers Small-time $$ workers + Fake transaction operators Account + $$ $$ fraudsters E-commerce platforms

  9. February 26, 2020 E-commerce fraud groups Fake review groups Fraud account groups on Telegram on QQ

  10. February 26, 2020 E-commerce fraud group chat Group chat

  11. February 26, 2020 Threat intelligence gathering: collecting evidence - based threat information about an existing or emerging threat Fraud account merchants: SIM farmers: Fraud account operators: 1) Account types 1) SIM card source 1) Fraud order tasks 2) Store link 2) Gateway link/tool 2) Shipping address 3) Payment method 3) Account merchants 3) Report link 4) SIM card source 4) Hack tools 4) Hack tools 5) Hack tools 5) Account merchants 6) Fraud order tasks

  12. February 26, 2020 Group chat V.S. individual chat account type account store link SIM card source hack tool name Group chat V.S. Individual chat

  13. February 26, 2020 Intelligence gathering challenges • Active intelligence gathering • useful intelligence is only shared through one-on-one conversation • the number of new fraudsters keep growing

  14. February 26, 2020 Intelligence gathering challenges • Active intelligence gathering • useful intelligence is only shared through one-on-one conversation • the number of new fraudsters keep growing • Automated conversation with fraudsters • existing chatbots can not collect e-commerce threat intelligence • how to strategically lead the fraudsters to discuss the target threat intelligence is complicated

  15. February 26, 2020 Aubrey Autonomous chatbot for intelligence discovery • first autonomous conversation system for active threat intel. gathering from e-commerce miscreants • effectively extract great number of valuable fraud- related artifacts • new insights into the e-commerce fraud ecosystem

  16. February 26, 2020 Information exchange SIM farmers Small-time $$ workers + Fake transaction operators Account + $$ $$ fraudsters E-commerce platforms

  17. February 26, 2020 Observation E-commerce fraudster Small-time worker

  18. February 26, 2020 Observation Question Question Answer Question Question Answer Answer Answer E-commerce fraudster Small-time worker

  19. February 26, 2020 Architecture

  20. February 26, 2020 Target Finder 150 fraud IM groups keyword features behavioral features intent indicators

  21. February 26, 2020 Strategy Generator seed conversations IM group chats E-comm forum posts

  22. February 26, 2020 FSM definition 5-tuple: 𝑇, 𝑆, 𝜀, 𝑡 & , 𝐹 𝑇 : set of states, question Aubrey can send to the target roles 𝑆 : set of responses from the target roles 𝜀 : 𝑇 × 𝑆 → 𝑇 , state transition function, decide the next state 𝑡 & : start state 𝐹 : end state

  23. February 26, 2020 Seed conversation

  24. February 26, 2020 Segmentation dialog blocks + text clustering Seed conversation

  25. February 26, 2020 Topic detection dialog blocks SimSource account types Cross-role storelink + text clustering topic identification + Seed conversation

  26. February 26, 2020 Dialog Manager

  27. February 26, 2020 Retrieval model • FSM for retrieval model Current state ✕ Response is interrogative → Retrieval model state sentence Q&A pairs most similarity relevant answer Answers for fraudsters

  28. February 26, 2020 Evaluation 470 miscreants 7,250 communication messages

  29. February 26, 2020 Threat intelligence analysis E-commerce miscreants and corresponding threat intelligence

  30. February 26, 2020 Intelligence from SIM farmers 90% were used for account registration 72% accounts were used to order online

  31. February 26, 2020 Intelligence from Account merchants Abused private APIs and hack tools never been known before

  32. February 26, 2020 Intelligence from Fraud operators

  33. February 26, 2020 Hidden criminal infrastructures Complicity of roles

  34. February 26, 2020 Conclusion Lesson learnt • Chatbot is effective to study the cybercrime which are highly rely on crowdsourcing • Account trading lies at the center of the fraud ecosystem, more effort should be put to mitigate the fraud account threats Future work • The current implementation of Aubrey is simple while effective; • more complicated conversation (jargon identification), larger open domain corpora, hybrid model with human analyst involvement https://sites.google.com/view/aubreychatbot

  35. February 26, 2020 Thank you !

  36. February 26, 2020 Discussion • Scope • collected threat intel. is related to Chinese e-commerce platforms • Generalization • with target intel. and domain-specific corpora, Aubrey can be re- trained to chat with other roles (drug dealers etc.) and languages • Impact • fraud-related artifacts can be used as ground truth • fix exposed private APIs to raise the bar for automated abuse • stop fraudulent activities at the early stage

  37. February 26, 2020 FSM for fake account trading

  38. February 26, 2020 FSM for SIM farm and fake order operation FSM for fake order operation FSM for SIM farm

  39. February 26, 2020 Knowledge source extension candidate similar as questions Questions seed questions for miscreants IM group chats + extract Answers Forum discussions candidate Q&A pairs to miscreants Q&A pairs

  40. February 26, 2020 Data collection • Datasets Dataset # of raw data # of dialog pairs Seed conversation 800 200 IM group discussion 1 Million 50,000 Forum discussion 135,000 700,000

  41. February 26, 2020 Evaluation • Role identification classifier • Ground truth: 500 upstream, 180 downstream, 3,000 unrelated actors • Unknown set: 20,265 IM group members (from 150 IM groups) • Effectiveness: upstream: 87.0% precision, 91.2% recall downstream: 81.1% precision, 95.6% recall upstream actor: 89.0% precision, 92.8% recall overall: 86.2% F1 score 1,044 SIM farmers, 700 account merchants, 2,648 fraud order ops • Accuracy • 545 chat attempts, 470 responded (185 SIM farmers, 130 account merchants, 155 fraud order operators); • one questioned Aubrey • 97.4% (458) accuracy

  42. February 26, 2020 Effectiveness 52% CDF of interaction round per miscreant CDF of interaction round for intel. gathering

  43. February 26, 2020 Case study Revenue = sales * price = $48K/month Account inventory and price tracking

Recommend


More recommend