rahul simha
play

Rahul Simha department of computer science George Washington - PowerPoint PPT Presentation

Big Dating: Computer Science and Relationships Rahul Simha department of computer science George Washington University Office hours: 1pm Tuesdays, 5pm Wednesdays some stats Usage of online da-ng websites : (total registra=ons) OKCupid: 5


  1. Big Dating: Computer Science and Relationships Rahul Simha department of computer science George Washington University Office hours: 1pm Tuesdays, 5pm Wednesdays

  2. some stats Usage of online da-ng websites : (total registra=ons) • OKCupid: 5 million • Chemistry: 11 million • E-harmony: 33 million • POF: 40 million • Tinder: 50 million • Match: 96 million • Badoo: 200 million

  3. your data Exercise 1 Part 1: On the worksheet, write three things about yourself (without iden=fying yourself) that will help you stand out in an online da=ng site Part 2: Write down your height in inches but add a fudge factor of +10 or -10 using the following rule: if you were born in an even-numbered month, add 10. Otherwise subtract 10. Thus, if your height is 67 inches and you were born in March, you would write 57 Follow remaining “snowball” instruc=ons

  4. more stats Percep-on : • 60% of US adults: “online da=ng is a good way to meet people” • 2/3 of online daters have gone on a date with someone they met online • 27% between ages 18-24 have used online da=ng But … 5% of those in commi[ed rela=onships say they met online

  5. why the gap? Exercise 2: At your table, come up with three reasons why online matchups may not lead to commi[ed rela=onships

  6. computer science to the rescue Algorithmic matching : • OKCupid • EHarmony Basic ideas : • Ask lots of ques=ons • Perform some kind of scoring and matching

  7. computer science issues High dimensional data Large data size Privacy, security (height example) Algorithm design

  8. high dimensional data Exercise 3 : draw the points (1,2), (2,1), (2,2), (7,6), (8,7), (9,10), (1,9) on paper. How many clusters do they fall into? Exercise 4 : how many dimensions are present in the survey data you filled?

  9. high dimensional data Exercise 3 : draw the points (1,2), (2,1), (2,2), (7,6), (8,7), (9,10), (1,9) on paper. How many clusters do they fall into? Exercise 4 : how many dimensions are present in the survey data you filled? Algorithmic challenge : effec=ve clustering of high dimensional data E-Harmony: 29 dimensions

  10. the scoring problem Distance measure: • Given any two users, compute “how compa=ble they are” Sort: • Sort all users by compa=bility For every user we now have a sorted list of other users, in order of preference

  11. the matching problem Suppose we need to match people, e.g., H 1 H 2 H 3 with R 1 R 2 R 3 Example matching: H 1 H 2 H 3 R 1 R 2 R 3

  12. the matching problem Suppose we know “spousal” preferences: H 1 `s preferences: R 2 R 1 R 3 H 2 `s preferences: R 2 R 3 R 1 H 3 `s preferences: R 2 R 1 R 3 R 1 `s preferences: H 1 H 2 H 3 R 2 `s preferences: H 3 H 1 H 2 R 3 `s preferences: H 2 H 1 H 3 So H 1 would prefer R 2 as spouse to R 1 and R 1 over R 3

  13. the matching problem Consider Example matching: H 1 `s preferences: R 2 R 1 R 3 H 1 H 2 H 3 H 2 `s preferences: R 2 R 3 R 1 H 3 `s preferences: R 2 R 1 R 3 R 1 R 2 R 3 R 1 `s preferences: H 1 H 2 H 3 R 2 `s preferences: H 3 H 1 H 2 R 3 `s preferences: H 2 H 1 H 3 Exercise 5: what is the problem with this matching?

  14. the matching problem Consider Example matching: H 1 `s preferences: R 2 R 1 R 3 H 1 H 2 H 3 H 2 `s preferences: R 2 R 3 R 1 H 3 `s preferences: R 2 R 1 R 3 R 1 R 2 R 3 R 1 `s preferences: H 1 H 2 H 3 R 2 `s preferences: H 3 H 1 H 2 R 3 `s preferences: H 2 H 1 H 3 H 3 and R 2 will elope! Algorithmic challenge : devise an algorithm to create a stable matching

  15. the proposal algorithm 1. Ini=ally place all H’s in unmarried-list 2. while unmarried-list is not empty 3. H i = lowest numbered from list 4. try R’s in order of H i ’s preference 5. if R j is not matched, match H i and R j 6. else if R j prefers H i to current match then 7. match R j with H i 8. return current match to unmarried list Can prove: provides a stable matching Exercise 6: do the H’s or R’s get the best deal?

  16. the proposal algorithm Demo Prac-cal applica-ons?

  17. the proposal algorithm Demo Prac-cal applica-ons: • Med school internships • Clerkships with judges

  18. the (network) structure of rela-onships Exercise 7: Go to h[ps://oracleomacon.org/ and enter actors in two movies YOU have seen. Try to find two actors with a distance of 4.

  19. the (network) structure of rela-onships The Milgram experiment Demo

  20. the (network) structure of rela-onships The Milgram experiment Demo Facebook: 4.57 (among 1.5b users) Da=ng app based on “who knows who”: Hinge

  21. talk to a bot Exercise 8: Choose between 1. Go to h[p://www.masswerk.at/elizabot/ and converse with Eliza. 2. Talk to Siri and record the exchange on paper. 3. Volunteer as judge. Compe-tors: your conversa=on must be short (two back-and-forths) Judges: pick the best conversa=on. Remember Robert Epstein?

  22. our future with robots Exercise 9: Choose between 1. Yes, it’s fine for humans to marry robots in the future. 2. No, that should never be allowed. Write down your reasons on the worksheet

  23. summary Computer science under the hood: • Programming of websites (for online da=ng) • Servers, networks, clouds, large data: Ø Example: 25 TB of data at E-Harmony, incl. 200+ million images • Algorithms for matching, graph structure • Algorithms for clustering, machine learning, natural languages • Robo=cs / AI

Recommend


More recommend