Crowdsourcing with MTurkR
Thomas J. Leeper
Department of Political Science
Twitter: @thosjleeper | GitHub: leeper | thosjleeper@gmail.com
Imagine we have some data...

   gender var1 var2  first   last      image
1  female  0.5    1  sara    annala    img94.jpg
2  male    0.6    3  julius  haataja   img69.jpg
3  male    1.2    2  ross    meyer     img32.jpg
4  female  0.3    1  sarah   lahti     img96.jpg
5  female  1.1    5  ada     park      img24.jpg
6  female  0.9    2  joan    hernandez img92.jpg
7  female  0.4    1  sofia   korhonen  img87.jpg
8  female  0.1    3  helle   kivela    img52.jpg
9  male    1.8    4  kasper  johnson   img17.jpg
10 male    0.6    2  dirk    luoma     img62.jpg

...but how do we analyze an image variable?
Common crowdsourcing tasks:
- Coding
- Categorization
- Content moderation
- Data search/retrieval/scraping
- Manual translation
- Audio/Video transcription
- Human subjects research
- Writing tasks
- Building training sets
- UX testing
The Ideal Case for Crowdsourcing: Human Intelligence, Massively Parallel
The MTurkR workflow:
Need data ⇒ Design HTML Form ⇒ Create MTurk HIT(s) ⇒ Data Entry (Assignments) ⇒ Review Assignments ⇒ Analyze Data in R
# set API keys in environment variables
library("MTurkR")

BulkCreateFromURLs(
    url = paste0("https://example.com/", 1:10, ".html"),
    title = "Image Categorization",
    description = "Describe contents of an image",
    keywords = "categorization, image",
    reward = .01,
    duration = seconds(minutes = 5),
    annotation = "My Project",
    expiration = seconds(days = 4),
    auto.approval.delay = seconds(days = 1)
)
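The comment above refers to MTurkR's credential lookup. One way to set the keys from within R is a minimal sketch like the following, assuming the standard AWS environment variable names (the values are placeholders):

# hypothetical placeholder values; MTurkR reads AWS credentials from
# the AWS_ACCESS_KEY_ID / AWS_SECRET_ACCESS_KEY environment variables
Sys.setenv("AWS_ACCESS_KEY_ID" = "MY_ACCESS_KEY")
Sys.setenv("AWS_SECRET_ACCESS_KEY" = "MY_SECRET_KEY")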
Get back a data.frame:

    GetAssignments(annotation = "My Project")

The image coding task with 27,500 images took 225 workers about 75 minutes and cost $412.50.

Pay workers with:

    ApproveAssignments(annotation = "My Project")
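Because the assignments come back as an ordinary data.frame, answers can be inspected before payment. A minimal sketch, assuming a form field named "category" (a hypothetical name; answers appear as columns named after the HIT's form fields):

a <- GetAssignments(annotation = "My Project")

# tabulate the responses ('category' is a hypothetical field name)
table(a$category)

# approve completed responses and reject empty ones, with feedback
complete <- a$AssignmentId[a$category != ""]
ApproveAssignments(assignments = complete)
RejectAssignments(assignments = setdiff(a$AssignmentId, complete),
                  feedback = "No category was selected.")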
a <- GenerateHTMLQuestion(file = "hit.html")
hit <- CreateHIT(
    title = "Short Survey",
    description = "5 question survey",
    keywords = "survey, questionnaire",
    duration = seconds(hours = 1),
    reward = .10,
    assignments = 5000,
    expiration = seconds(days = 4),
    question = a$string
)
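GenerateHTMLQuestion() simply wraps an HTML file, so hit.html can be produced however you like. A skeletal sketch written from R (the form contents are hypothetical); note that a working HTMLQuestion also needs JavaScript to copy the assignmentId from the URL query string into the hidden field:

writeLines('<!DOCTYPE html>
<html><body>
<form action="https://www.mturk.com/mturk/externalSubmit" method="post">
  <input type="hidden" name="assignmentId" id="assignmentId" value="">
  <p>1. How old are you?</p>
  <input type="text" name="age">
  <input type="submit" value="Submit">
</form>
</body></html>', "hit.html")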
GetHIT(hit$HITId)

ExtendHIT(hit$HITId,
    add.assignments = 500,
    add.seconds = seconds(days = 1)
)

ExpireHIT(hit$HITId)

ChangeHITType(hit$HITId,
    title = "New, better title",
    reward = 5.00
)
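In between those calls, fielding can be monitored from the console; a small sketch:

# print pending/available assignment counts for a live HIT
HITStatus(hit = hit$HITId)

# or pull an overview of all of your current HITs
SearchHITs()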
Advanced Features
- Choose who works for you ⇒ Qualifications and tests
- Monitor HITs ⇒ Notifications
- Sanction and reward workers ⇒ Qualifications, bonuses, and blocks
- Automatic review ⇒ Review Policies
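A hedged sketch of a few of the worker-management tools above (the worker and assignment identifiers are hypothetical, and the shorthand qualification names are assumptions, so check the package documentation):

# require US-based workers with a 95%+ approval rate
q <- GenerateQualificationRequirement(
    c("Locale", "Approved"),
    c("==", ">"),
    c("US", 95),
    preview = TRUE)

# reward a good worker for a specific assignment...
GrantBonus(workers = "A1EXAMPLEWORKER",
           assignments = "2EXAMPLEASSIGNMENT",
           amounts = "0.50",
           reasons = "Excellent work!")

# ...or block a problematic one
BlockWorker("A2EXAMPLEWORKER",
            reasons = "Repeatedly failed attention checks")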
Anatomy of an MTurkR App:
CreateHIT() (with Review Policies) ⇒ Assignments ⇒ Check Known Answer(s) ⇒ Approve/Reject ⇒ Compare w/ Other Assignments ⇒ Approve/Reject ⇒ GetReviewResults()
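A sketch of what the review-policy piece might look like, assuming parameter names that mirror Amazon's "SimplePlurality" policy (the "category" field name is hypothetical):

# approve/reject by comparing each worker's answer against
# the other assignments for the same HIT
plurality <- GenerateHITReviewPolicy(
    QuestionIds = "category",
    QuestionAgreementThreshold = 66,
    ApproveIfWorkerAgreementScoreIsAtLeast = 66,
    RejectIfWorkerAgreementScoreIsLessThan = 33)

# attach at creation time, then inspect outcomes later:
# hit <- CreateHIT(..., hit.review.policy = plurality)
# GetReviewResults(hit$HITId)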
What’s next?
1. Packages for more crowdsourcing platforms (a common interface?)
2. HIT templates
3. Performance improvements
# Start Crowdsourcing

# CRAN
install.packages("MTurkR")

# GitHub (requires the devtools package)
devtools::install_github("leeper/MTurkR")

# Questions?
# thosjleeper@gmail.com
# https://github.com/leeper/MTurkR/wiki