Gender Bias in Contextualized Word Embeddings
Jieyu Zhao 1, Tianlu Wang 2, Mark Yatskar 3, Ryan Cotterell 4, Vicente Ordonez 2, Kai-Wei Chang 1
1 UCLA, 2 University of Virginia, 3 Allen Institute for AI, 4 University of Cambridge
Two Perspectives of Fairness in ML/NLP
• ML/NLP models should work for everyone
  (Gender Shades: https://www.youtube.com/watch?v=TWWsW1w-BVo [Buolamwini & Gebru '18])
• ML/NLP models should be aware of potential stereotypes in the data/model and avoid propagating them to downstream tasks
Bias in NLP: Word Embeddings
[Figure: occupation words projected along the she-he direction; interactive demo at http://wordbias.umiacs.umd.edu/]
Bias in NLP: Downstream Task
• Coreference resolution
• The model fails for "she" when given the same context
[Figure: coreference predictions under "Semantics Only" and "w/ Syntactic Cues" conditions]
Outline
• The training corpus for ELMo is biased
• Visualizing the gender geometry in ELMo
• Bias propagates to downstream tasks
• Mitigating the bias
Background: ELMo
• Uses information from a language model
• Assigns a word different embeddings depending on its surrounding context (see the sketch below)
  - Context 1: "He taught himself to play the violin."
  - Context 2: "Do you enjoy the play?"
[Figure: word2vec gives "play" a single vector, while ELMo gives it a different vector in each context]
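A minimal sketch of this behavior, assuming an older allennlp release (<= 0.9) that ships ElmoEmbedder; the two sentences are the slide's contexts:

```python
# Minimal sketch: the same word receives different ELMo vectors in
# different contexts. Assumes allennlp <= 0.9, which provides ElmoEmbedder
# (weights are downloaded on first use).
from allennlp.commands.elmo import ElmoEmbedder
from scipy.spatial.distance import cosine

elmo = ElmoEmbedder()

sent1 = "He taught himself to play the violin .".split()
sent2 = "Do you enjoy the play ?".split()

# embed_sentence returns an array of shape (3 layers, num_tokens, 1024);
# take the top layer.
vecs1 = elmo.embed_sentence(sent1)[2]
vecs2 = elmo.embed_sentence(sent2)[2]

play1 = vecs1[sent1.index("play")]
play2 = vecs2[sent2.index("play")]

# A static embedding such as word2vec would give similarity 1.0 here.
print("cosine similarity:", 1 - cosine(play1, play2))
```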
Bias Analysis
• Training dataset bias
• Geometry of the gender bias in ELMo
• Unequal treatment of gender in ELMo
• Downstream task: coreference resolution
Bias in ELMo: Training Dataset Bias
• The training dataset is biased toward male pronouns

Gender               Male pronouns   Female pronouns
Occurrence (×1000)   5,300           1,600

• Male pronouns (he, him, his) occur roughly 3 times more often than female pronouns (she, her)
Bias in ELMo (continued)
• Male pronouns co-occur more frequently with occupation words 1 (see the counting sketch below)
[Bar chart: # co-occurrences (×1000) of male vs. female pronouns with M-biased and F-biased occupations]
1 Zhao et al. Gender Bias in Coreference Resolution: Evaluation and Debiasing Methods. NAACL 2018
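A sketch of the kind of count behind this chart; the toy corpus, occupation list, and window size are illustrative assumptions, not the authors' exact setup:

```python
# Count how often male vs. female pronouns appear near occupation words.
from collections import Counter

MALE = {"he", "him", "his"}
FEMALE = {"she", "her"}
OCCUPATIONS = {"physician", "secretary", "driver", "counselor"}  # e.g. a WinoBias-style list
WINDOW = 10  # tokens on each side of the occupation word

corpus = [  # toy stand-in for the ELMo training corpus
    "The physician said he would review the chart .".split(),
    "The secretary said she had filed the report .".split(),
]

counts = Counter()
for tokens in corpus:
    tokens = [t.lower() for t in tokens]
    for i, tok in enumerate(tokens):
        if tok not in OCCUPATIONS:
            continue
        window = tokens[max(0, i - WINDOW): i + WINDOW + 1]
        counts["male"] += sum(w in MALE for w in window)
        counts["female"] += sum(w in FEMALE for w in window)

print(counts)  # Counter({'male': 1, 'female': 1}) on the toy corpus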
Geometry of Gender in ELMo
• The gender information in ELMo occupies two principal components
[Plot: % of explained variance per ELMo principal component]
Geometry of Gender in ELMo
• Gender-swapped sentence pair (see the PCA sketch below):
  - Male context: "The driver transported the counselor to the hospital because he was paid."
  - Female context: "The driver transported the counselor to the hospital because she was paid."
[Figure: ELMo embeddings of such pairs separate by context gender along the principal components]
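A sketch of one way to probe this geometry: PCA over difference vectors between ELMo embeddings of gender-swapped sentence pairs. The mean-pooling of token vectors and the tiny pair list are illustrative assumptions, not the paper's exact procedure:

```python
# PCA over female-minus-male difference vectors of gender-swapped pairs.
import numpy as np
from sklearn.decomposition import PCA
from allennlp.commands.elmo import ElmoEmbedder

elmo = ElmoEmbedder()

pairs = [  # (female variant, male variant); a real analysis uses many more
    ("The driver transported the counselor to the hospital because she was paid .",
     "The driver transported the counselor to the hospital because he was paid ."),
    ("The physician told the nurse that she should rest .",
     "The physician told the nurse that he should rest ."),
    ("The manager asked the clerk if she had filed the report .",
     "The manager asked the clerk if he had filed the report ."),
]

diffs = []
for female, male in pairs:
    # Mean-pool top-layer token embeddings into one sentence vector.
    f_vec = elmo.embed_sentence(female.split())[2].mean(axis=0)
    m_vec = elmo.embed_sentence(male.split())[2].mean(axis=0)
    diffs.append(f_vec - m_vec)

pca = PCA(n_components=min(len(diffs), 10)).fit(np.stack(diffs))
# The paper's finding: the first two components capture most of the
# variance, i.e. gender occupies a two-dimensional subspace in ELMo.
print(pca.explained_variance_ratio_)
```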
Unequal Treatment of Gender
• Train a classifier f : ELMo(occupation) → context gender (probing sketch below)
• ELMo propagates gender information to other words
• Male information is propagated 14% more accurately than female
[Bar chart: classifier accuracy (%) for male vs. female contexts; y-axis 80-100]
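A sketch of such a probing classifier; the template sentences, train/test split, and logistic-regression probe are illustrative assumptions:

```python
# Probe: predict the context's pronoun gender from the occupation word's
# ELMo embedding.
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split
from allennlp.commands.elmo import ElmoEmbedder

elmo = ElmoEmbedder()
OCCUPATIONS = ["physician", "secretary", "driver", "counselor"]

X, y = [], []
for occ in OCCUPATIONS:
    for pronoun, label in [("he", 0), ("she", 1)]:
        toks = f"The {occ} said {pronoun} was busy .".split()
        # Embedding of the occupation word, not of the pronoun itself.
        X.append(elmo.embed_sentence(toks)[2][toks.index(occ)])
        y.append(label)

X_tr, X_te, y_tr, y_te = train_test_split(np.stack(X), np.array(y), test_size=0.25)
clf = LogisticRegression(max_iter=1000).fit(X_tr, y_tr)
# Comparing accuracy separately on male- and female-context examples
# reveals the 14% gap reported on the slide.
print("accuracy:", clf.score(X_te, y_te))
```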
Bias in Downstream Task: Coreference Resolution
[Figure: coreference predictions under "Semantics Only" and "w/ Syntactic Cues" conditions]
Bias in Downstream Task: Coreference Resolution
• WinoBias dataset 1 : pro-stereotypical (Pro.) and anti-stereotypical (Anti.) variants
• Type 1 (Semantics Only):
  - Pro.: The physician hired the secretary because he was overwhelmed with clients.
  - Anti.: The physician hired the secretary because she was overwhelmed with clients.
• Type 2 (w/ Syntactic Cues):
  - Pro.: The secretary called the physician and told him about a new patient.
  - Anti.: The secretary called the physician and told her about a new patient.
• Bias: the difference in performance between the Pro. and Anti. sets
1 https://uclanlp.github.io/corefBias
Bias in Coreference
• ELMo boosts coreference performance (OntoNotes F1)
• However, it enlarges the bias (Δ, the Pro. vs. Anti. F1 gap)
[Bar chart, Semantics Only: F1 (%) on OntoNotes / Pro. / Anti.; Δ = 26.6 with GloVe vs. Δ = 29.6 with +ELMo]
Mitigating Bias (Method 1)
• Neutralizing ELMo embeddings (see the sketch below)
• Generate gender-swapped variants of each test sentence
• Average the ELMo embeddings of the original and swapped test sentences
• No retraining needed; keeps performance; lightweight
[Bar chart: F1 (%) for GloVe, w/ ELMo, and Neutralizing Embeddings]
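A minimal sketch of the test-time averaging; the swap list is a small illustrative subset of a full swap dictionary:

```python
# Neutralize at test time: average the ELMo embeddings of a sentence and
# its gender-swapped variant.
from allennlp.commands.elmo import ElmoEmbedder

elmo = ElmoEmbedder()

# Illustrative subset; "her" is ambiguous between "him" and "his" and is
# handled crudely here.
SWAPS = {"he": "she", "she": "he", "him": "her", "her": "him",
         "his": "her", "himself": "herself", "herself": "himself"}

def neutralized_embedding(tokens):
    """Element-wise mean of the ELMo embeddings of a sentence and its swap."""
    swapped = [SWAPS.get(t.lower(), t) for t in tokens]
    orig = elmo.embed_sentence(tokens)    # (3, num_tokens, 1024)
    swap = elmo.embed_sentence(swapped)
    return (orig + swap) / 2.0

emb = neutralized_embedding(
    "The physician hired the secretary because he was overwhelmed .".split())
print(emb.shape)
```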
Mitigating Bias (Method 2)
• Data augmentation (see the sketch below)
• Generate gender-swapped variants of the training data
• Better mitigation, but requires retraining
[Bar chart, Semantics Only: F1 (%) on OntoNotes / Pro. / Anti. / Avg. with data augmentation]
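A sketch of the augmentation step, reusing the same illustrative swap subset; the paper's swap dictionary is much larger:

```python
# Augment training data with a gender-swapped copy of every sentence,
# then retrain the coreference model on the union.
SWAPS = {"he": "she", "she": "he", "him": "her", "her": "him",
         "his": "her", "man": "woman", "woman": "man"}

def gender_swap(tokens):
    return [SWAPS.get(t.lower(), t) for t in tokens]

def augment(corpus):
    """Original corpus plus a gender-swapped variant of each sentence."""
    return corpus + [gender_swap(sent) for sent in corpus]

corpus = [
    "The physician hired the secretary because he was overwhelmed .".split(),
    "The secretary called the physician and told him about a new patient .".split(),
]
for sent in augment(corpus):
    print(" ".join(sent))
```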
Bias in NLP/ML: Related Work
• Zhao et al. Men Also Like Shopping: Reducing Gender Bias Amplification using Corpus-level Constraints
• Bolukbasi et al. Man is to Computer Programmer as Woman is to Homemaker? Debiasing Word Embeddings
• Zhao et al. Learning Gender-Neutral Word Embeddings
• Elazar & Goldberg. Adversarial Removal of Demographic Attributes from Text Representation
• Wang et al. Adversarial Removal of Gender from Deep Image Representations
• Xie et al. Controllable Invariance through Adversarial Feature Learning
• Zhao et al. Gender Bias in Coreference Resolution: Evaluation and Debiasing Methods
• Park et al. Reducing Gender Bias in Abusive Language Detection
Thank you!