Understanding the Origins of Bias in Word Embeddings. Marc-Etienne Brunet, Colleen Alkalay-Houlihan, Ashton Anderson, Richard Zemel
Introduction
Graduate student at U of T (Vector Institute). I work at the intersection of Bias, Explainability, and Natural Language Processing. Collaborated with Colleen Alkalay-Houlihan; supervised by Ashton Anderson and Richard Zemel.
Many Forms of Algorithmic Bias
For example:
● Facial Recognition
● Automated Hiring
● Criminal Risk Assessment
● Word Embeddings
How can we attribute the bias in word embeddings to the individual documents in their training corpora?
> Background Method Overview Critical Details Experiments
Word Embeddings: Definitions in Vector Space
Definitions encode relationships between words, e.g. cleaner : cleaning :: leader : leading (a "role" vs. "action" direction).
Problematic Definitions in Vector Space
Definitions encode relationships between words, but can also encode problematic associations: along the male/female direction, leader aligns with man and cleaner with woman. Tolga Bolukbasi, Kai-Wei Chang, James Zou, Venkatesh Saligrama, Adam Kalai (NeurIPS 2016)
Measuring Bias in Word Embeddings
How can we measure bias in word embeddings? Example word sets: T = cleaner, S = leader, B = woman, A = man.
Measuring Bias in Word Embeddings
Implicit Association Test (IAT): T = cleaner, S = leader, B = woman, A = man.
Word Embedding Association Test (WEAT): association(S, A) ≈ Σ_{s∈S, a∈A} cos(s, a).
Aylin Caliskan, Joanna J. Bryson, Arvind Narayanan (Science 2017)
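For concreteness, here is a minimal numpy sketch of the WEAT statistic as defined by Caliskan et al.; `emb` is a hypothetical dict mapping words to vectors, and the word sets on the slide are only illustrative.

```python
import numpy as np

def cos(u, v):
    # Cosine similarity between two word vectors.
    return u @ v / (np.linalg.norm(u) * np.linalg.norm(v))

def association(w, A, B, emb):
    # s(w, A, B): mean similarity of w to attribute set A minus attribute set B.
    return (np.mean([cos(emb[w], emb[a]) for a in A])
            - np.mean([cos(emb[w], emb[b]) for b in B]))

def weat_effect_size(S, T, A, B, emb):
    # WEAT effect size: difference of the mean associations of the two target
    # sets, normalized by the standard deviation over all target words.
    assoc_S = [association(w, A, B, emb) for w in S]
    assoc_T = [association(w, A, B, emb) for w in T]
    return (np.mean(assoc_S) - np.mean(assoc_T)) / np.std(assoc_S + assoc_T)
```

With the toy sets on the slide this reduces to comparing cos(leader, man) - cos(leader, woman) against cos(cleaner, man) - cos(cleaner, woman).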
Measuring Bias
WEAT on popular corpora matches IAT study results.

Target Words          Attribute Words            IAT effect size   IAT p-val   WEAT effect size   WEAT p-val
Flowers vs. Insects   Pleasant vs. Unpleasant    1.35              1.0E-08     1.5                1.0E-07
Math vs. Arts         Male vs. Female Terms      0.82              1.0E-02     1.06               1.8E-02
...                   ...                        ...               ...         ...                ...

"Semantics derived automatically from language corpora contain human-like biases" (Aylin Caliskan, Joanna J. Bryson, Arvind Narayanan, Science 2017)
Background > Method Overview Critical Details Experiments
How can we attribute the bias in word embeddings to the individual documents in their training corpora?
From Word2Bias
X: Corpus (e.g. Wikipedia, documents 1 ... n) → GloVe → {w_i} = w(X): Word Embedding → WEAT (e.g. Male/Female vs. Career/Family) → B(w(X)): Bias Measured
Differential Bias
Idea: consider the differential contribution of each document. Remove document k from the corpus and measure the resulting change in bias, ∆B.
Bias Attributed: Differential Bias of each document.

Document ID   ∆B
1             -0.0014
2              0.0127
...            ...
k              0.0374
...            ...
n              0.0089
Analyse Metadata?
The same table can be extended with document metadata (Year, Author, ...) to analyse which kinds of documents contribute the bias.
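Conceptually, the differential bias of a document is just a leave-one-out quantity. Below is a naive sketch (far too slow in practice, since it retrains GloVe once per document) using a hypothetical `train_glove` helper together with the `weat_effect_size` sketch above; the sign convention, bias with the document minus bias without it, is assumed here.

```python
def differential_bias_naive(corpus, doc_k, S, T, A, B):
    # Leave-one-out definition: how much does removing doc_k change the bias?
    emb_full = train_glove(corpus)                                  # w(X)
    emb_wo_k = train_glove([d for d in corpus if d is not doc_k])   # w(X with doc_k removed)
    return (weat_effect_size(S, T, A, B, emb_full)
            - weat_effect_size(S, T, A, B, emb_wo_k))
```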
Bias Gradient
The same pipeline, X: Corpus (e.g. Wikipedia) → GloVe → {w_i} = w(X): Word Embedding → WEAT → B(w(X)): Bias Measured, can also be differentiated with respect to the corpus, giving a bias gradient.
Background Method Overview > Critical Details Experiments
Computing the Components
Fast & Easy: math, automatic differentiation, or two evaluations of B(w).
Slow & Hard: differentiating through an entire training procedure:
- Leave-one-out retraining? (time-bound)
- Backprop through training? (memory-bound)
- Approximate using influence functions (Koh & Liang, ICML 2017)
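As a rough sketch of the "fast and easy" half: once an estimate of the perturbed embedding is available (the hard part, addressed below), the change in bias can be obtained either by evaluating B twice or from a first-order expansion with the gradient of B. Names here are illustrative; `bias_fn` could be the WEAT sketch above.

```python
def delta_bias_two_evals(emb, emb_tilde, bias_fn):
    # Differential bias from two evaluations of the (cheap) bias metric B(w).
    return bias_fn(emb) - bias_fn(emb_tilde)

def delta_bias_first_order(emb, emb_tilde, grad_bias_fn):
    # First-order alternative: dB ~= grad_w B(w) . (w - w_tilde).
    # grad_bias_fn can come from hand-derived math or automatic differentiation.
    grad = grad_bias_fn(emb)              # dict word -> gradient vector
    return sum(grad[w] @ (emb[w] - emb_tilde[w]) for w in grad)
```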
Influence Functions
Give us a way to approximate the change in model parameters without retraining: with model parameters θ and a perturbation ∆X of the training data, the new parameters are approximated as θ̃ ≈ infl_func(θ, ∆X).
Influence Functions
The approximation requires an inverse Hessian. For GloVe this is a 2VD x 2VD matrix, and 2VD can easily be > 10^9.
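A generic influence-function update looks like a single Newton-style step; the sketch below follows the up-weighting argument of Koh & Liang. The point of the slide is that for GloVe the full Hessian is far too large to form or invert directly, which motivates the block-diagonal trick on the next slides.

```python
import numpy as np

def influence_removal_estimate(theta, hessian, grad_removed, n):
    # Influence-function estimate of the parameters after removing training
    # points whose summed loss gradient is grad_removed (n = training set size):
    #     theta_tilde ~= theta + (1/n) * H^{-1} @ grad_removed
    # Only feasible when the Hessian is small enough to solve against.
    return theta + np.linalg.solve(hessian, grad_removed) / n
```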
Applying Influence Functions to GloVe
GloVe loss: J = Σ_{i,j} f(X_ij) (w_i · w̃_j + b_i + b̃_j - log X_ij)^2, where the word vectors w_i are the parameters of interest and the other parameters (context vectors w̃_j and biases) are treated as constant.
Applying Influence Functions to GloVe
Taking the gradient of the pointwise loss, the Hessian becomes block diagonal (V blocks of size D x D)! This allows us to apply the influence function approximation to one word vector at a time.
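Under the slide's simplification (context vectors and biases held constant), each word's D x D Hessian block can be formed and solved independently. Below is a sketch of the resulting per-word update, written as one Newton step on that word's pointwise GloVe loss; the notation is illustrative rather than the paper's exact formulation, but it captures the structural point that only small D x D systems need to be solved.

```python
import numpy as np

def glove_word_update(w_i, b_i, ctx_vecs, ctx_biases, x_row, f):
    # Approximate the retrained vector of word i after its co-occurrence row
    # changes to x_row, holding context vectors and biases fixed.
    # Pointwise loss: sum_j f(X_ij) * (w_i . c_j + b_i + b_j - log X_ij)^2
    D = w_i.shape[0]
    grad = np.zeros(D)
    hess = np.zeros((D, D))
    for j, x_ij in enumerate(x_row):
        if x_ij <= 0:
            continue  # GloVe sums only over nonzero co-occurrences
        err = w_i @ ctx_vecs[j] + b_i + ctx_biases[j] - np.log(x_ij)
        grad += 2.0 * f(x_ij) * err * ctx_vecs[j]
        hess += 2.0 * f(x_ij) * np.outer(ctx_vecs[j], ctx_vecs[j])
    # One Newton step: w_i_tilde ~= w_i - H_i^{-1} grad_i
    return w_i - np.linalg.solve(hess, grad)
```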
Algorithm: Compute Differential Bias
[Algorithm figure: for each document, only the vectors of the WEAT words need to be recomputed.]
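Since the algorithm figure did not survive extraction, here is a high-level sketch of how the pieces above might be combined; all helper names (`doc_cooc_fn`, `bias_fn`, the co-occurrence layout) are illustrative assumptions, not the paper's exact algorithm. The key efficiency point from the slide is that only the WEAT words' vectors are updated per document.

```python
def differential_bias_per_doc(docs, cooc, word_vecs, word_biases,
                              ctx_vecs, ctx_biases, weat_words,
                              bias_fn, doc_cooc_fn, f):
    # cooc: dict word -> length-V co-occurrence row for the full corpus.
    # doc_cooc_fn(doc): dict word -> that document's contribution to the row.
    base = bias_fn(word_vecs)
    deltas = {}
    for doc_id, doc in enumerate(docs):
        doc_rows = doc_cooc_fn(doc)
        vecs_tilde = dict(word_vecs)          # copy; only WEAT words get replaced
        for w in weat_words:
            row = cooc[w] - doc_rows.get(w, 0.0)     # row with this doc removed
            vecs_tilde[w] = glove_word_update(word_vecs[w], word_biases[w],
                                              ctx_vecs, ctx_biases, row, f)
        deltas[doc_id] = base - bias_fn(vecs_tilde)  # > 0 if the doc increases bias
    return deltas
```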
Background Method Overview Critical Details > Experiments
Objectives of Experiments
1. Assess the accuracy of our influence function approximation
2. Identify and analyse the most bias-impacting documents
WEAT word sets used
WEAT 1: S = Science, T = Arts; A = Male, B = Female
WEAT 2: S = Instruments, T = Weapons; A = Pleasant, B = Unpleasant
Corpora: [list shown on slide]
Differential Bias
[Histograms of differential bias (%) per document, shown on a log scale.]
A single document, ≈ 0.00007% of the corpus, can increase bias by 0.35%!
Approximated WEAT vs. Ground Truth WEAT
[Scatter plots comparing the approximated WEAT bias to the ground-truth WEAT bias, for the baseline bias (no removals) and after removal of the bias-increasing documents (0.7% of the corpus).]
Document Impact Generalizes
WEAT 1 (Science vs. Arts Gender Bias):

              remove bias-increasing docs   baseline (no removals)   remove bias-decreasing docs
GloVe         -1.27                         1.14                     1.7
word2vec       0.11                         1.35                     1.6

Removal of documents also affects word2vec, and other metrics!
Limitations & Future Work Consider multiple biases at simultaneously ● Use metrics that depend on more words ● Consider bias in downstream tasks where embeddings are used ● Does this carry over to BERT ? ●
Recap
● Bias can be quantified; it correlates with known human biases (e.g. cleaner/leader vs. woman/man)
● We can identify the documents that most impact bias, and approximate their impact
● These documents are qualitatively meaningful, and their impact generalizes
Thank you! Poster #146. mebrunet@cs.toronto.edu, arXiv: 1810.03611. Marc, Colleen, Ashton, Rich
References
● T. Bolukbasi, K.-W. Chang, J. Zou, V. Saligrama, and A. Kalai. Man is to computer programmer as woman is to homemaker? Debiasing word embeddings. In 30th Conference on Neural Information Processing Systems (NIPS), 2016.
● A. Caliskan, J. J. Bryson, and A. Narayanan. Semantics derived automatically from language corpora contain human-like biases. Science, 356(6334):183–186, 2017.
● P. W. Koh and P. Liang. Understanding black-box predictions via influence functions. In Proceedings of the 34th International Conference on Machine Learning, volume 70 of Proceedings of Machine Learning Research, pages 1885–1894, 2017.
Measuring Bias “...results raise the possibility that all implicit human biases are reflected in the statistical properties of language.” Aylin Caliskan, Joanna J. Bryson, Arvind Narayanan (Science 2017)
Impact on Word2Vec
Removal of Documents Identified by our Method:

              Decrease (0.7%)   Baseline   Increase (0.7%)
GloVe         -1.27             1.14       1.7
word2vec       0.11             1.35       1.6