Making Generative Classifiers Robust to Selection Bias
Andrew Smith, Charles Elkan
November 30th, 2007
Outline
◮ What is selection bias?
◮ Types of selection bias.
◮ Overcoming learnable bias with weighting.
◮ Overcoming bias with maximum likelihood (ML).
◮ Experiment 1: ADULT dataset.
◮ Experiment 2: CA-housing dataset.
◮ Future work & conclusions.
What is selection bias?
Traditional semi-supervised learning assumes:
◮ Some samples are labeled, some are not.
◮ Labeled and unlabeled examples are identically distributed.
Semi-supervised learning under selection bias:
◮ Labeled examples are not selected at random from the general population.
◮ Labeled and unlabeled examples may be distributed differently.
Examples
◮ Loan application approval
  ◮ Goal: model the repay/default behavior of all applicants.
  ◮ But the training set includes labels only for people who were approved for a loan.
◮ Spam filtering
  ◮ Goal: an up-to-date spam filter.
  ◮ But while up-to-date unlabeled emails are available, hand-labeled data sets are expensive and may be rarely updated.
Framework
Types of selection bias are distinguished by conditional independence assumptions between:
◮ x, the feature vector.
◮ y, the class label. If y is binary, y ∈ {1, 0}.
◮ s, the binary selection variable: if y_i is observable then s_i = 1, otherwise s_i = 0.
Types of selection bias – No bias
[Graphical model over x, y, s]
s ⊥ x, s ⊥ y
◮ The standard semi-supervised learning scenario.
◮ Labeled examples are selected completely at random from the general population.
◮ The missing labels are said to be “missing completely at random” (MCAR) in the literature.
Types of selection bias – Learnable bias
[Graphical model over x, y, s]
s ⊥ y | x
◮ Labeled examples are selected from the general population depending only on the features x.
◮ A model p(s | x) is learnable.
◮ The missing labels are said to be “missing at random” (MAR), or to exhibit “ignorable bias”, in the literature.
◮ p(y | x, s = 1) = p(y | x).
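A minimal sketch of why p(s | x) is learnable under MAR (my own illustration, not code from the talk; the toy data and variable names are hypothetical): the selection indicator s is observed for every example, so any probabilistic classifier, here scikit-learn's LogisticRegression, can be fit to predict s from x without ever seeing the missing labels y.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

# Toy setup: X holds features of ALL examples, labeled and unlabeled;
# s[i] = 1 iff y[i] is observed. Under MAR, selection depends on x only.
rng = np.random.default_rng(0)
X = rng.normal(size=(1000, 2))
s = rng.binomial(1, 1 / (1 + np.exp(-X[:, 0])))

# Since s is fully observed, p(s = 1 | x) can be estimated directly,
# with no need for the hidden labels y.
selection_model = LogisticRegression().fit(X, s)
p_s1_given_x = selection_model.predict_proba(X)[:, 1]
```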
Model mis-specification under learnable bias
p(y | x, s = 1) = p(y | x) implies decision boundaries are the same in the labeled and general populations. But suppose the model is mis-specified? Then a sub-optimal decision boundary may be learned under MAR bias.
[Figure: two scatterplots of the same data, “Viewing hidden labels” and “Ignoring samples without labels”, comparing the true boundary, the best mis-specified boundary, the estimated mis-specified boundary, and the estimated well-specified boundary.]
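A toy simulation of this effect (my own construction, not from the slides): the true boundary is quadratic, selection is MAR and concentrated in one region of feature space, and a linear logistic regression, which is mis-specified here, is fit on the labeled examples alone. Its general-population accuracy is typically worse than that of the same mis-specified model fit with all labels visible.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(1)
X = rng.uniform(-2.0, 2.0, size=(5000, 2))
y = (X[:, 1] > X[:, 0] ** 2 - 1).astype(int)        # quadratic true boundary
s = rng.binomial(1, 1 / (1 + np.exp(3 * X[:, 0])))  # MAR: selection depends on x only

# A linear logistic regression cannot represent the quadratic boundary.
biased_fit = LogisticRegression().fit(X[s == 1], y[s == 1])
oracle_fit = LogisticRegression().fit(X, y)         # if all labels were visible
print("general-population accuracy, biased fit: %.3f" % biased_fit.score(X, y))
print("general-population accuracy, oracle fit: %.3f" % oracle_fit.score(X, y))
```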
Types of selection bias – Arbitrary bias
[Graphical model over x, y, s]
◮ Labeled examples are selected from the general population possibly depending on the label itself.
◮ No independence assumptions can be made.
◮ The missing labels are said to be “missing not at random” (MNAR) in the literature.
Overcoming bias – Two alternate goals
The training data consist of {(x_i, y_i) | s_i = 1} and {x_i | s_i = 0}. Two goals are possible:
◮ General population modeling: learn p(y | x), e.g. loan application approval.
◮ Unlabeled population modeling: learn p(y | x, s = 0), e.g. spam filtering.
Overcoming learnable bias – General population modeling
Lemma 1: Under MAR bias in the labeling,
p(s = 1) p(x, y | s = 1) = p(s = 1 | x) p(x, y),
provided all probabilities are non-zero.
Equivalently, p(x, y) = [p(s = 1) / p(s = 1 | x)] p(x, y | s = 1): the distribution of samples in the general population is a weighted version of the distribution of labeled samples. Since p(s | x) is learnable, we can estimate the weights.
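A sketch of the resulting weighting scheme (my own illustration under the Lemma 1 identity; the paper's specific generative classifier is not shown here, and any model accepting per-example weights, such as scikit-learn estimators via sample_weight, could stand in): each labeled example gets weight p(s = 1) / p(s = 1 | x), so the weighted labeled sample mimics the general population.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

def weighted_fit(X, y, s, classifier=None):
    """Train on labeled examples, reweighted to mimic the general population.

    Implements the weighting implied by Lemma 1:
        p(x, y) = [p(s = 1) / p(s = 1 | x)] * p(x, y | s = 1)
    X: (n, d) features for ALL examples; y: labels (only y[s == 1] is used);
    s: binary selection indicator.
    """
    # Step 1: estimate p(s = 1 | x) from all examples (possible under MAR).
    selection_model = LogisticRegression().fit(X, s)
    p_s1_given_x = selection_model.predict_proba(X[s == 1])[:, 1]

    # Step 2: importance weights p(s = 1) / p(s = 1 | x) for labeled examples,
    # with the propensity clipped away from zero for numerical stability.
    weights = s.mean() / np.clip(p_s1_given_x, 1e-6, None)

    # Step 3: fit the final classifier with per-example weights.
    clf = classifier if classifier is not None else LogisticRegression()
    return clf.fit(X[s == 1], y[s == 1], sample_weight=weights)
```

Examples with a low probability of being labeled receive high weight, which is exactly the inverse-propensity correction: rarely selected regions of feature space are up-weighted so the labeled sample stops under-representing them.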