Causal Learning in Question Quality Improvement
Yichuan Li (Arizona State University)
Ruocheng Guo (Arizona State University)
Weiying Wang (Arizona State University)
Huan Liu (Arizona State University)
Overview
1. Motivation
2. Introduction to Stack Overflow
3. Related Work in Question Quality Improvement
4. Introduction to the Dataset
5. Experiment:
   a. Models
   b. Results
6. Contribution
7. Future Work
Motivation: Community-Based Question Answering
1. Community-based question answering (CQA) forums (Stack Overflow, Quora, and Zhihu) attract millions of monthly users worldwide.
2. However, low-quality questions are widespread on these websites.
3. Submitted questions can be unclear, lack background information, and have chaotic formatting.
Motivation: Community-Based Question Answering
Example of a low-quality question: beginners often omit a clear statement of the question, the program version they are using, the methods they have already tried, and the expected input and output format.
Motivation: Community-Based Question Answering
The ratio of unanswered questions on Stack Overflow has increased in recent years. This indicates that more and more low-quality questions are being posted on the website.
Motivation: Community-Based Question Answering
Can we give revision suggestions for low-quality questions before users post them online?
Other QA Work
1. Find the best answer given a question and a corpus of answers. SemEval 2015 Task 3: Answer Selection in Community Question Answering: "Given (i) a new question and (ii) a large collection of question-comment threads created by a user community, rank the comments/answers that are most useful for answering the new question."
2. Generate the answer based on the question and a given text. SQuAD: given a document and a question, find the answer to the question within the document.
http://alt.qcri.org/semeval2015/task3/
https://rajpurkar.github.io/SQuAD-explorer/
Overview
1. Motivation
→ 2. Introduction to Stack Overflow
3. Related Work in Question Quality Improvement
4. Introduction to the Dataset
5. Experiment:
   a. Models
   b. Results
6. Contribution
7. Future Work
What is Stack Overflow?
● Stack Overflow is a question and answer site for professional and enthusiast programmers. Users can easily post questions and wait for others to respond.
● Users can ask questions on topics ranging from programming to movies and languages.
What is Stack Overflow?
[Screenshot: a question, with the question title and question body labeled]
What is Stack Overflow?
[Screenshot: a question waiting for answers]
What is Stack Overflow?
For low-quality questions, other users will revise them and leave a comment describing the revision.
What is Stack Overflow?
[Screenshot: a revision comment alongside the actual revision]
Overview
1. Motivation
2. Introduction to Stack Overflow
→ 3. Related Work in Question Quality Improvement
4. Introduction to the Dataset
5. Experiment:
   a. Models
   b. Results
6. Contribution
7. Future Work
Related Work
1. Binary classification: classify questions into high quality and low quality, then reject unqualified questions before posting. (Provides no suggestion.)
2. Multi-class classification: classify questions into suggestion labels. (Provides no estimate of each suggestion's effect.)
3. Directly intervene on the text. (Impractical.)
Related Work
To address the aforementioned problems, we build a new dataset that contains all the information we need.
Overview
1. Motivation
2. Introduction to Stack Overflow
3. Related Work in Question Quality Improvement
→ 4. Introduction to the Dataset
5. Experiment:
   a. Models
   b. Results
6. Contribution
7. Future Work
Process of Data Crawling
Existing datasets do not provide the question text (X), the question revision suggestion (T), and the reward (Y) after taking that suggestion at the same time. The reward (Y) is essential for evaluating a suggestion's effect: the optimal suggestion should yield the largest reward.
Can we build a dataset that contains the question text (X), the revision suggestion (T), and the reward (Y)?
Process of Data Crawling
1. The "PostHistory" and "Posts" tables in the Stack Exchange data dump contain this information:
○ Text: question text (X)
○ Comment: revision suggestion (T)
○ AnswerCount: reward (Y)
Process of Data Crawling
Users can query these tables with SQL, as sketched below.
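A minimal query sketch, assuming the public Stack Exchange Data Explorer schema (PostHistory.Text holds the revised body, PostHistory.Comment the edit summary, and Posts.AnswerCount the answer count); treat it as illustrative rather than the exact extraction query.

```sql
-- Sketch: retrieve question text (X), revision comment (T),
-- and answer count (Y) for revised questions.
SELECT p.Id           AS QuestionId,
       ph.Text        AS QuestionText,    -- X: revised question body
       ph.Comment     AS RevisionComment, -- T: edit summary left by the reviser
       p.AnswerCount  AS Reward           -- Y: number of answers received
FROM PostHistory ph
JOIN Posts p ON p.Id = ph.PostId
WHERE p.PostTypeId = 1             -- questions only
  AND ph.PostHistoryTypeId = 5     -- "Edit Body" revisions
  AND ph.Comment IS NOT NULL;      -- keep revisions with an edit summary
```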
Process of Data Crawling
We use keywords in the revision comments as the suggestion type to retrieve revised questions, then remove questions that fall under more than one revision type.
[Pipeline: PostHistory + Posts → revised questions → de-duplicate → cleaned questions]
Process of Data Crawling
● Types of revision suggestion (a keyword-labeling sketch follows this list):
○ Clarification: the askers provide additional context and clarify what they want to achieve.
○ Example: the askers add an input or output format or include the expected results for their problems.
○ Attempt: the possible attempts askers have tried in the process of solving their problems.
○ Solution: the askers add content to, or comment on, the solution found for the question.
○ Code: modification of the source code, considering only code additions.
○ Version: inclusion of additional details about the hardware or software used (program version, processor specification, etc.).
○ Error Information: warning messages and stack trace information of the problem.
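A hedged sketch of the keyword labeling and de-duplication step in the same SQL setting; the keyword patterns here are illustrative stand-ins, not the exact lists used to build the dataset.

```sql
-- Sketch: assign a suggestion type to each body edit by keyword matching
-- on the revision comment, then keep questions with exactly one type.
WITH Labeled AS (
  SELECT ph.PostId,
         CASE
           WHEN ph.Comment LIKE '%example%'  THEN 'Example'
           WHEN ph.Comment LIKE '%attempt%'
             OR ph.Comment LIKE '%tried%'    THEN 'Attempt'
           WHEN ph.Comment LIKE '%error%'    THEN 'Error Information'
           WHEN ph.Comment LIKE '%version%'  THEN 'Version'
           WHEN ph.Comment LIKE '%code%'     THEN 'Code'
           WHEN ph.Comment LIKE '%solution%' THEN 'Solution'
           WHEN ph.Comment LIKE '%clarif%'   THEN 'Clarification'
         END AS SuggestionType
  FROM PostHistory ph
  WHERE ph.PostHistoryTypeId = 5
    AND ph.Comment IS NOT NULL
)
SELECT PostId,
       MAX(SuggestionType) AS SuggestionType
FROM Labeled
WHERE SuggestionType IS NOT NULL
GROUP BY PostId
HAVING COUNT(DISTINCT SuggestionType) = 1;  -- drop multi-type questions
```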
Process of Data Crawling
[Figure: the number of revision suggestions in each category]
Process of Data Crawling
[Table: example instances of the contributed dataset]
Process of Data Crawling
[Table: differences between our dataset and existing ones]
Overview
1. Motivation
2. Introduction to Stack Overflow
3. Related Work in Question Quality Improvement
4. Introduction to the Dataset
→ 5. Experiment:
   a. Models
   b. Results
6. Contribution
7. Future Work
Experiment Setup
We want to answer two questions with our dataset:
1. What is the difference in answer count after taking a specific suggestion?
2. What is the optimal suggestion for a low-quality question?
Experiment Setup
We cannot observe the counterfactual outcome for each suggestion type directly. Here Y is the reward (answer count), X is the question text, and T is the treatment (suggestion type). We want to break the dependence between the question text and the treatment so that any treatment can be evaluated. We choose three SOTA causal inference models to estimate the causal effect of taking a specific suggestion.
Experiment Setup
Question 1: What is the difference in answer count before and after taking a specific suggestion?
We estimate the conditional average treatment effect (CATE) of each revision suggestion separately. Our target is to learn a function that approximates the CATE; estimation error is measured by the Precision in Estimation of Heterogeneous Effect (PEHE), the mean squared error between the estimated and true CATE, as formalized below.
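In symbols (standard potential-outcome notation, where Y(t) is the outcome under suggestion t and t = 0 means no revision):

```latex
% CATE of suggestion t for a question with text x
\tau_t(x) = \mathbb{E}\left[\, Y(t) - Y(0) \mid X = x \,\right]

% PEHE: mean squared error between estimated and true CATE
\epsilon_{\mathrm{PEHE}} = \frac{1}{n} \sum_{i=1}^{n}
  \left( \hat{\tau}(x_i) - \tau(x_i) \right)^2
```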
Experiment Setup
Question 2: What is the optimal suggestion for a low-quality question?
Choose the treatment that yields the greatest improvement in the outcome, as formalized below.
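Formally, the recommended suggestion is the treatment with the largest estimated effect for that question:

```latex
t^{*}(x) = \arg\max_{t \in \mathcal{T}} \hat{\tau}_t(x)
```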
SOTA Models: BART
Bayesian Additive Regression Trees (BART):
● An additive-error mean regression model.
● A sum-of-trees model.
● Each tree's complexity is constrained by a regularization prior.
[Diagram: reward prediction with BART]
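In the standard BART formulation (not specific to this work), each g((x, t); T_j, M_j) is a regression tree with structure T_j and leaf parameters M_j, and the regularization prior keeps each tree shallow:

```latex
Y = f(X, T) + \varepsilon, \qquad \varepsilon \sim \mathcal{N}(0, \sigma^{2})

f(x, t) = \sum_{j=1}^{m} g\big( (x, t);\, T_j, M_j \big)
```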
SOTA Models: CEVAE
Causal Effect Variational Autoencoder (CEVAE):
● This model estimates the unknown confounders from observational data through variational autoencoders.
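The standard CEVAE generative model: a latent confounder z generates the question text x, the treatment t, and the outcome y, and a VAE approximates the posterior over z from observations:

```latex
p(z, x, t, y) = p(z)\, p(x \mid z)\, p(t \mid z)\, p(y \mid t, z)
```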
SOTA Models: CFRnet
Counterfactual Regression Networks (CFRnet):
● This model learns a balanced representation of the control and treatment groups.
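The standard counterfactual regression objective: a shared representation Φ is trained so that the factual prediction loss is small while an integral probability metric (IPM) term penalizes the distance between treated and control representation distributions:

```latex
\min_{h,\, \Phi}\; \frac{1}{n} \sum_{i=1}^{n}
  L\big( h(\Phi(x_i), t_i),\, y_i \big)
  \;+\; \alpha \,\mathrm{IPM}\big( \{\Phi(x_i)\}_{t_i = 0},\, \{\Phi(x_i)\}_{t_i = 1} \big)
```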
Features in Treatment Effect Estimation Models
[Table: features used by each treatment effect estimation model]
Experiment Result

Metric   BART    CEVAE   CFRnet
         0.041   0.169   0.508
         0.661   1.030   1.522

From these results, BART achieves the best performance (lowest error) on both metrics.
Experiment Result

Metric     BART    CEVAE   CFRnet
Accuracy   0.086   0.126   0.161

From the accuracy results, CFRnet achieves the highest accuracy.
Overview
1. Motivation
2. Introduction to Stack Overflow
3. Related Work in Question Quality Improvement
4. Introduction to the Dataset
5. Experiment:
   a. Models
   b. Results
→ 6. Contribution
7. Future Work
Contribution
1. Provide a new dataset for the question quality improvement problem: the dataset contains three main components: (1) context: text features of questions; (2) treatment: revision suggestions; and (3) outcome: indicators of the quality of the revised question.
2. Demonstrate the utility of this dataset on three causal inference models: the dataset contains rich information in the revision treatments and various kinds of outcomes. Researchers can discover treatments from the revision text and estimate their causal effects simultaneously.
Future Work
1. Advanced models for feature extraction and classification, such as BERT.
2. Generate suggestion text conditioned on the question.