

1. Utility Theory, Minimum Effort, and Predictive Coding
   Fabrizio Sebastiani (joint work with Giacomo Berardi and Andrea Esuli)
   Istituto di Scienza e Tecnologie dell'Informazione
   Consiglio Nazionale delle Ricerche, 56124 Pisa, Italy
   DESI V – Roma, IT, 14 June 2013

2. What I'll be talking about
   A talk about text classification ("predictive coding"), about humans in the loop, and about how to best support their work.
   I will be looking at scenarios in which:
   1. text classification technology is used for identifying documents belonging to a given class / relevant to a given query ...
   2. ... but the level of accuracy that can be obtained from the classifier is not considered sufficient ...
   3. ... with the consequence that one or more human assessors are asked to inspect (and correct where appropriate) a portion of the classification decisions, with the goal of increasing overall accuracy.
   How can we support / optimize the work of the human assessors?

5. A worked-out example

              predicted Y    predicted N
   true Y     TP = 4         FN = 4
   true N     FP = 3         TN = 9

   F1 = 2·TP / (2·TP + FP + FN) = 0.53

7. A worked-out example (cont'd)

              predicted Y    predicted N
   true Y     TP = 5         FN = 3
   true N     FP = 3         TN = 9

   F1 = 2·TP / (2·TP + FP + FN) = 0.63

8. A worked-out example (cont'd)

              predicted Y    predicted N
   true Y     TP = 5         FN = 3
   true N     FP = 2         TN = 10

   F1 = 2·TP / (2·TP + FP + FN) = 0.67

9. A worked-out example (cont'd)

              predicted Y    predicted N
   true Y     TP = 6         FN = 2
   true N     FP = 2         TN = 10

   F1 = 2·TP / (2·TP + FP + FN) = 0.75

10. A worked-out example (cont'd)

              predicted Y    predicted N
   true Y     TP = 6         FN = 2
   true N     FP = 1         TN = 11

   F1 = 2·TP / (2·TP + FP + FN) = 0.80
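The progression in the worked example can be checked with a few lines of code: each inspection that fixes a false negative moves a document from FN to TP, each one that fixes a false positive moves a document from FP to TN, and F1 is recomputed after every correction. A minimal sketch, with the correction order mirroring the slides:

```python
def f1(tp, fp, fn):
    """F1 = 2*TP / (2*TP + FP + FN); defined as 0 when the denominator is 0."""
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0

# Initial contingency table from the worked example.
tp, fp, fn, tn = 4, 3, 4, 9
print(f"{f1(tp, fp, fn):.3f}")  # 8/15, i.e. 0.533

# Corrections in the order shown on the slides:
# "FN" = a false negative is fixed (FN -> TP),
# "FP" = a false positive is fixed (FP -> TN).
for fix in ["FN", "FP", "FN", "FP"]:
    if fix == "FN":
        fn, tp = fn - 1, tp + 1
    else:
        fp, tn = fp - 1, tn + 1
    print(f"{f1(tp, fp, fn):.3f}")  # 0.625, 0.667, 0.750, 0.800
```

Note that the total number of documents (19) never changes: inspection only moves documents between cells of the contingency table.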

11. What I'll be talking about (cont'd)
    We need methods that:
    - given a desired level of accuracy, minimize the assessors' effort necessary to achieve it;
    - alternatively, given an available amount of human assessors' effort, maximize the accuracy that can be obtained through it.
    This can be achieved by ranking the automatically classified documents in such a way that, by starting the inspection from the top of the ranking, the cost-effectiveness of the annotators' work is maximized.
    We call the task of generating such a ranking Semi-Automatic Text Classification (SATC).
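One way to make the ranking idea concrete is a toy utility-theoretic scoring rule (an illustrative sketch in the spirit of the talk, not the actual method of Berardi, Esuli & Sebastiani, SIGIR 2012): if the classifier supplies, for each document, a posterior probability that its decision is correct, score each document by the expected F1 gain of inspecting it, i.e. the probability that the decision is wrong times the F1 improvement its correction would bring, and rank documents by that score. The document ids, probabilities, and contingency counts below are made up for illustration.

```python
def f1(tp, fp, fn):
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0

def expected_gain(decision, p_correct, tp, fp, fn):
    """Expected F1 gain from inspecting one document:
    P(decision is wrong) * (F1 after its correction - F1 now)."""
    now = f1(tp, fp, fn)
    if decision == "Y":   # a wrong "Y" is a false positive; fixing it: FP -> TN
        after = f1(tp, fp - 1, fn)
    else:                 # a wrong "N" is a false negative; fixing it: FN -> TP
        after = f1(tp + 1, fp, fn - 1)
    return (1.0 - p_correct) * (after - now)

# Hypothetical documents: (id, classifier decision, P(decision is correct)).
docs = [("d1", "Y", 0.95), ("d2", "N", 0.60), ("d3", "Y", 0.55), ("d4", "N", 0.90)]
tp, fp, fn = 4, 3, 4  # estimated contingency counts for the classified set

ranked = sorted(docs, key=lambda d: expected_gain(d[1], d[2], tp, fp, fn),
                reverse=True)
print([d[0] for d in ranked])  # likely false negatives rise to the top
```

With these numbers, correcting a false negative raises F1 more than correcting a false positive, so a probable false negative outranks an equally uncertain probable false positive; a ranking driven purely by classifier uncertainty would ignore this asymmetry.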

13. What I'll be talking about (cont'd)
    Previous work has addressed SATC via techniques developed for "active learning". In both cases, the automatically classified documents are ranked with the goal of having the human annotator start inspecting/correcting from the top; however,
    - in active learning the goal is providing new training examples;
    - in SATC the goal is increasing the overall accuracy of the classified set.
    We claim that a ranking generated "à la active learning" is suboptimal for SATC [1].

    [1] G. Berardi, A. Esuli, F. Sebastiani. A Utility-Theoretic Ranking Method for Semi-Automated Text Classification. Proceedings of the 35th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2012), Portland, US, 2012.

15. Outline of this talk
    1. We discuss how to measure "error reduction" (i.e., increase in accuracy)
    2. We discuss a method for maximizing the expected error reduction for a fixed amount of annotation effort
    3. We show some promising experimental results
