Michael Ryan, John Noecker Jr Evaluating Variations in Language Lab - PowerPoint PPT Presentation

Nov 01, 2023 •260 likes •364 views

Michael Ryan, John Noecker Jr Evaluating Variations in Language Lab Duquesne University mryan, jnoecker @ jgaap.com Tools JGAAP (Java Graphical Authorship Attribution Program) - a modular test bed for authorship attribution methods.

Michael Ryan, John Noecker Jr Evaluating Variations in Language Lab Duquesne University mryan, jnoecker @ jgaap.com
Tools  JGAAP (Java Graphical Authorship Attribution Program) - a modular test bed for authorship attribution methods.  All methods used are either available in JGAAP or were extensions of it  Source code for the methods used in this experiment is available at jgaap.com
Mixture of Experts  Combined three Authorship Attribution techniques  Each technique assigns a vote on the author of the document  If there is not majority author assume the author was not in the sample group
Centroid L1  Break documents into feature vectors of character 3- grams using relative frequencies of 3-grams  Build Centroids for the known authors  Take the average of that authors feature vectors  Measure the L1 Distance between the authors’ centroids and the unknown’s feature vector  Assign your vote to the author whose centroid had the smallest L1 Distance
WEKA SMO  Break documents into feature vectors of character 3- grams using relative frequencies of 3-grams  Train WEKA’s Sequential Minimal Optimization Support Vector Machines (SMO) using the known authors’ feature vectors  SMO will rate authors similarity  Assign a vote to the most similar author
Repeated Microdocument Analysis  Break all documents into 3,000 character chunks  Reduce all contiguous whitespace to single spaces and all character to lower case  Break chunks into feature vectors of character 11-grams using relative frequencies of 11-grams  Generate Centroids for the known authors  Take the average of the author’s feature vectors  Measure the Intersection Distance between the author centroids and chunks, assigning the closest centroid’s author to each chunk  Vote on the author who receives a majority of the chunks
Author Diarization Method  Break documents into paragraphs  Extract named entities from paragraphs  Group paragraphs with named entities in common  Assume each group is an author  Use the grouped paragraphs as known chunks with Repeated Microdocument Analysis and ungrouped paragraphs as unknowns  Add the ungrouped paragraph that is closest to a group to that group and re-run the analysis until all paragraphs are grouped
Results Problem Number Correct Total Accuracy A 6 6 100% B 7 10 70% C 7 8 87.5% D 10 17 58.8% E 83 90 92.2% F 77 80 96.3% I 12 14 85.7% J 12 16 75.0% Total 214 241 88.8%
Conclusions  These methods show promise with document accuracy of 88.8% and mean accuracy of 83.2%, respectively first and third in the competition.  The method used preformed poorly on open-class problems because they were developed with only closed class in mind, removing the open-class portions changes our accuracies to 91.6% and 88.5%
Future Work  Refine analysis of open-class problems by examining how different experts preform in identifying them and how many experts it takes to reach a conclusion.

Recommend

Renewable Energy Projects Evaluating Tax Risks, Navigating Structural Variations, Leveraging

Presenting a live 90-minute webinar with interactive Q&A Using Inverted Leases to Finance Renewable Energy Projects Evaluating Tax Risks, Navigating Structural Variations, Leveraging Pass-Through Election WEDNESDAY, MARCH 29, 2017 1pm

657 views • 50 slides

Authorship ID at PAN11 What -- Why -- How Patrick Juola Evaluating Variations in Language

Authorship ID at PAN11 What -- Why -- How Patrick Juola Evaluating Variations in Language Laboratory Duquesne University, Pittsburgh PA, USA juola@mathcs.duq.edu Authorship Identification needs little definition among this group

481 views • 11 slides

Polychromatic Colorings of Complete Graphs with Respect to 1-,2-factors and Hamiltonian Cycles

Polychromatic Colorings of Complete Graphs with Respect to 1-,2-factors and Hamiltonian Cycles Maria Axenovich John Goldwasser Ryan Hansen Bernard Lidick y Ryan R. Martin David Offner John Talbot Michael Young SIAM DM June 6, 2018

931 views • 57 slides

Ryan Loggins / RPI Open Source GIS for Hurricane Recovery michael@408group.com

Michael Uffer / 408 Group Ryan Loggins / RPI Open Source GIS for Hurricane Recovery michael@408group.com ryan.a.loggins@gmail.com The MUNICIPAL Project Multi-Network Interdependent Critical Infrastructure Program for the Analysis of

303 views • 16 slides

City of Saint Paul 3K Saint Paul Councilmember Rebecca Noecker December, 2019 To coordinate and

City of Saint Paul 3K Saint Paul Councilmember Rebecca Noecker December, 2019 To coordinate and expand access to high quality early learning to 3 and 4-year-olds in Saint Paul, so that all children are ready for kindergarten and all families

175 views • 13 slides

Enteric Fermentation: origin of gases, variations, predictions and mitigation Michael Blmmel

Enteric Fermentation: origin of gases, variations, predictions and mitigation Michael Blmmel Outline of Presentation Origin of ruminal CO 2 and CH 4 from fermentation products Causes and implications of variations in ruminal CO 2 and CH

786 views • 34 slides

CPSC 875 CPSC 875 John D McGregor John D. McGregor C 8 More Design 3 tier 3 tier Variations

CPSC 875 CPSC 875 John D McGregor John D. McGregor C 8 More Design 3 tier 3 tier Variations Variations Tier vs Layer Tier vs Layer Abstracting away from user Abstracting away from hardware Modes Modes Modes Modes Each mode can

526 views • 42 slides

Variations on Nonparametric Additive Models: Computational and Statistical Aspects John Lafferty

Variations on Nonparametric Additive Models: Computational and Statistical Aspects John Lafferty Department of Statistics & Department of Computer Science University of Chicago Collaborators Sivaraman Balakrishnan (CMU) Mathias Drton

490 views • 46 slides

A Multi-Level Approach for Evaluating Internet Topology Generators Ryan Rossi 1 , Sonia Fahmy 1 ,

A Multi-Level Approach for Evaluating Internet Topology Generators Ryan Rossi 1 , Sonia Fahmy 1 , Nilothpal Talukder 1 , 2 1 Purdue University, IN 2 Rensselaer Polytechnic Institute, NY Email: { rrossi,fahmy } @cs.purdue.edu, talukn@cs.rpi.edu May

429 views • 24 slides

NAC@ACK Michael Thumann & Dror-John Roecher NAC @ACK by Michael Thumann & Dror-John

NAC@ACK Michael Thumann & Dror-John Roecher NAC @ACK by Michael Thumann & Dror-John Roecher March 30th 2007 1 Agenda Part 1 Introduction (very short) Some marketing buzz on Cisco NAC Part 2 NAC Technology All

1k views • 63 slides

NAC@ACK Michael Thumann & Dror-John Roecher NAC @ACK by Michael Thumann & Dror-John

NAC@ACK Michael Thumann & Dror-John Roecher NAC @ACK by Michael Thumann & Dror-John Roecher August 1st 2007 1 Agenda Part 1 Introduction (very short) Some marketing buzz on Cisco NAC Part 2 NAC Technology All

696 views • 58 slides

Ryan Meyers, John Parker, John Williams July 14 th 2014 1 Claire Zucker LOCAL TEAM INTRODUCTION

Ryan Meyers, John Parker, John Williams July 14 th 2014 1 Claire Zucker LOCAL TEAM INTRODUCTION 2 Evan Canfield LOCAL LEADERSHIP, SUPPORT AND PROGRESS 3 John Williams NATIONAL CONTEXT 4 Envisions Economic Companion Tool - BCE Some

772 views • 60 slides

Variations in the Quality of Variations in the Quality of TN-VPK Classrooms TN-VPK Classrooms

Variations in the Quality of Variations in the Quality of TN-VPK Classrooms TN-VPK Classrooms Dale C. Farran Kerry Hofer Mark Lipsey Carol Bilbrey The Society for Research on Educational Effectiveness Washington, DC, 3/8/14 Research Team

650 views • 26 slides

Containment Strategies in Network Models The Firefighter Problem and Some Variations Lise E.

Containment Strategies in Network Models The Firefighter Problem and Some Variations Lise E. Holte, Ryan M. Wagner, Daniel P . Biebighauser Concordia College, Moorhead, MN February 8th, 2011 1 Outline Introduction 1 Introduction to the

2.01k views • 182 slides

Monthly & Quarterly Tariff Variations July 2016 to June 2019 Tariff Variations Tariff

Monthly & Quarterly Tariff Variations July 2016 to June 2019 Tariff Variations Tariff Variations Setting the Context KEs Request Notification of MYT KE requests NEPRA to Ministry of Energy (Power process and approve the Division)

1.04k views • 89 slides

Evaluating Effectiveness of an Embedded System Endpoint Security Technology on EDS Michael

Evaluating Effectiveness of an Embedded System Endpoint Security Technology on EDS Michael Siegel, Gregory Falco, Keman Huang, Weilian Chu, Elizabeth Reilly, Mayukha Vadari 1 Digitization of Industrial Sector Increased demand on

392 views • 15 slides

EVALUATING THE USABILITY OF A MOBILE APPLICATION FOR SELF-MANAGEMENT OF UNHEALTHY ALCOHOL USE

EVALUATING THE USABILITY OF A MOBILE APPLICATION FOR SELF-MANAGEMENT OF UNHEALTHY ALCOHOL USE Eric Hawkins, PhD, Anissa Danner, MSW, Aline Lott, MA, Carol Malte, MSW, Patrick Dulin, PhD, John Fortney, PhD, George Sayre, PsyD, John Baer, PhD

442 views • 19 slides

Evaluating State-Space Abstractions in Extensive-Form Games Michael Johanson, Neil Burch,

Evaluating State-Space Abstractions in Extensive-Form Games Michael Johanson, Neil Burch, Richard Valenzano and Michael Bowling University of Alberta, Canada Outline Using CFR-BR to evaluate abstractions Using imperfect recall in

187 views • 15 slides

Testing & Validation Evaluating Mobility Performance of Unmanned Ground Vehicles Michael P.

Modeling & Simulation, Testing & Validation Evaluating Mobility Performance of Unmanned Ground Vehicles Michael P. Cole 1 Cory M. Crean 1 David J. Gorsich, PhD 1 Paramsothy Jayakumar, PhD 1 Abhinandan Jain, PhD 2 Tulga Ersal, PhD 3 1. US

500 views • 16 slides

Evaluating the Effectiveness of Model Based Power Characterization John McCullough, Yuvraj

Evaluating the Effectiveness of Model Based Power Characterization John McCullough, Yuvraj Agarwal , Jaideep Chandrashekhar (Intel), Sathya Kuppuswamy, Alex C. Snoeren, Rajesh Gupta Computer Science and Engineering, UC San Diego

563 views • 21 slides

Evaluating an Alternative CS1 for Students with Prior Programming Experience Michael S.

Evaluating an Alternative CS1 for Students with Prior Programming Experience Michael S. Kirkpatrick Chris Mayfield SIGCSE Technical Symposium March 2017 Evaluating an Alternative CS1 for Students with Prior Programming Experience SIGCSE 2017

207 views • 20 slides

NICMOS PSF Variations and Tiny Tim Simulations John E. Krist Space Telescope Science Institute,

1997 HST Calibration Workshop Space Telescope Science Institute, 1997 S. Casertano, et al., eds. NICMOS PSF Variations and Tiny Tim Simulations John E. Krist Space Telescope Science Institute, 3700 San Martin Drive, Baltimore, MD 21218, USA

308 views • 10 slides

Faceted Crystal Shape Evolution During Dissolution or Growth Ryan C. Snyder and Michael F.

MATERIALS, INTERFACES, AND ELECTROCHEMICAL PHENOMENA Faceted Crystal Shape Evolution During Dissolution or Growth Ryan C. Snyder and Michael F. Doherty Dept. of Chemical Engineering, University of California, Santa Barbara, CA 93106 DOI

632 views • 12 slides

Feeling the Measure: Evaluating Affective Outcomes John Oughton and Eleanor Pierre Affective

Feeling the Measure: Evaluating Affective Outcomes John Oughton and Eleanor Pierre Affective Domain Attitudes Motivation Willingness to Participate Valuing What is Being Learned Incorporating Values Into Life F eelings:

601 views • 22 slides

Michael Ryan, John Noecker Jr Evaluating Variations in Language Lab - PowerPoint PPT Presentation

Michael Ryan, John Noecker Jr Evaluating Variations in Language Lab Duquesne University mryan, jnoecker @ jgaap.com Tools JGAAP (Java Graphical Authorship Attribution Program) - a modular test bed for authorship attribution methods.

Renewable Energy Projects Evaluating Tax Risks, Navigating Structural Variations, Leveraging

Authorship ID at PAN11 What -- Why -- How Patrick Juola Evaluating Variations in Language

Polychromatic Colorings of Complete Graphs with Respect to 1-,2-factors and Hamiltonian Cycles

Ryan Loggins / RPI Open Source GIS for Hurricane Recovery michael@408group.com

City of Saint Paul 3K Saint Paul Councilmember Rebecca Noecker December, 2019 To coordinate and

Enteric Fermentation: origin of gases, variations, predictions and mitigation Michael Blmmel

CPSC 875 CPSC 875 John D McGregor John D. McGregor C 8 More Design 3 tier 3 tier Variations

Variations on Nonparametric Additive Models: Computational and Statistical Aspects John Lafferty

A Multi-Level Approach for Evaluating Internet Topology Generators Ryan Rossi 1 , Sonia Fahmy 1 ,

NAC@ACK Michael Thumann &amp; Dror-John Roecher NAC @ACK by Michael Thumann &amp; Dror-John

NAC@ACK Michael Thumann &amp; Dror-John Roecher NAC @ACK by Michael Thumann &amp; Dror-John

Ryan Meyers, John Parker, John Williams July 14 th 2014 1 Claire Zucker LOCAL TEAM INTRODUCTION

Variations in the Quality of Variations in the Quality of TN-VPK Classrooms TN-VPK Classrooms

Containment Strategies in Network Models The Firefighter Problem and Some Variations Lise E.

Monthly &amp; Quarterly Tariff Variations July 2016 to June 2019 Tariff Variations Tariff

Evaluating Effectiveness of an Embedded System Endpoint Security Technology on EDS Michael

EVALUATING THE USABILITY OF A MOBILE APPLICATION FOR SELF-MANAGEMENT OF UNHEALTHY ALCOHOL USE

Evaluating State-Space Abstractions in Extensive-Form Games Michael Johanson, Neil Burch,

Testing &amp; Validation Evaluating Mobility Performance of Unmanned Ground Vehicles Michael P.

Evaluating the Effectiveness of Model Based Power Characterization John McCullough, Yuvraj

Evaluating an Alternative CS1 for Students with Prior Programming Experience Michael S.

NICMOS PSF Variations and Tiny Tim Simulations John E. Krist Space Telescope Science Institute,

Faceted Crystal Shape Evolution During Dissolution or Growth Ryan C. Snyder and Michael F.

Feeling the Measure: Evaluating Affective Outcomes John Oughton and Eleanor Pierre Affective

NAC@ACK Michael Thumann & Dror-John Roecher NAC @ACK by Michael Thumann & Dror-John

NAC@ACK Michael Thumann & Dror-John Roecher NAC @ACK by Michael Thumann & Dror-John

Monthly & Quarterly Tariff Variations July 2016 to June 2019 Tariff Variations Tariff

Testing & Validation Evaluating Mobility Performance of Unmanned Ground Vehicles Michael P.