Document Understanding Conference DUC 2007 Hoa Trang Dang - PowerPoint PPT Presentation

Document Understanding Conference DUC 2007 Hoa Trang Dang National Institute of Standards and Technology April 26, 2007

Thank You! • 32 Participating teams from: - 11 countries - 5 continents (N. America, Europe, Asia, Africa, Australia) • Assessors A, B, C, D, E, F, G, H, I, and J • DUC 2007 Program Committee: - John Conroy, Donna Harman, Ed Hovy, Kathy McKeown, Drago Radev, Lucy Vanderwende - Karen Sparck-Jones Hoa Trang Dang

Document Understanding Conferences • 2000 Summarization roadmap, progress: - simple genre ➯ complex genre - simple tasks ➯ demanding tasks ‣ extract ➯ abstract ‣ single document ➯ multiple documents ‣ English ➯ other language ‣ generic summaries ➯ focused or evolving summaries - intrinsic evaluation ➯ extrinsic evaluation Hoa Trang Dang

DUC 2001-2006 Summarization • for single, multiple newswire documents • at various lengths (10 words, 100+ words) • of various sorts (generic, viewpoint-oriented, query- oriented) • comparing automatic summaries with manual ones - intrinsic: linguistic quality, content coverage, Rouge - extrinsic (simulated): usefulness, responsiveness Hoa Trang Dang

DUC 2007 Tasks and Evaluations • Summaries focused by questions representing user need/interests 1. Main Task: 250 word-summary ‣ length requires structuring of summary ‣ evaluated for content, readability 2. Update Task: 100 word-summary ‣ assumption of some user knowledge ‣ evaluated for content Hoa Trang Dang

DUC 2007 Main Task

2005-2007 Question-focused task 25 “Relevant” docs Fluent (newswire) 250-word Answer Summary Complex question(s)

Example DUC 2007 Topic • num: D0715D • title: International Land Mine Ban Treaty • narr: Which countries have signed the Ottawa Treaty for the elimination of anti-personnel land mines, and how many have ratified it? What countries have refused to sign, and why? How effective has the treaty been? Hoa Trang Dang

Main task: topics, documents, peers • 45 topics developed by 10 NIST assessors • Documents from AP, NYT, XIN newswire • Model summaries written by 10 assessors (ID = A-J) - 4 model summaries per topic • 30 participants (ID = 3-32) • 2 Baselines (ID = 1-2): - Simple: first 250 words of most recent document - Generic: high-performance generic summarizer Hoa Trang Dang

Generic Baseline: CLASSY04 • Top in DUC 2004 (generic 100-word summary) • Topic description is not used • Sentence splitting/shortening taken from CLASSY07 • 5-state Hidden Markov Model - states represent hidden summary and non- summary sentences • Observations: log(# signature terms + 1) - signature terms computed based on given clusters • Pivoted QR to remove redundancy Hoa Trang Dang

Evaluation methods • Manual evaluation - Readability: 5 linguistic qualities - Content responsiveness - Pyramids (optional, volunteer community effort) • Automatic evaluation of content - ROUGE-2, ROUGE-SU4 (stemmed, keep stopwords) - BE (HM) Hoa Trang Dang

Manual evaluation • 10 assessors • One assessor/topic: linguistic quality, responsiveness • Assessor is topic developer, a summarizer for topic • Each score based on a 5-point scale - (1=very poor ... 5=very good) • No manual assessment of overall responsiveness (content + linguistic quality) Hoa Trang Dang

Q1: Grammaticality Humans Simple Baseline Generic Baseline Participants 200 700 30 30 600 25 25 150 500 20 20 400 Frequency 100 15 15 300 10 10 200 50 5 5 100 0 0 0 0 1 2 3 4 5 1 2 3 4 5 1 2 3 4 5 1 2 3 4 5 The summary should have no datelines, system- internal formatting, capitalization errors or obviously ungrammatical sentences (e.g., fragments, missing components) that make the text difficult to read.

1 2 3 4 5 6 7 40 40 40 40 40 40 40 20 20 20 20 20 20 20 0 0 0 0 0 0 0 1 3 5 1 3 5 1 3 5 1 3 5 1 3 5 1 3 5 1 3 5 8 9 10 11 12 13 14 40 40 40 40 40 40 40 20 20 20 20 20 20 20 0 0 0 0 0 0 0 1 3 5 1 3 5 1 3 5 1 3 5 1 3 5 1 3 5 1 3 5 15 16 17 18 19 20 21 40 40 40 40 40 40 40 20 20 20 20 20 20 20 0 0 0 0 0 0 0 1 3 5 1 3 5 1 3 5 1 3 5 1 3 5 1 3 5 1 3 5 22 23 24 25 26 27 28 40 40 40 40 40 40 40 20 20 20 20 20 20 20 0 0 0 0 0 0 0 1 3 5 1 3 5 1 3 5 1 3 5 1 3 5 1 3 5 1 3 5 29 30 31 32 40 40 40 40 Q1: Grammaticality 20 20 20 20 0 0 0 0 1 3 5 1 3 5 1 3 5 1 3 5

Q2: Non-Redundancy Humans Simple Baseline Generic Baseline Participants 200 40 40 800 150 30 30 600 Frequency 100 20 20 400 50 10 10 200 0 0 0 0 1 2 3 4 5 1 2 3 4 5 1 2 3 4 5 1 2 3 4 5 There should be no unnecessary repetition in the summary. Unnecessary repetition might take the form of whole sentences that are repeated, or repeated facts, or the repeated use of a noun or noun phrase (e.g., ``Bill Clinton'') when a pronoun (``he'') would suffice.

1 2 3 4 5 6 7 40 40 40 40 40 40 40 20 20 20 20 20 20 20 0 0 0 0 0 0 0 1 3 5 1 3 5 1 3 5 1 3 5 1 3 5 1 3 5 1 3 5 8 9 10 11 12 13 14 40 40 40 40 40 40 40 20 20 20 20 20 20 20 0 0 0 0 0 0 0 1 3 5 1 3 5 1 3 5 1 3 5 1 3 5 1 3 5 1 3 5 15 16 17 18 19 20 21 40 40 40 40 40 40 40 20 20 20 20 20 20 20 0 0 0 0 0 0 0 1 3 5 1 3 5 1 3 5 1 3 5 1 3 5 1 3 5 1 3 5 22 23 24 25 26 27 28 40 40 40 40 40 40 40 20 20 20 20 20 20 20 0 0 0 0 0 0 0 1 3 5 1 3 5 1 3 5 1 3 5 1 3 5 1 3 5 1 3 5 29 30 31 32 40 40 40 40 Q2: Non-Redundancy 20 20 20 20 0 0 0 0 1 3 5 1 3 5 1 3 5 1 3 5

Q3: Referential Clarity Humans Simple Baseline Generic Baseline Participants 200 500 40 40 400 150 30 30 300 Frequency 100 20 20 200 50 10 10 100 0 0 0 0 1 2 3 4 5 1 2 3 4 5 1 2 3 4 5 1 2 3 4 5 It should be easy to identify who or what the pronouns and noun phrases in the summary are referring to. If a person or other entity is mentioned, it should be clear what their role in the story is. So, a reference would be unclear if an entity is referenced but its identity or relation to the story remains unclear.

1 2 3 4 5 6 7 40 40 40 40 40 40 40 20 20 20 20 20 20 20 0 0 0 0 0 0 0 1 3 5 1 3 5 1 3 5 1 3 5 1 3 5 1 3 5 1 3 5 8 9 10 11 12 13 14 40 40 40 40 40 40 40 20 20 20 20 20 20 20 0 0 0 0 0 0 0 1 3 5 1 3 5 1 3 5 1 3 5 1 3 5 1 3 5 1 3 5 15 16 17 18 19 20 21 40 40 40 40 40 40 40 20 20 20 20 20 20 20 0 0 0 0 0 0 0 1 3 5 1 3 5 1 3 5 1 3 5 1 3 5 1 3 5 1 3 5 22 23 24 25 26 27 28 40 40 40 40 40 40 40 20 20 20 20 20 20 20 0 0 0 0 0 0 0 1 3 5 1 3 5 1 3 5 1 3 5 1 3 5 1 3 5 1 3 5 29 30 31 32 40 40 40 40 Q3: Referential Clarity 20 20 20 20 0 0 0 0 1 3 5 1 3 5 1 3 5 1 3 5

Q4: Focus Humans Simple Baseline Generic Baseline Participants 200 500 40 40 400 150 30 30 300 Frequency 100 20 20 200 50 10 10 100 0 0 0 0 1 2 3 4 5 1 2 3 4 5 1 2 3 4 5 1 2 3 4 5 The summary should have a focus; sentences should only contain information that is related to the rest of the summary.

1 2 3 4 5 6 7 40 40 40 40 40 40 40 20 20 20 20 20 20 20 0 0 0 0 0 0 0 1 3 5 1 3 5 1 3 5 1 3 5 1 3 5 1 3 5 1 3 5 8 9 10 11 12 13 14 40 40 40 40 40 40 40 20 20 20 20 20 20 20 0 0 0 0 0 0 0 1 3 5 1 3 5 1 3 5 1 3 5 1 3 5 1 3 5 1 3 5 15 16 17 18 19 20 21 40 40 40 40 40 40 40 20 20 20 20 20 20 20 0 0 0 0 0 0 0 1 3 5 1 3 5 1 3 5 1 3 5 1 3 5 1 3 5 1 3 5 22 23 24 25 26 27 28 40 40 40 40 40 40 40 20 20 20 20 20 20 20 0 0 0 0 0 0 0 1 3 5 1 3 5 1 3 5 1 3 5 1 3 5 1 3 5 1 3 5 29 30 31 32 40 40 40 40 Q4: Focus 20 20 20 20 0 0 0 0 1 3 5 1 3 5 1 3 5 1 3 5

Q5: Structure and Coherence Humans Simple Baseline Generic Baseline Participants 200 700 30 30 600 25 25 150 500 20 20 Frequency 400 100 15 15 300 10 10 200 50 5 5 100 0 0 0 0 1 2 3 4 5 1 2 3 4 5 1 2 3 4 5 1 2 3 4 5 The summary should be well-structured and well- organized. The summary should not just be a heap of related information, but should build from sentence to sentence to a coherent body of information about a topic.

1 2 3 4 5 6 7 40 40 40 40 40 40 40 20 20 20 20 20 20 20 0 0 0 0 0 0 0 1 3 5 1 3 5 1 3 5 1 3 5 1 3 5 1 3 5 1 3 5 8 9 10 11 12 13 14 40 40 40 40 40 40 40 20 20 20 20 20 20 20 0 0 0 0 0 0 0 1 3 5 1 3 5 1 3 5 1 3 5 1 3 5 1 3 5 1 3 5 15 16 17 18 19 20 21 40 40 40 40 40 40 40 20 20 20 20 20 20 20 0 0 0 0 0 0 0 1 3 5 1 3 5 1 3 5 1 3 5 1 3 5 1 3 5 1 3 5 22 23 24 25 26 27 28 40 40 40 40 40 40 40 20 20 20 20 20 20 20 0 0 0 0 0 0 0 1 3 5 1 3 5 1 3 5 1 3 5 1 3 5 1 3 5 1 3 5 29 30 31 32 40 40 40 40 Q5: Structure/Coherence 20 20 20 20 0 0 0 0 1 3 5 1 3 5 1 3 5 1 3 5

Document Understanding Conference DUC 2007 Hoa Trang Dang - PowerPoint PPT Presentation

Document Understanding Conference DUC 2007 Hoa Trang Dang National Institute of Standards and Technology April 26, 2007 Thank You! 32 Participating teams from: - 11 countries - 5 continents (N. America, Europe, Asia, Africa, Australia)

Document #15 Document #15 Document #15 Document #15 Document #15 Document #15 Document #15

Document Understanding Conference DUC 2006 Welcome! DUC 2006-2007 Program Committee John Conroy

GROWI NG YOUR GSA Cra ig L e c hma n-Pla c e r Co unty Offic e o f E duc a tio n E duc a

We ste rn Cape E duc ation De partme nt e -E duc ation Strate gy E nhanc ing L e ar ning

City He ights E duc ational Collabor ative Sa n Die g o Unifie d Sc ho o l Distric t Bo a rd

DUC Property in the Carp Hills Mark Gloutney Director Regional Operations Eastern Canada

Co st Re duc tio n Po te ntia ls in Co st Re duc tio n Po te ntia ls in the Ge rma n Ma rke t fo

GOOGL E WHAT S NE XT ? Go o g le E duc a tio n T o o ls g .c o / e duc hro me b o o

Mo ntc la ir Bo a rd o f E duc a tio n 2020-2021 Budg e t Pre se nta tio n Spe c ia l E duc

T he Pr ivile ge Of Wor king In E duc ation E duc atio n is the mo st po we r ful me

Hard-to-Compute Bits for Elliptic Curve-Based One-Way Functions Alexandre Duc 1 Dimitar Jetchev 1

Extracting Linked Data from statistic spreadsheets Tien-Duc Cao tien-duc.cao@inria.fr Ioana

2007 Regional Freight 2007 Regional Freight Conference Conference May 15, 2007 May 15, 2007

UNDERSTANDING (LMOU) LOCAL MEMORANDUM OF UNDERSTANDING (LMOU) LOCAL MEMORANDUM OF UNDERSTANDING

E.D.A.S. Conference 2007 E.D.A.S. Conference 2007 Arts & Regeneration: Urban & Rural

Product Features Technical Training 2007 Technical Training 2007 Technical Training 2007

An experiment on the Reddit dataset Sha hahbaz Ahm hmed ed (1155 15594 94) Viore orel Mora

Severity Modeling of Extreme Insurance Claims for Tariffication Sascha Desmettre (joint work

Java for Non Majors Final Study Guide April 26, 2017 The test consists of 1. Multiple choice

Supero: Making Haskell Faster Neil Mitchell, Colin Runciman www.cs.york.ac.uk/~ndm/supero The

The summarize verb IN TRODUCTION TO TH E TIDYVERS E David Robinson Chief Data Scientist,

Scientific Computing Maastricht Science Program Week 4 Frans Oliehoek

Goals for My Presentations You learn things from me you can use We all learn from each

Serena Pastore (INAF) Obstacles for following i18n best practices when developing content at

Sambuz

Useful Links

Newsletter

Mail Us

Document Understanding Conference DUC 2007 Hoa Trang Dang - PowerPoint PPT Presentation

Document Understanding Conference DUC 2007 Hoa Trang Dang National Institute of Standards and Technology April 26, 2007 Thank You! 32 Participating teams from: - 11 countries - 5 continents (N. America, Europe, Asia, Africa, Australia)

Document #15 Document #15 Document #15 Document #15 Document #15 Document #15 Document #15

Document Understanding Conference DUC 2006 Welcome! DUC 2006-2007 Program Committee John Conroy

GROWI NG YOUR GSA Cra ig L e c hma n-Pla c e r Co unty Offic e o f E duc a tio n E duc a

We ste rn Cape E duc ation De partme nt e -E duc ation Strate gy E nhanc ing L e ar ning

City He ights E duc ational Collabor ative Sa n Die g o Unifie d Sc ho o l Distric t Bo a rd

DUC Property in the Carp Hills Mark Gloutney Director Regional Operations Eastern Canada

Co st Re duc tio n Po te ntia ls in Co st Re duc tio n Po te ntia ls in the Ge rma n Ma rke t fo

GOOGL E WHAT S NE XT ? Go o g le E duc a tio n T o o ls g .c o / e duc hro me b o o

Mo ntc la ir Bo a rd o f E duc a tio n 2020-2021 Budg e t Pre se nta tio n Spe c ia l E duc

T he Pr ivile ge Of Wor king In E duc ation E duc atio n is the mo st po we r ful me

Hard-to-Compute Bits for Elliptic Curve-Based One-Way Functions Alexandre Duc 1 Dimitar Jetchev 1

Extracting Linked Data from statistic spreadsheets Tien-Duc Cao tien-duc.cao@inria.fr Ioana

2007 Regional Freight 2007 Regional Freight Conference Conference May 15, 2007 May 15, 2007

UNDERSTANDING (LMOU) LOCAL MEMORANDUM OF UNDERSTANDING (LMOU) LOCAL MEMORANDUM OF UNDERSTANDING

E.D.A.S. Conference 2007 E.D.A.S. Conference 2007 Arts &amp; Regeneration: Urban &amp; Rural

Product Features Technical Training 2007 Technical Training 2007 Technical Training 2007

An experiment on the Reddit dataset Sha hahbaz Ahm hmed ed (1155 15594 94) Viore orel Mora

Severity Modeling of Extreme Insurance Claims for Tariffication Sascha Desmettre (joint work

Java for Non Majors Final Study Guide April 26, 2017 The test consists of 1. Multiple choice

Supero: Making Haskell Faster Neil Mitchell, Colin Runciman www.cs.york.ac.uk/~ndm/supero The

The summarize verb IN TRODUCTION TO TH E TIDYVERS E David Robinson Chief Data Scientist,

Scientific Computing Maastricht Science Program Week 4 Frans Oliehoek

Goals for My Presentations You learn things from me you can use We all learn from each

Serena Pastore (INAF) Obstacles for following i18n best practices when developing content at

Sambuz

Useful Links

Newsletter

Mail Us

E.D.A.S. Conference 2007 E.D.A.S. Conference 2007 Arts & Regeneration: Urban & Rural