Cross-Language Evaluation Forum
From CLEF 2003 to CLEF 2004
Carol Peters, Martin Braschler, Jacques Savoy
NTCIR-4 Workshop

Outline
• What happened at CLEF 2003
  – Tracks and tasks
  – Test collection
  – Participation
  – Results
• What is happening in CLEF 2004

CLEF 2003: Core Tracks
• Free-text retrieval on news corpora
  – Multilingual: 2 tasks
    · Small-multilingual: 4 "core" languages (EN, ES, FR, DE)
    · Large-multilingual: 8 languages (+ FI, IT, NL, SV)
    · Topics in 12 languages, including JP and ZH
  – Bilingual: aim was comparability
    · IT → ES, FR → NL
    · DE → IT, FI → DE
    · X → RU; newcomers only: X → EN
  – Monolingual: all languages (except English)
• Retrieval on structured, domain-specific data
  – Mono- and cross-language IR on social science data (DE, EN)

CLEF 2003: Additional Tracks
• Interactive Track – iCLEF (coordinated by UNED, UMD)
  – Interactive document selection / query formulation
• Multilingual QA Track (ITC-irst, UNED, U. Amsterdam, NIST)
  – Monolingual QA for Dutch, Italian and Spanish
  – Cross-language QA to an English target collection
• ImageCLEF (coordinated by U. Sheffield)
  – Cross-language image retrieval using captions
• Cross-Language Spoken Document Retrieval (ITC-irst, U. Exeter)
  – Evaluation of CLIR on noisy transcripts of spoken documents
  – Low-cost development of a benchmark

CLEF 2003: Participants
42 groups from 14 countries: 29 European, 10 North American, 3 Asian; 32 from academia, 10 from industry
(*/**/*** = one/two/three previous participations)
BBN/UMD (US), CEA/LIC2M (FR), CLIPS/IMAG (FR), CMU (US) *, Clairvoyance Corp. (US) *, COLE/U La Coruna (ES) *, Daedalus (ES), DFKI (DE), DLTG U Limerick (IE), ENEA/La Sapienza (IT), Fernuni Hagen (DE), Fondazione Ugo Bordoni (IT) *, Hummingbird (CA) **, IMS U Padova (IT) *, ISI U Southern Cal (US), ITC-irst (IT) ***, JHU-APL (US) ***, Kermit (FR/UK), Medialab (NL) **, NII (JP), National Taiwan U (TW) **, OCE Tech. BV (NL) **, Ricoh (JP), SICS (SV) **, SINAI/U Jaen (ES) **, Tagmatica (FR) *, U Alicante (ES) **, U Buffalo (US), U Amsterdam (NL) **, U Exeter (UK) **, U Oviedo/AIC (ES), U Hildesheim (DE) *, U Maryland (US) ***, U Montreal/RALI (CA) ***, U Neuchâtel (CH) **, U Sheffield (UK) ***, U Sunderland (UK), U Surrey (UK), U Tampere (FI) ***, U Twente (NL) ***, UC Berkeley (US) ***, UNED (ES) **

CLEF 2003: Data Collections
• Multilingual comparable corpus
  – news documents in 9 languages: DE, EN, ES, FI, FR, IT, NL, RU, SV
  – common set of 60 topics in 10 languages (+ ZH) for the core tracks
  – 2 sets of 200 questions for mono- and cross-language QA
• GIRT-4: German and English social science documents
  – plus a German/English/Russian thesaurus
  – 25 topics in DE/EN/RU
• St Andrews University image collection
  – historical photo collection with EN captions
  – 50 short topics in DE, ES, FR, IT, NL
• CL-SDR: TREC-8 and TREC-9 SDR collections
  – noisy spoken-document transcripts in English
  – 100 short topics in DE, ES, FR, IT, NL
From CLIR-TREC to CLEF: Growth in Test Collection

              # part.  # lang.     # docs   Size (MB)  # assess.  # topics  # ass./topic
CLEF 2003        33       9      1,611,178    4,124     188,475   60 (37)      ~3,100
CLEF 2002        34       8      1,138,650    3,011     140,043   50 (30)      ~2,900
CLEF 2001        31       6        940,487    2,522      97,398   50            1,948
CLEF 2000        20       4        368,763    1,158      43,566   40            1,089
TREC-8 CLIR      12       4        698,773    1,620      23,156   28              827

From CLIR-TREC to CLEF: Growth in Participation (Main Tracks)
[Bar chart: number of participating groups, all vs. European, from TREC-6, TREC-7 and TREC-8 through CLEF 2000-2003; scale 0-45]

CLEF 2003: Details of Experiments

Track                        # participants   # runs/experiments
Multilingual-8                      7                 33
Multilingual-4                     14                 53
Bilingual FI → DE                   2                  3
Bilingual X → EN                    3                 15
Bilingual IT → ES                   9                 25
Bilingual DE → IT                   8                 21
Bilingual FR → NL                   3                  6
Bilingual X → RU                    2                  9
Monolingual DE                     13                 30
(Monolingual EN)                   (5)                11
Monolingual ES                     16                 38
Monolingual FI                      7                 13
Monolingual FR                     16                 36
Monolingual IT                     13                 27
Monolingual NL                     11                 32
Monolingual RU                      5                 23
Monolingual SV                      8                 18
Domain-specific GIRT → DE           4                 16
Domain-specific GIRT → EN           2                  6
Interactive                         5                 10
Question Answering                  8                 17
Image Retrieval                     4                 45
Spoken Document Retrieval           4                 29

[Recall-precision graph: CLEF 2003 Multilingual-8 track, TD automatic runs; curves shown for UC Berkeley, Uni Neuchâtel, U Amsterdam, JHU/APL and U Tampere]
[Recall-precision graph: CLEF 2003 Multilingual-4 track, TD automatic runs; curves shown for U Exeter, UC Berkeley, Uni Neuchâtel, CMU and U Alicante]
(A sketch of the interpolated precision computation behind these curves appears at the end of this section.)

Trends in CLEF-2003
• A lot of detailed fine-tuning (per language, per weighting scheme, per translation resource type)
• People think about ways to "scale" to new languages
• Merging is still a hot issue; however, no merging approach besides the simple ones has been widely adopted yet (a minimal merging sketch follows this slide)
• A few resources were really popular: Snowball stemmers, UniNE stopword lists, some MT systems, "Freelang" dictionaries (an indexing sketch also follows this slide)
• Query translation (QT) still rules
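The "simple" merging approaches the slide alludes to can be written down in a few lines. The Python sketch below is a minimal illustration, not any participant's actual system: it assumes each per-language run is a ranked list of (doc_id, score) pairs, normalizes the scores, and optionally weights them by sub-collection size in the spirit of the collection-size based merging mentioned on the next slide. All names and numbers are invented.

```python
# A minimal sketch (not any participant's actual system) of merging
# per-language result lists into one multilingual ranking. Assumed input:
# one ranked list of (doc_id, score) pairs per language. With coll_sizes
# given, scores are weighted by each sub-collection's share of documents,
# a simple form of collection-size based merging; without it, this is
# plain normalized-score merging.

from typing import Dict, List, Optional, Tuple

Run = List[Tuple[str, float]]


def min_max_normalize(run: Run) -> Run:
    """Rescale the scores of one ranked list to the range [0, 1]."""
    if not run:
        return []
    scores = [score for _, score in run]
    lo, hi = min(scores), max(scores)
    if hi == lo:
        return [(doc, 1.0) for doc, _ in run]
    return [(doc, (score - lo) / (hi - lo)) for doc, score in run]


def merge_runs(runs: Dict[str, Run],
               coll_sizes: Optional[Dict[str, int]] = None,
               k: int = 1000) -> Run:
    """Merge per-language runs into a single ranked top-k list."""
    total = sum(coll_sizes.values()) if coll_sizes else 0
    merged: Run = []
    for lang, run in runs.items():
        # Weight each normalized score by this sub-collection's share of
        # documents; with no sizes given the weight is 1.0 for every run.
        weight = coll_sizes[lang] / total if coll_sizes else 1.0
        merged.extend((doc, weight * score)
                      for doc, score in min_max_normalize(run))
    merged.sort(key=lambda pair: pair[1], reverse=True)
    return merged[:k]


# Hypothetical toy runs and sub-collection sizes, for illustration only.
runs = {
    "DE": [("DE-001", 12.3), ("DE-007", 9.8)],
    "FR": [("FR-042", 4.1), ("FR-013", 3.3)],
    "RU": [("RU-101", 0.9)],
}
print(merge_runs(runs, coll_sizes={"DE": 100000, "FR": 60000, "RU": 20000}, k=5))
```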
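As an illustration of how such popular indexing resources are typically wired together, here is a minimal Python sketch assuming NLTK's Snowball stemmer package is available. The tiny stopword list and compound lexicon are stand-ins for real resources such as the UniNE stopword lists, and the greedy splitter is only one naive way to decompound German terms, not any group's actual method.

```python
# A minimal sketch, assuming NLTK is installed (pip install nltk); the
# Snowball stemmers ship with NLTK and need no extra data download. The
# toy stopword list and compound lexicon below stand in for resources
# such as the UniNE stopword lists; the greedy splitter only illustrates
# the idea of decompounding.

from nltk.stem.snowball import SnowballStemmer

GERMAN_STOPWORDS = {"der", "die", "das", "und", "in", "von"}   # toy list
COMPOUND_PARTS = {"fussball", "welt", "meister", "schaft"}     # toy lexicon

stemmer = SnowballStemmer("german")


def split_compound(word, parts=COMPOUND_PARTS, min_len=4):
    """Greedy left-to-right split of a compound into known parts."""
    pieces, rest = [], word
    while rest:
        for end in range(len(rest), min_len - 1, -1):
            if rest[:end] in parts:
                pieces.append(rest[:end])
                rest = rest[end:]
                break
        else:
            return [word]          # no full split found: keep the word whole
    return pieces


def index_terms(text):
    """Lowercase, drop stopwords, decompound, then stem each token."""
    terms = []
    for token in text.lower().split():
        if token in GERMAN_STOPWORDS:
            continue
        terms.extend(stemmer.stem(piece) for piece in split_compound(token))
    return terms


# e.g. "Fussballweltmeisterschaft" -> stems of fussball / welt / meister / schaft
print(index_terms("Die Fussballweltmeisterschaft und der Meister"))
```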
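For readers less familiar with the recall-precision curves summarized above, the following sketch shows the standard TREC-style interpolation they rely on: interpolated precision at a recall level is the maximum precision observed at any recall at or above that level. The run and relevance judgments in the example are hypothetical.

```python
# A minimal sketch of the TREC-style interpolated recall-precision curve:
# inputs are a ranked list of doc ids and the set of docs judged relevant
# for one topic (both hypothetical here).

def interpolated_precision(ranked, relevant,
                           levels=tuple(i / 10 for i in range(11))):
    """Return {recall_level: interpolated precision}; `relevant` must be non-empty."""
    points = []            # (recall, precision) after each retrieved doc
    hits = 0
    for rank, doc in enumerate(ranked, start=1):
        if doc in relevant:
            hits += 1
        points.append((hits / len(relevant), hits / rank))
    # Interpolated precision at recall r = max precision at any recall >= r.
    return {
        round(r, 1): max((p for rec, p in points if rec >= r), default=0.0)
        for r in levels
    }


# Toy example: 2 of 4 relevant documents retrieved, at ranks 1 and 3.
print(interpolated_precision(["d3", "d9", "d1", "d7"], {"d3", "d1", "d5", "d8"}))
```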
CLEF-2003 vs. CLEF-2002
• Many participants were back
• Many groups tried several tasks
• People try out each other's ideas/methods:
  – collection-size based merging, 2-step merging
  – (fast) document translation
  – compound splitting, stemmers
• Returning participants usually improve performance ("advantage for veteran groups")
• Scaling up to Multilingual-8 takes its time (?)

Trends in CLEF-2003 (continued)
• Stemming and decompounding are still actively debated; maybe even more use of linguistics than before?
• Monolingual tracks were "hotly contested"; some show very similar performance among the top groups
• Bilingual tracks forced people to think about "inconvenient" language pairs
• Success of the "additional" tracks
• Strong involvement of new groups in track coordination

"Effect" of CLEF in CLEF 2003
• The number of European groups grows more slowly (29)
• Fine-tuning for individual languages, weighting schemes etc. has become a hot topic
  – are we overtuning to the characteristics of the CLEF collection?
• Some blueprints for "successful CLIR" have now been widely adopted
  – are we headed towards a monoculture of CLIR systems?
• Multilingual-8 was dominated by veterans, but Multilingual-4 was very competitive
• "Inconvenient" language pairs for the bilingual tasks stimulated some interesting work
• Increase of groups with an NLP background (effect of QA)

2003 Workshop
• Results of the CLEF 2003 campaign presented at the workshop, 20-21 Aug. 2003, Trondheim
• 60 researchers and system developers from academia and industry participated
• Working Notes containing preliminary reports and statistics on the CLEF 2003 experiments available on the Web site
• Proceedings to be published by Springer in the LNCS series

CLEF 2004: Considerable focus on QA
• Multilingual Question Answering (QA at CLEF)
  – mono- and cross-language QA: target collections for DE/EN/ES/FR/IT/NL/PT
• Interactive CLIR – iCLEF
  – cross-language QA from a user-inclusive perspective
  – how can interaction with the user help a QA system?
  – how should a cross-language system help users locate answers quickly?
  – coordination with the QA track

CLEF 2004: Reduction of "core" tracks – expansion of "new" tracks
• Mono-, bi-, and multilingual IR on news collections
  – just 5 target languages (EN/FI/FR/RU and a new language, Portuguese)
• Mono- and cross-language information retrieval on structured scientific data
  – GIRT-4 EN and DE social science data
CLEF 2004: Importance of non-textual media
• Cross-Language Image Retrieval (ImageCLEF)
  – using both text and image matching techniques
  – bilingual ad hoc retrieval task (ES/FR/DE/IT/NL)
  – an interactive search task (tentative)
  – a medical image retrieval task
• Cross-Language Spoken Document Retrieval (CL-SDR)
  – evaluation of CLIR systems on noisy automatic transcripts of spoken documents
  – CL-SDR from ES/FR/DE/IT/NL
  – retrieval with/without known story boundaries
  – use of multiple automatic transcriptions

CLEF 2004
• 60 groups registered
• Results due end May (dates vary slightly according to the track)
• QA@CLEF and ImageCLEF particularly popular tasks
• 16 groups registered for the multilingual task (target document collection in 4 languages: EN, FI, FR, RU)
• 22 groups registered for QA@CLEF; 19 for ImageCLEF
• Workshop: 15-17 September, Bath, UK (after the European Conference on Digital Libraries)

Cross-Language Evaluation Forum
For further information see: http://www.clef-campaign.org
or contact: Carol Peters, ISTI-CNR
E-mail: carol@isti.cnr.it