Jasmine Bennhr Tbingen, December 5th, 2011 - PowerPoint PPT Presentation

COMPOST Identification of indicators for comp etence assessment o f st udents’ essays: How do Textual Indicators Evolve? Jasmine Bennöhr Tübingen, December 5th, 2011 http://ww.linguistik.hu-berlin.de/institut/professuren/korpuslinguistik/forschung/kompost/compost

Overview I Introduction Purpose, aims and examples II Data III Methodology IV Results and implications V Work in Progress VI Future Work VII Conclusion 2

I Introduction - Aim Aim: find (new/interesting) indicators for language quality in essays Measure how the indicators evolve over time 3

I Introduction: Purpose Purpose: Identify pupils with special needs in language training Side effects Data for developing or improving competence models

II Data – Essay Corpus - Origin Essay corpus – collected during the longitudinal study KESS (Kompetenzen und Einstellungen von Schülerinnen und Schülern – competences and attitudes of pupils) Programme for student assessment KESS: complete survey of a year of pupils in Hamburg Grades 4, 7, 8, 10 (and 12) in years 2003, 2006, 2007, 2009 (and 2011).

II Data – Compost Essay Corpus - Overview Essays N available digitalised [1] rated [2] Test results for N validation KESS4 – 2003 839 ca. 8000 KFT (1 topic) KESS7 – 2006 126 63 and 63 Reading (2 topics) (of appr. 1500) comprehension KESS8 - 2007 1705 1705 C-test, grammar, (13 topics) vocabulary, spelling, reading comprehension KESS10 - 2009 1189 Not yet C-test, spelling, (6 topics) rated, reading 1189 comprehension

II Data: Extract from test booklet Example: task from grade 4 Texts are digitized (typed manually) Interpretation begins when texts are digitalised That is decisions at this point affect results

III Methodology: Annotation and frequencies Annotation Operationalise features that shall be determined automatic annotation Operationalisation which can be applied automatically. How can features be identified? Check quality of annotations Determine frequencies

III Methodology: Annotation is interpretation Only what is annotated can be counted Interpretation is continued Errors can be inserted during annotation

IV Results: Word length word length 5,1 5 4,9 4,8 4,7 4,6 4,5 4,4 4,3 4,2 4,1 KESS4 KESS7 KESS8 KESS10 x: grade y: letters per word

IV Results: Commas commas 6,0000 5,0000 4,0000 3,0000 2,0000 1,0000 0,0000 KESS4 KESS7 KESS8 KESS10 x: grade y: commas per 100 words

IV Results: -heit, -keit, -ung -heit -keit -ung 0,0800 0,2500 1,8000 1,6000 0,0700 0,2000 1,4000 0,0600 1,2000 0,0500 0,1500 1,0000 0,0400 0,8000 0,1000 0,0300 0,6000 0,0200 0,4000 0,0500 0,0100 0,2000 0,0000 0,0000 0,0000 KESS4 KESS8 KESS10 KESS4 KESS8 KESS10 KESS4 KESS8 KESS10 x: grade y: -heit, -keit, -ung per 100 words

IV Results Word length is one of the most reliable features Certain suffixes show an evolvement, but not all

IV Results: Implications Good starting point But, from there we want to go further Word length is a number, cannot be interpreted in terms of content/structure An approach that is motivated more by a linguistic point of view Analysis of suffixes, problem: choice and data sparseness Combine both  look at structure of words and how that develops over time

V Work in Progress Skim through tokens with high word length Look at morphological structure, complexity For simplicity we assume Prefix, Suffix, Lexemes, Flexives We want to look at combinations E. g. Prefix + Lexeme + Suffix Case study with prefix + lexeme + -ung High number of occurrences Example: <Auf><frisch><ung>

V Work in Progress: Preliminary Results <prefix><lexeme+|prefix*|suffix*><ung> 1,2 1 0,8 0,6 0,4 0,2 0 KESS4 KESS7 KESS8 KESS10 x: grade y: forms per 100 words

V Work in Progress: Preliminary Results – KESS4 216 – ung 109 <prefix><lexeme+|prefix*|suffix*><ung> <Ver><mut><ung> <ent><vern><ung> <An><leit><ung>

V Work in Progress: Preliminary Results - KESS8 1511 – ung 569 <prefix><lexeme+|prefix*|suffix*><ung> <An><leit><ung>,<ver><spät><ung>, <Ver><pflicht><ung> <Vor><wahrn><ung>, <er><källt><ung> <Um><satztsteiger><ung> False positives: <Er><derwärm><ung>

V Work in Progress: Preliminary Results - KESS10 902 – ung 457 <prefix><lexeme+|prefix*|suffix*><ung> <Ab><mahn><ung>, <Ver><zweifl><ung>

VI Future Work Type/token ratio – ung bzw. <prefix><lexeme+|prefix*|suffix*><ung>

VI Future Work Focus: How do word structures of students develop? Prefix chains <un><ent> <schied><en> Suffix chains <Tät> <ig><keit>, <Pünkt> <lich><keit> Combination of several prefixes and suffixes <prefix><prefix><lexeme><suffix> <un><be> <greif> <lich>,<un><ver> <kenn> <bar> <prefix><lexeme><suffix><suffix> <Über> <pünkt> <lich><keit>

V Conclusion: Summary Evolvement of indicators over time From surface indicator word length and individual affixes to a more linguistically motivated analysis Word length is not well interpretable but tightly linked to morphological structure Individual affixes (suffixes) Structure of words Qualitative analysis meaningful How do students construct words and how does that develop over time?

Jasmine Bennhr Tbingen, December 5th, 2011 - PowerPoint PPT Presentation

COMPOST Identification of indicators for comp etence assessment o f st udents essays: How do Textual Indicators Evolve? Jasmine Bennhr Tbingen, December 5th, 2011

JASMINE series JASMINE Japan Astrometry Satellite Mission for INfrared Exploration

Student Voices | 2017 Erin Davison and Jasmine Torres-Germack Erin Davison and Jasmine

THE JASMINE PROGRAMME AND JASMINE MICROFINANCE DEVELOPMENT SERVICES About EMN The European

WELCOME TO 5th GRADE Meet the 5th grade teachers Meet the Teachers Welcome to 5th Grade! Glen

TYPS T bingen y east p rogesterone s creen iGEM 2012 Tbingen University Our Team 06/10/2012

Negation, Polarity, N-words Gianina Iord achioaia E BERHARD K ARLS U NIVERSITT T BINGEN

Partial agreement in German: A processing issue? Ilona Steiner SFB 441, University of Tbingen

A standard scale of well-formedness: Why syntax needs boiling and freezing points Sam Featherston

BATH & BODY SS20 TWILIGHT GARDEN TWILIGHT GARDEN FRAGRANCE JASMINE & MANDARIN A

Citizenship for Adopted Children and Stateless Individuals BY JASMINE WONG & ERIC TOH 17

Natural Language Generation Andrea Zugarini SAILab December 5th, 2019 LabMeeting, December 5th

Reinvention of the Library in the Digital Age JILL BENN, UNIVERSITY LIBRARIAN, THE UNIVERSITY OF

Documentation and Structure: Financing Rail, Aircraft, and Marine Assets Merrick Benn Marina

Paiute 101: A Paiute Perspective A CultureA History By Benn Pikyavit Presented to

Aster Group I nvestor Presentation May 2 0 1 8 Bjorn Howard, Group CEO Chris Benn, Group Finance

2014/15 and 2015/16 Neil Benn 07869 37 35 35 nmbenn@hotmail.com Presentation The 14/15

Finely cutting the stem/suffix boundary using MDL John Goldsmith October 2003 Starting

Sparse Compact Directed Acyclic Word Graphs Shunsuke Inenaga (Japan Society for the Promotion of

CMPSC 311- Introduction to Systems Programming Module: Build Processing Professor Patrick

Non-autonomous valency- changing devices in Chitimacha Daniel W. Hieber University of

Superstring Graph in compact space Bastien Cazaux and Eric Rivals LIRMM & IBC,

Efficient Seeds Computation Revisited Michalis Christou, Maxime Crochemore, Costas S.

Session 1 Introduction to C Programming Sbastien Combfis Fall 2019 This work is licensed

Saturnin A suite of lightweight symmetric algorithms for post-quantum security Anne Canteaut 1

Explore More Topics

Sambuz

Useful Links

Newsletter

Mail Us

Jasmine Bennhr Tbingen, December 5th, 2011 - PowerPoint PPT Presentation

COMPOST Identification of indicators for comp etence assessment o f st udents essays: How do Textual Indicators Evolve? Jasmine Bennhr Tbingen, December 5th, 2011

JASMINE series JASMINE Japan Astrometry Satellite Mission for INfrared Exploration

Student Voices | 2017 Erin Davison and Jasmine Torres-Germack Erin Davison and Jasmine

THE JASMINE PROGRAMME AND JASMINE MICROFINANCE DEVELOPMENT SERVICES About EMN The European

WELCOME TO 5th GRADE Meet the 5th grade teachers Meet the Teachers Welcome to 5th Grade! Glen

TYPS T bingen y east p rogesterone s creen iGEM 2012 Tbingen University Our Team 06/10/2012

Negation, Polarity, N-words Gianina Iord achioaia E BERHARD K ARLS U NIVERSITT T BINGEN

Partial agreement in German: A processing issue? Ilona Steiner SFB 441, University of Tbingen

A standard scale of well-formedness: Why syntax needs boiling and freezing points Sam Featherston

BATH &amp; BODY SS20 TWILIGHT GARDEN TWILIGHT GARDEN FRAGRANCE JASMINE &amp; MANDARIN A

Citizenship for Adopted Children and Stateless Individuals BY JASMINE WONG &amp; ERIC TOH 17

Natural Language Generation Andrea Zugarini SAILab December 5th, 2019 LabMeeting, December 5th

Reinvention of the Library in the Digital Age JILL BENN, UNIVERSITY LIBRARIAN, THE UNIVERSITY OF

Documentation and Structure: Financing Rail, Aircraft, and Marine Assets Merrick Benn Marina

Paiute 101: A Paiute Perspective A CultureA History By Benn Pikyavit Presented to

Aster Group I nvestor Presentation May 2 0 1 8 Bjorn Howard, Group CEO Chris Benn, Group Finance

2014/15 and 2015/16 Neil Benn 07869 37 35 35 nmbenn@hotmail.com Presentation The 14/15

Finely cutting the stem/suffix boundary using MDL John Goldsmith October 2003 Starting

Sparse Compact Directed Acyclic Word Graphs Shunsuke Inenaga (Japan Society for the Promotion of

CMPSC 311- Introduction to Systems Programming Module: Build Processing Professor Patrick

Non-autonomous valency- changing devices in Chitimacha Daniel W. Hieber University of

Superstring Graph in compact space Bastien Cazaux and Eric Rivals LIRMM &amp; IBC,

Efficient Seeds Computation Revisited Michalis Christou, Maxime Crochemore, Costas S.

Session 1 Introduction to C Programming Sbastien Combfis Fall 2019 This work is licensed

Saturnin A suite of lightweight symmetric algorithms for post-quantum security Anne Canteaut 1

Explore More Topics

Sambuz

Useful Links

Newsletter

Mail Us

BATH & BODY SS20 TWILIGHT GARDEN TWILIGHT GARDEN FRAGRANCE JASMINE & MANDARIN A

Citizenship for Adopted Children and Stateless Individuals BY JASMINE WONG & ERIC TOH 17

Superstring Graph in compact space Bastien Cazaux and Eric Rivals LIRMM & IBC,