Optional Om Corpus Study Filler Gap Om and Gaps Collocations Om-Omission and Filler-Gap Dependencies Gosse Bouma Centre for Language and Cognition University of Groningen Structure and Evidence in Linguistics, Stanford, April 2013 Gosse Bouma 1/25
Optional Om Corpus Study Filler Gap Om and Gaps Collocations Optional Complementizer Om What explains absence or presence of om ? De Indiërs aarzelen te investeren in Uganda The Indians hesitate to invest in Uganda The Indians hesitate to invest in Uganda Moser had overwogen om zijn avontuur af te blazen Moser had considered COMP his adventure PRT to cancel Moser had considered to cancel his adventure Gosse Bouma 2/25
Optional Om Corpus Study Filler Gap Om and Gaps Collocations Optional Complementizer Om What explains absence or presence of om ? De Indiërs aarzelen om te investeren in Uganda The Indians hesitate COMP to invest in Uganda The Indians hesitate to invest in Uganda Moser had overwogen zijn avontuur af te blazen Moser had considered his adventure PRT to cancel Moser had considered to cancel his adventure Gosse Bouma 2/25
Optional Om Corpus Study Filler Gap Om and Gaps Collocations Optional Complementizer Om What explains absence or presence of om ? De Indiërs aarzelen om te investeren in Uganda The Indians hesitate COMP to invest in Uganda The Indians hesitate to invest in Uganda Moser had overwogen zijn avontuur af te blazen Moser had considered his adventure PRT to cancel Moser had considered to cancel his adventure Filler-gap dependencies as predictor Gap locations inside Om-te-infinitives generally considered ok Gosse Bouma 2/25
Optional Om Corpus Study Filler Gap Om and Gaps Collocations Optional Complementizer Om What explains absence or presence of om ? De Indiërs aarzelen om te investeren in Uganda The Indians hesitate COMP to invest in Uganda The Indians hesitate to invest in Uganda Moser had overwogen zijn avontuur af te blazen Moser had considered his adventure PRT to cancel Moser had considered to cancel his adventure Filler-gap dependencies as predictor Gap locations inside Om-te-infinitives generally considered ok But hardly occurs in corpus data (this talk) Gosse Bouma 2/25
Optional Om Corpus Study Filler Gap Om and Gaps Collocations The Dutch complementizer Om Om as optional complementizer in to-infinitive complements De Kok vraagt V (om) 1 procent van hun inkomen te geven aan het fonds De Kok asks ( COMP ) 1 percent of their income to give to the fund De Kok asks to donate 1 percent of their income to the fund Ik ben niet vrij A (om) daarover te spreken I am not free ( COMP ) about-that to speak I am not free to speak about that Ik hou er niet van P (om) Beverly Hills af te kammen I like there not PRT ( COMP ) Beverly Hills PRT to disrespect I do not like to criticize Beverly Hills Huurders krijgen het recht N (om) mee te praten tenants obtain the right ( COMP ) with to talk Tenants obtain the right to have a say Gosse Bouma 3/25
Optional Om Corpus Study Filler Gap Om and Gaps Collocations Historical Development IJbema 2002 Om originated as preposition Later used as complementizer in purpose modifier clauses Use as complementizer in complement clauses is recent development (rare before 1750) Gosse Bouma 4/25
Optional Om Corpus Study Filler Gap Om and Gaps Collocations Disapproval in Prescriptive Linguistics Overview from Jansen 1987 Brill (1852), no objections Woordenboek der Nederlandse Taal, 1869 (lemma om ): Om behoort altijd een doel, eene bestemming, of eene strekking aan te wijzen” ( Om should always indicate a goal, purpose, or consequence ) WNT, 1934 (lemma te ) : no objections Van Es and Van Caspel (1971-75): Om is superfluous, typical of informal language ‘Nog in 1973 moet de redactie [van Onze Taal] inzenders die om als ‘slokdarmgeluid’ betitelen verdraagzaamheid voorhouden’ ( ’Even in 1973 the editors of Onze Taal had to plea for tolerance to members who described om as a guttural sound ) Algemene Nederlandse Spraakkunst (1984): In spoken language there is a preference for om , leaving om out makes a formal impression Gosse Bouma 5/25
Optional Om Corpus Study Filler Gap Om and Gaps Collocations That-deletion in English complement clauses The athlete realized that her goals would be difficult to achieve Syntactic Complexity Features that play a role in predicting presence of that: complexity of complement clause ( CC ), distance between governor and CC , frequency of governor, complexity of CC subject, subject starts with that , ... Lexical bias (Roland et al 2006) CC s with that that-bias(governor) = ln CC s without that Information Density (Jaeger 2010) occurrences with CC complement-bias(governor) = ln 1 − occurrences with CC Gosse Bouma 6/25
Optional Om Corpus Study Filler Gap Om and Gaps Collocations That-deletion in English complement clauses The athlete realized that her goals would be difficult to achieve Syntactic Complexity Features that play a role in predicting presence of that: complexity of complement clause ( CC ), distance between governor and CC , frequency of governor, complexity of CC subject, subject starts with that , ... Lexical bias (Roland et al 2006) CC s with that that-bias(governor) = ln CC s without that Information Density (Jaeger 2010) occurrences with CC complement-bias(governor) = ln 1 − occurrences with CC Gosse Bouma 6/25
Optional Om Corpus Study Filler Gap Om and Gaps Collocations That-deletion in English complement clauses The athlete realized that her goals would be difficult to achieve Syntactic Complexity Features that play a role in predicting presence of that: complexity of complement clause ( CC ), distance between governor and CC , frequency of governor, complexity of CC subject, subject starts with that , ... Lexical bias (Roland et al 2006) CC s with that that-bias(governor) = ln CC s without that Information Density (Jaeger 2010) occurrences with CC complement-bias(governor) = ln 1 − occurrences with CC Gosse Bouma 6/25
Optional Om Corpus Study Filler Gap Om and Gaps Collocations That-deletion in English complement clauses The athlete realized that her goals would be difficult to achieve Syntactic Complexity Features that play a role in predicting presence of that: complexity of complement clause ( CC ), distance between governor and CC , frequency of governor, complexity of CC subject, subject starts with that , ... Lexical bias (Roland et al 2006) CC s with that that-bias(governor) = ln CC s without that Information Density (Jaeger 2010) occurrences with CC complement-bias(governor) = ln 1 − occurrences with CC Gosse Bouma 6/25
Optional Om Corpus Study Filler Gap Om and Gaps Collocations Corpus Study Data Twente Newspaper Corpus (approx 400M words) Corpus of Spoken Dutch (10M words) Annotation Automatically parsed with the HPSG inspired Alpino parser for Dutch (van Noord 2006) Output is dependency analysis (with phrasal nodes) Gosse Bouma 7/25
Optional Om Corpus Study Filler Gap Om and Gaps Collocations Alpino Dependency Analysis top smain su hd svp vc 1 spreek af 1 af 2 ti ik 0 cmp body te 5 inf su mod svp hd 1 vandaag 3 thuis 4 blijf thuis 6 Ik spreek af vandaag thuis te blijven I arrange to stay at home today Gosse Bouma 8/25
Optional Om Corpus Study Filler Gap Om and Gaps Collocations Alpino Dependency Analysis top smain hd vc ben 2 ppart mod hd vc In OESO-verband is pp spreek af 3 oti afgesproken om die subsidies vanaf 1 januari hd obj1 cmp body in 0 OESO verband 1 om 4 ti te schrappen In OECD context it was cmp body inf agreed to stop those te 10 subsidies as of January, obj1 mod hd 1st np pp schrap 11 det hd hd obj1 mwu die 5 subsidie 6 vanaf 7 mwp mwp 1 8 januari 9 Gosse Bouma 9/25
Optional Om Corpus Study Filler Gap Om and Gaps Collocations Spoken vs. Written Percentage Om in spoken and written material for selected verbs Spoken Written 80 60 40 20 0 vergeet weiger besluit overweeg dwing raad_aan beslis spreek_af neem_voor vraag maak verplicht nodig_uit vind Gosse Bouma 10/25
Optional Om Corpus Study Filler Gap Om and Gaps Collocations Om-bias does not correlate with Complement-bias 100 best−doe op−het−punt−sta zich−geroepen−voel kans−zie in−de−gelegenheid−stel zich−maak_op nodig−heb stem−ga_op tot−doel−heb sta−trappel noodzaak zich−ten−doel−stel schroom tot−taak−heb in−het−werk−stel verzuim van−plan−ben geen−been−zie−in sommeer 80 de−tijd−krijg dwing spoor_aan vertik Complement−bias (percentage) verplicht machtig maan in−hoofd−haal zich−neem_voor zich−beijver op−de−nominatie−sta besluit de−tijd−geef de−tijd−heb zich−span_in haal_over zich−haast 60 overweeg overreed belet bestem_voor draag_op raad_aan help_mee risico−loop voor−elkaar−krijg nodig_uit smeek 40 zich−zet_in kans−schoon−zie durf_aan raad_af moedig_aan zich−permitteer laat_na beloof zeg_toe waag aarzel verleid veroorloof kondig_aan daag_uit 20 vraag beveel_aan spreek_af acht verbied motiveer kom_overeen zich−schaam prikkel beweeg sta_toe verhinder opper spreek_af−met stimuleer vergeet doe−aan zet_aan waarschuw bied_aan belemmer suggereer overtuig presteer breng_op beslis verdien vind kijk_uit laat_toe 0 besta beschouw noem 0 20 40 60 80 Om−bias (percentage) Gosse Bouma 11/25
Recommend
More recommend