q q q t t q t t t t q t t t t t q t t t t t q t a new valence dictionary for Polish [WG1] t t t q t t t t q t t t t q t t t t t q t t t q t t t q i t t t q t t t PARSEME meeting, Athens, March 2014 Handling MWEs in Walenty, t t q t t t q t q t t t t t q t t t t t t q t t t t t q t t t q t t ul. Jana Kazimierza 5, 01-248 Warszawa POLSKIEJ AKADEMII NAUK INSTYTUT PODSTAW INFORMATYKI aep@ipipan.waw.pl Agnieszka Patejuk t t t q t t t q t t t t t q t t q t t q t t t t q t t t t t q t t t q t t t t t q t t t t q t t t t q t t t q t t t t q t t t q t t t t t t q t t t t q t t t t q t t t t t q t q t q t t t q t t t t q t t t q t t t q t t t t t q t t t t t t q t t q t t t q t t t t t q t t t q t t q t t 1/3 t t q t t t t t q t t t q t t t t t q t t
t t t t t t t q t t t t t t q t t t t t t t t t t t t t t t t t q t t t t t t t t t t t t t t q t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t q t t t t t t t q t t t t q t t t t t t t t t t j t q t t q t t t t q t t q t t t q t t t q t t t t q t t t t t t t t q t t t t q t q t q t q t q t t q t q t q t q t q t q t t q t t q t q t q t t q t q t q t t q t q t t q t t q t q t q t t q t t q t q t t q t q t t q t q t t q t q t t q t t q t q t What is this poster about? q q q t q q t q q q q q q q q q t q q q t q q t q q q q q t t t t q q q q q t q q q q t q q q q t q q t t t t q t q t q q t t t t q q q q q q t t t q q q q q q q q q q q q q q q t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t modelling Polish MWEs together with their syntactic structure framework: Lexical-Functional Grammar (LFG) platform: Xerox Linguistic Environment (XLE) Walenty, a valence dictionary of Polish: open source, available from: zil.ipipan.waw.pl/Walenty developed since 2012, spans 3 projects contains 38874 schemata for 8644 verbs created on the basis of attested data can be used by various formalisms (currently: LFG) accounts for coordination (syntactic positions as sets) accounts for MWEs: internal structure (NP/PP, fixed phrase) interactions with syntax (case assignment for NPs) displayed modification pattern 2/3
t t t t t t t q t t t t t t q t t t t t t t t t t t t t t t t t q t t t t t t t t t t t t t t q t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t q t t t t t t t q t t t t q t t t t t t t t t t j t q t t q t t t t q t t q t t t q t t t q t t t t q t t t t t t t t q t t t t q t q t q t q t q t t q t q t q t q t q t q t t q t t q t q t q t t q t q t q t t q t q t t q t t q t q t q t t q t t q t q t t q t q t t q t q t t q t q t t q t t q t q t What does this poster look like? q q q t q q t q q q q q q q q q t q q q t q q t q q q q q t t t t q q q q q t q q q q t q q q q t q q t t t t q t q t q q t t t t q q q q q q t t t q q q q q q q q q q q q q q q t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t t Handling MWEs in Walenty, a new Institute of Computer Science, Polish Academy of Sciences i valence dictionary for Polish [WG1] Agnieszka Patejuk INTRO 3. MWES IN WALENTY Aim MWE types Modelling Polish MWEs together with their internal syntactic structure ◮ fixed expressions: ⊲ cannot be modified in any way, the exact string is given Means ⊲ fixed(string) ◮ lexicalised phrases: ◮ framework: Lexical-Functional Grammar (LFG) ⊲ nominal: lexnp(case,number,lemma,mod) ◮ platform: Xerox Linguistic Environment (XLE) ⊲ prepositional: preplexnp(preposition,case,number,lemma,mod) ◮ valence dictionary: Walenty ⊲ typical information: case, preposition form ⊲ extra information: number, lemma, modification pattern 1. LFG Modification patterns Formalism ◮ natr : modification not allowed ◮ atr : modification allowed (though not necessary) ◮ constraint-based, highly lexicalised ◮ ratr : modification required (often possessive, NP or adjective) ◮ parallel levels of representation: ◮ batr : specific modification required (possessive: swój or własny , ‘own’) S ‘ PECK � 1 , 2 � ’ PRED Examples � � SUBJ 1 PRED ‘ STORK ’ ◮ subj { np(str) } + obj { np(str) } + { fixed(’na kwaśne jabłko’) } DP VP DP � � Zbił ich na (*bardzo) kwaśne jabłko/*jabłka. ( ↑ SUBJ ) = ↓ ↑ = ↓ ( ↑ OBJ ) = ↓ ‘ STARLING ’ OBJ 2 PRED beat then for very sour apple. sg/pl a stork pecked a starling TENSE PAST ‘He beat them to a pulp.’ (literally: ‘He beat them into a sour apple.’) ◮ subj { lexnp(str,sg,’krew’,atr) } + ◮ analyses of diverse languages (English, Warlpiri, Russian, Urdu. . . ) { preplexnp(w,loc,pl,’żyła’,ratr) } ◮ LFG grammars may be implemented in XLE (Gorąca) krew/*krwie płynie/*płyną w *(jej/Marysi/tych) żyłach/*żyle. ◮ attempts at commercial use (Bing search engine) hot blood. sg/pl flow. sg/pl in her/Mary’s/those vein. pl/sg ‘(Hot) blood flows in her/Mary’s/those veins.’ POLFIE 3/3 ◮ subj { np(str) } + { cp(że) } + { lexnp(str,sg,’głowa’,natr) } Daję (*swoją/mądrą) głowę/*głowy, że przyjdą. ◮ an LFG grammar of Polish implemented in XLE
Recommend
More recommend