a probabilistic approach to diachronic phonology
play

A Probabilistic Approach to Diachronic Phonology Alexandre - PowerPoint PPT Presentation

A Probabilistic Approach to Diachronic Phonology Alexandre Bouchard-C ot e Percy Liang Tom Griffiths Dan Klein Languages evolve Gloss Latin Italian Spanish Portuguese Word/verb verbum verbo verbo verbu Fruit fructus frutta


  1. A Probabilistic Approach to Diachronic Phonology Alexandre Bouchard-Cˆ ot´ e Percy Liang Tom Griffiths Dan Klein

  2. Languages evolve Gloss Latin Italian Spanish Portuguese Word/verb verbum verbo verbo verbu Fruit fructus frutta fruta fruta Laugh ridere ridere reir rir Center centrum centro centro centro August augustus agosto agosto agosto Swim natare nuotare nadar nadar . . .

  3. Language evolution Gloss Latin Italian Spanish Portuguese Word/verb verbum verbo verbo verbu Fruit fructus frutta fruta fruta Laugh ridere ridere reir rir Center centrum centro centro centro August augustus agosto agosto agosto Swim natare nuotare nadar nadar . . . • Phonological rules more regular than morphological or syntactic ones • basis of the comparative method

  4. Example of a mutation process as seen by the comparative method la vl ib it es pt • ib : Proto-ibero Romance • vl : Vulgar Latin

  5. Example of a mutation process as seen by the comparative method la u → o / some context m → / some context ........ .... .... .. . ........ vl .... ........ .. .... . .. ib it . ........ .... .. es pt . • Deterministic re-write rules at each branch • Activated by some context

  6. Example of a mutation process as seen by the comparative method /werbum/ (la) u → o / some context m → / some context ........ .... .... .. . /verbo/ (vl) ........ .... ........ .. .... . .. /ve ɾ bo/ (ib) /v ɛɾ bo/ (it) . ........ .... .. /be ɾ bo/ (es) /ve ɾ bu/ (pt) . Gloss Latin Italian Spanish Portuguese Word/verb verbum verbo verbo verbu

  7. Example of a mutation process as seen by the comparative method /kentrum/ (la) u → o / some context m → / some context ........ .... .... .. . ........ / ʧ entro/ (vl) .... ........ .. .... . .. /sent ɾ o/ (ib) / ʧ ɛ ntro/ (it) . ........ .... .. /sent ɾ o/ (es) /semt ɾ u/ (pt) . Gloss Latin Italian Spanish Portuguese Word/verb verbum verbo verbo verbu Center centrum centro centro centro . . .

  8. Example of a mutation process as seen by the comparative method la vl ib it es pt • In practice, the ancient words and/or the evolutionary tree are unknown • Methodology: manually inspecting the data

  9. Our work: • A probabilistic model that captures phonological aspects of language change. • Many usages: ? /kwinto/ ? ? /kinto/ Reconstruction of word forms (ancient and modern)

  10. Our work: • A probabilistic model that captures phonological aspects of language change. • Many usages: /kwintam/ ? ? /kinta/ /kwinto/ ? ? /kinto/ /kimtu/ Inference of phonological rules

  11. Our work: • A probabilistic model that captures phonological aspects of language change. • Many usages: /kwintam/ / k i n t a / /kinto/ vs. /kwinto/ /kimtu/ /kwintam/ /kwinto/ k i n t a / / /kinto/ /kimtu/ Selection of phylogenies

  12. Our work: • A probabilistic model that captures phonological aspects of language change. • Many usages: – Reconstruction of word forms (ancient and modern) – Inference of phonological rules – Selection of phylogenies • An inference procedure and experiments on all three applications • A new task and evaluation framework

  13. The model

  14. Big picture la vl it es • Assume for now that the tree topology is known

  15. Big picture /werbum/ la /kentrum/ ... /ve ɾ bu/ vl / ʧ entro/ ... /be ɾ bo/ /v ɛ rbo/ it es /sent ɾ o/ / ʧ entro/ ... ... • Assume for now that the tree topology is known • Track individual words

  16. ɔ Stochastic edit model /werbum/ /fokus/ # # ... f o k u s f w k o /ve ɾ bu/ /fw ɔ ko/ ... ... ... ... ... • Let’s look at how a single words evolve along one of the edges of the tree • Mutation of Latin FOCUS (/fokus/) into Italian fuoco (/fw O ko/) (fire)

  17. ɔ Stochastic edit model: operations # # f o k u s f w k o • Substitution

  18. ɔ Stochastic edit model: operations # # f o k u s f w k o • Substitution (incl. self-substitution)

  19. ɔ Stochastic edit model: operations # # f o k u s f w k o • Substitution (incl. self-substitution) • Insertion

  20. ɔ Stochastic edit model: operations # # f o k u s f w k o • Substitution (incl. self-substitution) • Insertion • Deletion

  21. ɔ Stochastic edit model: context # # f o k u s f w ? o • Distribution over operations conditioned on adjacent phonemes

  22. ɔ Stochastic edit model: generation process # # f o k u s f w k o

  23. Stochastic edit model: generation process # # f o k u s ?

  24. Stochastic edit model: generation process # # f o k u s f w • P ( f → f w / # V ) = 0 . 05

  25. Stochastic edit model: generation process # # f o k u s f w ? • P ( f → f w / # V ) = 0 . 05

  26. ɔ Stochastic edit model: generation process # # f o k u s f w • P ( f → f w / # V ) = 0 . 05 • P ( o → O / C V ) = 0 . 1

  27. ɔ Stochastic edit model: generation process # # f o k u s f w k o • P ( f → f w / # V ) = 0 . 05 • P ( o → O / C V ) = 0 . 1 • . . . • P (/fokus/ → /fw O ko/)) = 0 . 05 × 0 . 1 × · · ·

  28. Edit parameters /werbum/ la /kentrum/ ... /ve ɾ bu/ vl / ʧ entro/ ... /be ɾ bo/ /v ɛ rbo/ it es /sent ɾ o/ / ʧ entro/ ... ...

  29. Edit parameters P /werbum/ la /kentrum/ ... θ la → vl /ve ɾ bu/ vl / ʧ entro/ θ la → es ... θ la → es /be ɾ bo/ /v ɛ rbo/ it es /sent ɾ o/ / ʧ entro/ ... ... • One set of parameter θ A → B for each edge A → B in the tree • Shared across all word forms evolving along this edge

  30. Edit parameters θ la → vl /ve ɾ bu/ / ʧ entro/... • θ A → B specifies P (operation | context) context operation P (operation | context) u m # deletion 0.1 u m # substitution to /m/ 0.8 u m # substitution to /b/ 0.1 a c b deletion 0.8 a c b insertion of c 0.1 . . . . . . . . . . . .

  31. Distribution on the edit parameters • Too many parameters • Addressed by: – Sparsity prior: independent Dirichlet priors (one for each context) – Group context distributions. Example: context operation P (operation | context) V m # deletion 0.1 V m # substitution to /a/ 0.8 V m # substitution to /b/ 0.1 V c C deletion 0.8 V c C insertion of c 0.1 . . . . . . . . . . . .

  32. Inference and experiments

  33. Inference: EM • Exact E step is intractable – We use a stochastic E step based on Gibbs sampling • E: fix the edit parameters, resample the derivations • M: update the edit parameters from expected edit counts

  34. � � Automatic extraction of a Romance corpus � XML dump Wiktionary � � � � � � � � � � � � � � Align. � Closure � Cognate detector Bible � � � � � � � � � � � � � � � Align. Europarl • Noisier than manually curated cognate lists • More data available • Our model overcomes this noise Data available online: http://nlp.cs.berkeley.edu/pages/historical.html

  35. Reconstruction of ancient word forms • Task: reconstruction of Latin given all of the Spanish and Italian words, and some of the Latin words • Evaluation: uniform cost edit distance on held-out data • Baseline: pick one of the modern languages at random

  36. Reconstruction of ancient word forms • Task: reconstruction of Latin given all of the Spanish and Italian words, and some of the Latin words • Example: “teeth”, nearly correctly reconstructed /d E ntis/ i → E s → E → j E /dj E ntes/ /d E nti/ • Numbers: Language Baseline Model Improvement Latin 2.84 2.34 9%

  37. Reconstruction of word forms • Evaluation: uniform cost edit distance on held-out data • Baseline: pick one of the modern languages at random • Example: “teeth”, nearly correctly reconstructed /d E ntis/ i → E s → E → j E /dj E ntes/ /d E nti/ • Numbers: Language Baseline Model Improvement Latin 2.84 2.34 9% Spanish 3.59 3.21 11%

  38. Inference of phonological rules la vl ib it es pt • ib : Proto-ibero Romance • vl : Vulgar Latin

  39. Inference of phonological rules la m → / _ # 0.92 u → o / _ 0.87 ......... ..... ..... ... ....... .... ... . .... ... ... . ......... ..... vl ....... .... .... ... ......... ..... ....... .... ... . .... ... ib it ... . ......... ..... ....... .... .... ... es pt ... . • Reconstruct the internal nodes • Focus on the rules used most often during the last E step

  40. Hypothesized derivation for “word” along with top rules /werbum/ (la) m → / _ # u → o / _ m → w → v / many environments u → o ... w → v /verbo/ (vl) r → ɾ e → ɛ ... ... • Comparison with historical evidence: the Appendix Probi coluber non colober passim non passi

Recommend


More recommend