compositions of extended top down tree transducers
play

Compositions of Extended Top-down Tree Transducers Andreas Maletti - PowerPoint PPT Presentation

Compositions of Extended Top-down Tree Transducers Andreas Maletti March 30, 2007 Short Introduction Motivation Extended tree transducers are used in machine translation [Knight & Graehl 05, Shieber 04] Compositions occur naturally


  1. Compositions of Extended Top-down Tree Transducers Andreas Maletti March 30, 2007

  2. Short Introduction Motivation ◮ Extended tree transducers are used in machine translation [Knight & Graehl 05, Shieber 04] ◮ Compositions occur naturally 1. transducers for specific (small) tasks are easier to train 2. small transducers are simpler to understand 3. “component” tree transducers can be reused

  3. Short Introduction Motivation ◮ Extended tree transducers are used in machine translation [Knight & Graehl 05, Shieber 04] ◮ Compositions occur naturally 1. transducers for specific (small) tasks are easier to train 2. small transducers are simpler to understand 3. “component” tree transducers can be reused ◮ Extended tree transducers are (essentially) as powerful as tree substitution grammars [Knight & Graehl & Hopkins 07] ◮ Closure under composition of synchronous tree substitution grammar transformations open (since introduction in 80’s)

  4. Outline Extended Top-down Tree Transducer Bimorphism Multi Bottom-up Tree Transducer Composition

  5. Principal Problem of Top-down Tree Transducers S S PRO VP PR NP = ⇒ ∗ There VB NP Hay CD NN are CD NN dos hombres men two

  6. Principal Problem of Top-down Tree Transducers S S PRO VP PR NP = ⇒ ∗ There VB NP Hay CD NN are CD NN dos hombres men two Notes: ◮ difficult to implement without regular look-ahead ◮ solution: use copying

  7. Principal Problem of Top-down Tree Transducers S S PRO VP PR NP = ⇒ ∗ There VB NP Hay CD NN are CD NN dos hombres men two Notes: ◮ difficult to implement without regular look-ahead ◮ solution: use copying — No! — closure under composition

  8. The new device Why do we not have multi-level rules? [Knight, Graehl: Training Tree Transducers. HLT-NAACL 2004]

  9. The new device Why do we not have multi-level rules? [Knight, Graehl: Training Tree Transducers. HLT-NAACL 2004] Then we could have rules like trans S S = ⇒ PR trans PRO VP x Hay x There VB are

  10. Formal Syntax Definition (cf. Knight & Graehl 04) An extended top-down tree transducer is a tuple M = ( Q , Σ , ∆ , S , R ) ◮ Q a finite set of states ◮ Σ and ∆ input and output ranked alphabet, respectively; ◮ S ⊆ Q a set of initial states

  11. Formal Syntax Definition (cf. Knight & Graehl 04) An extended top-down tree transducer is a tuple M = ( Q , Σ , ∆ , S , R ) ◮ Q a finite set of states ◮ Σ and ∆ input and output ranked alphabet, respectively; ◮ S ⊆ Q a set of initial states ◮ R ⊆ Q ( T Σ ( X )) × T ∆ ( Q ( X )) a finite set of rules such that var ( r ) ⊆ var ( l ) and l is linear for every rule ( l , r ) ∈ R .

  12. An extended top-down tree transducer Example ◮ Q = S = { ⋆ } ; ◮ Σ = ∆ = { σ ( 2 ) , α ( 0 ) } ; ◮ R contains the rules ⋆ ( σ ( σ ( x 1 , x 2 ) , x 3 )) → σ ( ⋆ ( x 1 ) , σ ( ⋆ ( x 2 ) , ⋆ ( x 3 ))) ⋆ ( α ) → α σ ⋆ σ ⋆ σ ⋆ → → α x 3 x 1 σ ⋆ ⋆ α x 1 x 2 x 2 x 3

  13. ... in action Example Rules: ⋆ ( σ ( σ ( x 1 , x 2 ) , x 3 )) → σ ( ⋆ ( x 1 ) , σ ( ⋆ ( x 2 ) , ⋆ ( x 3 ))) ⋆ ( α ) → α Derivation: σ σ σ α σ σ α σ α α α α α

  14. ... in action Example Rules: ⋆ ( σ ( σ ( x 1 , x 2 ) , x 3 )) → σ ( ⋆ ( x 1 ) , σ ( ⋆ ( x 2 ) , ⋆ ( x 3 ))) ⋆ ( α ) → α Derivation: ⋆ σ σ σ α σ σ α σ α α α α α

  15. ... in action Example Rules: ⋆ ( σ ( σ ( x 1 , x 2 ) , x 3 )) → σ ( ⋆ ( x 1 ) , σ ( ⋆ ( x 2 ) , ⋆ ( x 3 ))) ⋆ ( α ) → α Derivation: σ ⋆ σ σ ⋆ σ σ α ⋆ ⋆ ⇒ α σ σ α σ σ σ α α α σ α σ α α α α α α α

  16. ... in action Example Rules: ⋆ ( σ ( σ ( x 1 , x 2 ) , x 3 )) → σ ( ⋆ ( x 1 ) , σ ( ⋆ ( x 2 ) , ⋆ ( x 3 ))) ⋆ ( α ) → α Derivation: σ σ ⋆ σ ⋆ σ α ⋆ ⋆ α σ σ ⇒ 2 σ σ ⋆ σ ⋆ σ σ α σ α α ⋆ ⋆ α ⋆ ⋆ α α α α α α α α

  17. ... in action Example Rules: ⋆ ( σ ( σ ( x 1 , x 2 ) , x 3 )) → σ ( ⋆ ( x 1 ) , σ ( ⋆ ( x 2 ) , ⋆ ( x 3 ))) ⋆ ( α ) → α Derivation: σ σ ⋆ σ α σ α σ σ σ σ ⇒ 7 ⋆ σ ⋆ σ α σ α σ α ⋆ ⋆ α ⋆ ⋆ α α α α α α α α

  18. ... in action Example Rules: ⋆ ( σ ( σ ( x 1 , x 2 ) , x 3 )) → σ ( ⋆ ( x 1 ) , σ ( ⋆ ( x 2 ) , ⋆ ( x 3 ))) ⋆ ( α ) → α Derivation: σ σ σ σ α σ α σ σ α σ σ ⇒ ∗ σ α α α α σ α σ α α α α α α

  19. Semantics Definition The tree transformation computed by M is τ M ⊆ T Σ × T ∆ τ M = { ( t , u ) | q ( t ) ⇒ ∗ u for some initial state q } Notation XTOP = class of transf. computed by extended tree transducers

  20. Syntactic Restrictions Let M = ( Q , Σ , ∆ , S , R ) be an extended tree transducer. Definition M is called linear and nondeleting if for every rule l → r var ( l ) = var ( r ) and no variable appears more than once in r . Example Our example transducer with rules ⋆ ( σ ( σ ( x 1 , x 2 ) , x 3 )) → σ ( ⋆ ( x 1 ) , σ ( ⋆ ( x 2 ) , ⋆ ( x 3 ))) ⋆ ( α ) → α is linear and nondeleting.

  21. Quest Log Question Is the class of transformations computed by linear and nondeleting extended tree transducers closed under composition? Answer [Knight & Graehl & Hopkins 07]

  22. Quest Log Question Is the class of transformations computed by linear and nondeleting extended tree transducers closed under composition? Answer [Knight & Graehl & Hopkins 07] No! σ γ m γ i δ σ α γ j γ m Transform into γ k γ j γ k α α α α α Two linear and nondeleting extended tree transducers can do that; but a single one cannot.

  23. Quest Log Open Problems ◮ Understand linear and nondeleting extended tree transducers better! ◮ Find subclasses that are closed under composition! ◮ Identify a suitable superclass that is closed under composition!

  24. Quest Log Open Problems ◮ Understand linear and nondeleting extended tree transducers better! (bimorphism) ◮ Find subclasses that are closed under composition! (unsolved) ◮ Identify a suitable superclass that is closed under composition! (transformations induced by certain bottom-up devices)

  25. Extended Top-down Tree Transducer Bimorphism Multi Bottom-up Tree Transducer Composition

  26. Bimorphism Let Σ , ∆ , Γ be ranked alphabets. Definition A bimorphism is a triple ( ϕ, L , ψ ) with ◮ ϕ : T Γ → T Σ the input homomorphism; ◮ L ⊆ T Γ the recognizable center; ◮ ψ : T Γ → T ∆ the output homomorphism. Definition Let B = ( ϕ, L , ψ ) be a bimorphism. The tree transformation computed by B is τ B ⊆ T Σ × T ∆ τ B = { ( ϕ ( s ) , ψ ( s )) | s ∈ L } Equivalently: τ B = ϕ − 1 ◦ id L ◦ ψ (composition of relations)

  27. Illustration Example ( ϕ, L , ψ ) bimorphism with ◮ Σ = ∆ = { σ ( 2 ) , α ( 0 ) } and Γ = { γ ( 3 ) , α ( 0 ) } ; ◮ L = T Γ ; ◮ ϕ and ψ be the homomorphisms such that ϕ ( γ ) = σ ( σ ( x 1 , x 2 ) , x 3 ) ψ ( γ ) = σ ( x 1 , σ ( x 2 , x 3 )) ϕ ( α ) = α ψ ( α ) = α

  28. Semantics γ γ γ α α α α α α α

  29. Semantics γ γ γ α α α α α α α ϕ ψ ϕ ψ γ γ γ γ α γ γ α α α α α α α α α α α α α

  30. Semantics γ γ γ α α α α α α α ϕ ψ σ σ ϕ ψ σ σ ϕ ϕ γ α ψ ψ γ α α α α γ γ α α α α α α α α α

  31. Semantics γ γ γ α α α α α α α ϕ ψ σ σ α σ σ σ σ σ ϕ α σ σ σ σ ψ ψ ϕ ϕ ϕ σ α ϕ ϕ α α α α α ψ ψ ψ ψ α α α α α α

  32. Semantics γ γ γ α α α α α α α ϕ ψ σ σ σ σ α σ α σ σ α σ σ σ α α α α σ α σ α α α α α α

  33. A Relation Definition Homomorphism h : T Γ → T Σ is linear and complete if h ( γ ) is linear and nondeleting in X k for every k ≥ 0 and γ ∈ Γ ( k ) . Theorem (Knight & Graehl & Hopkins 07, M. 07) Bimorphisms with linear and complete homomorphisms are as powerful as linear and nondeleting extended tree transducers. BM ( LC , LC ) = ln- XTOP

  34. A Relation Definition Homomorphism h : T Γ → T Σ is linear and complete if h ( γ ) is linear and nondeleting in X k for every k ≥ 0 and γ ∈ Γ ( k ) . Theorem (Knight & Graehl & Hopkins 07, M. 07) Bimorphisms with linear and complete homomorphisms are as powerful as linear and nondeleting extended tree transducers. BM ( LC , LC ) = ln- XTOP Theorem (Arnold & Dauchet 82) Bimorphisms with linear and complete ε -free homomorphisms are not closed under composition. BM ( LCE , LCE ) ⊂ BM ( LCE , LCE ) 2 = BM ( LCE , LCE ) 3

  35. Quest Log Achievement We showed that extended tree transducers consist of three (simple) phases: ◮ an inverse homomorphism (pattern matcher) ◮ a recognizable restriction (finite control) ◮ an output homomorphism (interpretation) Question ◮ Which device can implement all phases? ◮ Is the class of transformations computed by the device closed under composition?

  36. Extended Top-down Tree Transducer Bimorphism Multi Bottom-up Tree Transducer Composition

Recommend


More recommend