Linear-Time Algorithm for Morphic Imprimitivity Testing Tomasz - PowerPoint PPT Presentation

Linear-Time Algorithm for Morphic Imprimitivity Testing Tomasz Kociumaka 1 Jakub Radoszewski 1 Wojciech Rytter 1 , 2 Tomasz Waleń 3 , 1 1 Faculty of Mathematics, Informatics and Mechanics, University of Warsaw { kociumaka,jrad,rytter,walen } @mimuw.edu.pl 2 Faculty of Mathematics and Computer Science, Nicolaus Copernicus University, Toruń 3 International Institute of Molecular and Cell Biology in Warsaw LATA 2013, 2013–04–05 1/24

Outline 1. Problem definition 2. Short introduction to existing solutions 3. Description of the new linear time solution 2/24

Problem definition Morphic Imprimitivity Testing For a input word w ∈ Σ n , is there a non-trivial morphism h such that: h ( w ) = w Non-trivial means that h should not be an identity function. The word w is non-primitive if such morphism exists, otherwise it is primitive . 3/24

Problem definition Morphic Imprimitivity Testing For a input word w ∈ Σ n , is there a non-trivial morphism h such that: h ( w ) = w Non-trivial means that h should not be an identity function. The word w is non-primitive if such morphism exists, otherwise it is primitive . Previous results ◮ it can be solved in O (( | Σ | + log n ) · n ) time (S. Holub 2009), ◮ slightly improved to O ( | Σ | · n ) time (S. Holub, V. Matocha, arXiv 2012). 3/24

Example Simple case Let w = abaacaca 4/24

Example Simple case Let w = abaacaca Letter b appears only once, so we can take: h ( a ) = ǫ (empty word) h ( b ) = abaacaca h ( c ) = ǫ 4/24

Example Simple case Let w = abaacaca Letter b appears only once, so we can take: h ( a ) = ǫ (empty word) h ( b ) = abaacaca h ( c ) = ǫ More complicated case Let w = aacabaaaacaacabaa 4/24

Example Simple case Let w = abaacaca Letter b appears only once, so we can take: h ( a ) = ǫ (empty word) h ( b ) = abaacaca h ( c ) = ǫ More complicated case Let w = aac abaa aac aac abaa we can take: h ( a ) = ǫ h ( b ) = abaa h ( c ) = aac 4/24

Problem applications Closely connected to several topics in formal language theory, and combinatorics on words: ◮ fixed points of morphisms, ◮ pattern languages, ◮ ambiguity of the morphisms. 5/24

Problem applications Closely connected to several topics in formal language theory, and combinatorics on words: ◮ fixed points of morphisms, ◮ pattern languages, ◮ ambiguity of the morphisms. Reviewer’s opinion Although I cannot think of any actual applications, I find this question to be very natural 5/24

How to solve it? - Intuition Theorem For a word w , if there exists non-trivial morphism h , such that h ( w ) = w , then there exists non-trivial morphism h ′ such that: ◮ h ′ ( w ) = w ◮ for all immortal letters x ∈ E : h ′ ( x ) = l x x r x (i.e. h ′ ( b ) = abaa ) ◮ for all mortal letters x �∈ E : h ′ ( x ) = ǫ 6/24

How to solve it? - Intuition Theorem For a word w , if there exists non-trivial morphism h , such that h ( w ) = w , then there exists non-trivial morphism h ′ such that: ◮ h ′ ( w ) = w ◮ for all immortal letters x ∈ E : h ′ ( x ) = l x x r x (i.e. h ′ ( b ) = abaa ) ◮ for all mortal letters x �∈ E : h ′ ( x ) = ǫ w = h ( a ) = cdac c d a c d b c d b c c d a c h ( b ) = dbc h ( c ) = ǫ h ( w ) = h ( d ) = ǫ c d a c d b c d b c c d a c a,b – immortal letters, c,d – mortal letters. 6/24

Holub’s algorithm The algorithm maintains three sets: ◮ E – set of candidates for immortal letters, ◮ L and R – sets of interpositions. 7/24

Holub’s algorithm The algorithm maintains three sets: ◮ E – set of candidates for immortal letters, ◮ L and R – sets of interpositions. Algorithm: ◮ start with empty sets E = L = R = ∅ , ◮ apply rules (a)-(e) (in any order), to obtain fixed-point. 7/24

Holub’s algorithm The algorithm maintains three sets: ◮ E – set of candidates for immortal letters, ◮ L and R – sets of interpositions. Algorithm: ◮ start with empty sets E = L = R = ∅ , ◮ apply rules (a)-(e) (in any order), to obtain fixed-point. From triple ( E , L , R ) the actual morphism can be obtained: ◮ if the set E � = Σ , then the morphism is non-trivial, ◮ from L , R we can deduce a way to divide input word to obtain morphism. 7/24

Holub’s rule (a) – initialization of the algorithm L := L ∪ { 0 , n } , R := R ∪ { 0 , n } Example: L,R L,R c c a a b a a c a a a c a a b a a c a c 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 8/24

Holub’s rule (b) – initialization of immortal letters if w [ i ] ∈ E then L := L ∪ { i − 1 } and R := R ∪ { i } , Example: L R c c a a b b a a c a a a c a a b a a c a c 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 i 9/24

Holub’s rule (c) – neighborhood marking The neighborhood of letter x – n x is the maximum factor that surrounds each occurrence of letter x in w . if w [ i .. j ] = n x for some x ∈ E then R := R ∪ { i − 1 } and L := L ∪ { j } , Example: R L c c a a b b a a c a a a c a a b b a a c a c 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 n b n b i j 10/24

Holub’s rule (d) – copying rules if w [ i .. j ] = w [ i ′ .. j ′ ] = n a for some a ∈ E and i − 1 ≤ k ≤ j then if w [ k ] ∈ L then L := L ∪ { i ′ + ( k − i ) } if w [ k ] ∈ R then R := R ∪ { i ′ + ( k − i ) } Example: R L R L R L R L c c a a b b a a c a a a c a a b b a a c a c 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 n b i ′ n b j ′ i j 11/24

Holub’s rule (d) – copying rules if w [ i .. j ] = w [ i ′ .. j ′ ] = n a for some a ∈ E and i − 1 ≤ k ≤ j then if w [ k ] ∈ L then L := L ∪ { i ′ + ( k − i ) } if w [ k ] ∈ R then R := R ∪ { i ′ + ( k − i ) } Example: R L R L R L R L c c a a b b a a c a a a c a a b b a a c a c 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 n b i ′ n b j ′ i j Problem This rule is hard to implement efficiently! 11/24

Holub’s rule (e) – new immortals letters if i < j , i ∈ L , j ∈ R then add α ( w [( i + 1 ) .. j ]) to E — letter c ∈ w [( i + 1 ) .. j ] that has smallest number of occurrences in word w . Example: L R c c a a b b a a c a a a c a a b a a c a c 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 i j 12/24

Holub’s algorithm summary Theorem Extending a correct triple ( E , L , R ) using any of the rules (a)-(e) leads to a correct triple. In particular, if any sequence of actions corresponding to (a)-(e) leads to E = Σ then w is morphically primitive. 13/24

Holub’s algorithm summary Theorem Extending a correct triple ( E , L , R ) using any of the rules (a)-(e) leads to a correct triple. In particular, if any sequence of actions corresponding to (a)-(e) leads to E = Σ then w is morphically primitive. This is quite suprising that this set of simple rules, provides the solution for the problem. 13/24

Holub’s algorithm summary ◮ simple implementation requires O ( n 2 ) time, ◮ this time complexity can be slightly improved using some preprocessing and data structures, ◮ unfortunately the obtaining linear time seems to be difficult task: ◮ the non-determinism in rules choice is problematic, ◮ rule (d) is the main bottleneck (it operates globally on the word). 14/24

What we have done? Outline ◮ modified set of rules (a),(b’)–(e’), that are equivalent to Holub’s rules but are easier to implement, ◮ strict ordering of rules application, ◮ new data structures to speed up the processing time. 15/24

What we have done? Outline ◮ modified set of rules (a),(b’)–(e’), that are equivalent to Holub’s rules but are easier to implement, ◮ strict ordering of rules application, ◮ new data structures to speed up the processing time. Result As a consequence we obtained O ( n ) running time algorithm. 15/24

New neighborhood definitions We introduced new definitions of neighborhood, to capture essential local neighborhood of the characters/word positions. R R R R R · · · e 1 e 2 · · · e i γ left ( e ) γ right ( e ) γ left ( i ) γ right ( i ) left ( i ) right ( i ) r e l e 16/24

New neighborhood definitions l e – the length of the longest common suffix of all prefixes ending with e (minus 1) in word w . r e – the length of the longest common prefix of all suffixes starting with e (minus 1) in word w , R R R R R · · · e 1 e 2 · · · e i γ left ( e ) γ right ( e ) γ left ( i ) γ right ( i ) left ( i ) right ( i ) r e l e 16/24

New neighborhood definitions left ( i ) = min ( l w [ i ] , i − pred E ( i ) − 1 ) right ( i ) = min ( r w [ i ] , succ E ( i ) − i − 1 ) R R R R R · · · e 1 e 2 · · · e i γ left ( e ) γ right ( e ) γ left ( i ) γ right ( i ) left ( i ) right ( i ) r e l e 16/24

New neighborhood definitions γ left ( i ) = i − pred R ( i ) − 1 γ right ( i ) = pred R ( i + right ( i ) + 1 ) − i R R R R R · · · e 1 e 2 · · · e i γ left ( e ) γ right ( e ) γ left ( i ) γ right ( i ) left ( i ) right ( i ) r e l e 16/24

Linear-Time Algorithm for Morphic Imprimitivity Testing Tomasz - PowerPoint PPT Presentation

Linear-Time Algorithm for Morphic Imprimitivity Testing Tomasz Kociumaka 1 Jakub Radoszewski 1 Wojciech Rytter 1 , 2 Tomasz Wale 3 , 1 1 Faculty of Mathematics, Informatics and Mechanics, University of Warsaw { kociumaka,jrad,rytter,walen }

CS 7616 Pattern Recognition Linear, Linear, Linear Aaron Bobick School of Interactive

PALINDROMES IN PURE MORPHIC WORDS Michelangelo Bucci with Elise Vaslet a ab b

Levels of Testing Chapter 12 Beyond unit testing Developer Testing stages Unit testing

Testing Terminology System testing Types of errors Function testing Structure

can be done in linear time? Yuri Gurevich Tallinn, April 29, 2014 Agenda 1. What linear time?

Property-Based Testing Matt Bachmann @mattbachmann Testing is Important Testing is Important

Software Testing Overview What is software testing? General testing criteria Testing

Software testing Software Testing Introduction Testing levels Automated testing Principles and

1. Test page This page is for testing. This page is for testing. This page is for testing.

Odds Algorithm An Online Algorithm Group Fibonado 20. Dec 2016 Group Fibonado Odds Algorithm

Graphics 2014 Linear Algebra II Linear Maps & Matrices Linear Maps & Matrices CORE

Cycle time: 40 sec Cycle time: 12 sec Cycle time: 0.75 sec Cycle time: 1.25 sec Cycle time: 5

Overview Objective Types of testing ECE 553: TESTING AND Verification testing

Object Oriented Testing Chapter 23 1 OO Testing Class Testing: Equivalent to unit testing

Software Testing Software testing 1 V model Software testing 2 Program testing goals To

Algorithm Analysis October 12, 2016 CMPE 250 Algorithm Analysis October 12, 2016 1 / 66

Combinatorial approaches to RNA folding Part III: Stocastic algorithms via language theory

CS 591 S2Formal Language Theory: Integrating Experimentation and ProofFall 2019 Instructor

JUST THE MATHS SLIDES NUMBER 13.4 INTEGRATION APPLICATIONS 4 (Lengths of curves) by

Superposition with Lambdas Alexander Bentkamp Jasmin Blanchette Sophie Tourret Petar Vukmirovi

MA/CSSE 474 Theory of Computation Decision Problems, Continued DFSMs Your Questions? HW1

Class 26: review for final exam solutions, 18.05, Spring 2014 Four ways to fill each slot: 4 5 .

Secure Two-Party Distribution Testing Alexandr Andoni Tal Malkin Negev Shekel Nosatzki

Explaining the Postgres Query Optimizer B RUCE M OMJIAN The optimizer is the "brain" of

Linear-Time Algorithm for Morphic Imprimitivity Testing Tomasz - PowerPoint PPT Presentation

Linear-Time Algorithm for Morphic Imprimitivity Testing Tomasz Kociumaka 1 Jakub Radoszewski 1 Wojciech Rytter 1 , 2 Tomasz Wale 3 , 1 1 Faculty of Mathematics, Informatics and Mechanics, University of Warsaw { kociumaka,jrad,rytter,walen }

CS 7616 Pattern Recognition Linear, Linear, Linear Aaron Bobick School of Interactive

PALINDROMES IN PURE MORPHIC WORDS Michelangelo Bucci with Elise Vaslet a ab b

Levels of Testing Chapter 12 Beyond unit testing Developer Testing stages Unit testing

Testing Terminology System testing Types of errors Function testing Structure

can be done in linear time? Yuri Gurevich Tallinn, April 29, 2014 Agenda 1. What linear time?

Property-Based Testing Matt Bachmann @mattbachmann Testing is Important Testing is Important

Software Testing Overview What is software testing? General testing criteria Testing

Software testing Software Testing Introduction Testing levels Automated testing Principles and

1. Test page This page is for testing. This page is for testing. This page is for testing.

Odds Algorithm An Online Algorithm Group Fibonado 20. Dec 2016 Group Fibonado Odds Algorithm

Graphics 2014 Linear Algebra II Linear Maps &amp; Matrices Linear Maps &amp; Matrices CORE

Cycle time: 40 sec Cycle time: 12 sec Cycle time: 0.75 sec Cycle time: 1.25 sec Cycle time: 5

Overview Objective Types of testing ECE 553: TESTING AND Verification testing

Object Oriented Testing Chapter 23 1 OO Testing Class Testing: Equivalent to unit testing

Software Testing Software testing 1 V model Software testing 2 Program testing goals To

Algorithm Analysis October 12, 2016 CMPE 250 Algorithm Analysis October 12, 2016 1 / 66

Combinatorial approaches to RNA folding Part III: Stocastic algorithms via language theory

CS 591 S2Formal Language Theory: Integrating Experimentation and ProofFall 2019 Instructor

JUST THE MATHS SLIDES NUMBER 13.4 INTEGRATION APPLICATIONS 4 (Lengths of curves) by

Superposition with Lambdas Alexander Bentkamp Jasmin Blanchette Sophie Tourret Petar Vukmirovi

MA/CSSE 474 Theory of Computation Decision Problems, Continued DFSMs Your Questions? HW1

Class 26: review for final exam solutions, 18.05, Spring 2014 Four ways to fill each slot: 4 5 .

Secure Two-Party Distribution Testing Alexandr Andoni Tal Malkin Negev Shekel Nosatzki

Explaining the Postgres Query Optimizer B RUCE M OMJIAN The optimizer is the &quot;brain&quot; of

Graphics 2014 Linear Algebra II Linear Maps & Matrices Linear Maps & Matrices CORE

Explaining the Postgres Query Optimizer B RUCE M OMJIAN The optimizer is the "brain" of