Efficient validation and construction of border arrays and validation of string matching automata

We present an on-line linear time and space algorithm to check if an integer array f is the border array of at least one string w built on a bounded or unbounded size alphabet Σ. First of all, we show a bijection between the border array of a string w and the skeleton of the DFA recognizing Σ*ω, cal...

Full description

Saved in:
Bibliographic Details
Published inRAIRO. Informatique théorique et applications Vol. 43; no. 2; pp. 281 - 297
Main Authors Duval, Jean-Pierre, Lecroq, Thierry, Lefebvre, Arnaud
Format Journal Article
LanguageEnglish
Published Les Ulis EDP Sciences 01.04.2009
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:We present an on-line linear time and space algorithm to check if an integer array f is the border array of at least one string w built on a bounded or unbounded size alphabet Σ. First of all, we show a bijection between the border array of a string w and the skeleton of the DFA recognizing Σ*ω, called a string matching automaton (SMA). Different strings can have the same border array but the originality of the presented method is that the correspondence between a border array and a skeleton of SMA is independent from the underlying strings. This enables to design algorithms for validating and generating border arrays that outperform existing ones. The validating algorithm lowers the delay (maximal number of comparisons on one element of the array) from O(|w|) to 1 + min{|Σ|,1 + log2|ω|} compared to existing algorithms. We then give results on the numbers of distinct border arrays depending on the alphabet size. We also present an algorithm that checks if a given directed unlabeled graph G is the skeleton of a SMA on an alphabet of size s in linear time. Along the process the algorithm can build one string w for which G is the SMA skeleton.
Bibliography:istex:A246B1DE6E89EAB4E1FA94F5BE37596738C7F6EA
PII:S0988375408000301
publisher-ID:ita07065
ark:/67375/80W-R2N2F5ML-9
ObjectType-Article-2
SourceType-Scholarly Journals-1
ObjectType-Feature-1
content type line 23
ISSN:0988-3754
1290-385X
DOI:10.1051/ita:2008030