Open Access
2016 Closeness to the diagonal for longest common subsequences in random words
Christian Houdré, Heinrich Matzinger
Electron. Commun. Probab. 21: 1-19 (2016). DOI: 10.1214/16-ECP4029

Abstract

The nature of the alignment with gaps corresponding to a longest common subsequence (LCS) of two independent iid random sequences drawn from a finite alphabet is investigated. It is shown that such an optimal alignment typically matches pieces of similar, short length. This is of importance in understanding the structure of optimal alignments of two sequences. Moreover, it is also shown that any property common to two subsequences typically holds in most parts of the optimal alignment whenever this same property holds, with high probability, for strings of similar short length. Our results should, in particular, prove useful for simulations, since they imply that the rescaled two-dimensional representation of an LCS gets uniformly close to the diagonal as the length of the sequences grows without bound.
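As an illustration of the kind of simulation the abstract alludes to, the following is a minimal sketch, not the authors' method. It computes one optimal LCS alignment of two random words by standard dynamic programming and measures how far the matched index pairs stray from the diagonal after dividing by the word length. The function names (`lcs_alignment`, `max_diagonal_deviation`, `simulate`), the alphabet, the choice of the uniform distribution on letters, and the rescaling by `n` are all assumptions made for this example. When several optimal alignments exist, the traceback returns only one of them.

```python
import random


def lcs_alignment(x, y):
    """Return one optimal LCS alignment as a list of matched index pairs (i, j)."""
    n, m = len(x), len(y)
    # table[i][j] holds the LCS length of the suffixes x[i:] and y[j:].
    table = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(n - 1, -1, -1):
        for j in range(m - 1, -1, -1):
            if x[i] == y[j]:
                table[i][j] = table[i + 1][j + 1] + 1
            else:
                table[i][j] = max(table[i + 1][j], table[i][j + 1])
    # Trace forward through the table, recording each matched pair.
    pairs = []
    i = j = 0
    while i < n and j < m:
        if x[i] == y[j]:
            pairs.append((i, j))
            i += 1
            j += 1
        elif table[i + 1][j] >= table[i][j + 1]:
            i += 1
        else:
            j += 1
    return pairs


def max_diagonal_deviation(pairs, n):
    """Largest |i - j| / n over the matched pairs (0.0 if there are none)."""
    if not pairs:
        return 0.0
    return max(abs(i - j) for i, j in pairs) / n


def simulate(n, alphabet="ab", seed=0):
    """Draw two independent uniform iid words of length n; return the deviation."""
    rng = random.Random(seed)
    x = rng.choices(alphabet, k=n)
    y = rng.choices(alphabet, k=n)
    return max_diagonal_deviation(lcs_alignment(x, y), n)


if __name__ == "__main__":
    # Hypothetical experiment: print the rescaled deviation for growing n.
    for n in (50, 200, 800):
        print(n, simulate(n))
```

The table costs quadratic time and memory in `n`, so this sketch is only suitable for moderate word lengths. A single run per length is also noisy, and averaging over several seeds would give a clearer picture of how the deviation behaves as `n` grows.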

Citation

Christian Houdré, Heinrich Matzinger. "Closeness to the diagonal for longest common subsequences in random words." Electron. Commun. Probab. 21: 1-19, 2016. https://doi.org/10.1214/16-ECP4029

Information

Received: 29 December 2014; Accepted: 20 April 2016; Published: 2016
First available in Project Euclid: 27 April 2016

zbMATH: 1338.05004
MathSciNet: MR3492931
Digital Object Identifier: 10.1214/16-ECP4029

Subjects:
Primary: 05A05, 60C05, 60F10

Keywords: edit/Levenshtein distance, last passage percolation, longest common subsequences, optimal alignments
