Open Access
2023 A central limit theorem for the length of the longest common subsequences in random words
Christian Houdré, Ümit Işlak
Author Affiliations +
Electron. J. Probab. 28: 1-24 (2023). DOI: 10.1214/22-EJP894

Abstract

Let (Xi)i1 and (Yi)i1 be two independent sequences of independent identically distributed (iid) random variables taking their values in a common finite alphabet and having the same law. Let LCn be the length of the longest common subsequences of the two random words X1Xn and Y1Yn. Under a lower bound assumption on the order of its variance, LCn is shown to satisfy a central limit theorem. This is in contrast to the limiting distribution of the length of the longest common subsequences in two independent uniform random permutations of {1,,n}, which is shown to be the Tracy-Widom distribution.

Funding Statement

CH’s research was supported in part by a Simons Foundation Fellowship, grant #267336 and the grants #246283 and #524678 from the Simons Foundation.

Acknowledgments

Many thanks to the LPMA of the Université Pierre et Marie Curie, to CIMAT and to Bogazici University for their hospitality while part of this research was carried out. ÜI is grateful to L. Goldstein for introducing him to Stein’s method and, in particular, to Chatterjee’s normal approximation results. Also, many thanks to the LPMA of the Université Pierre et Marie Curie for its hospitality while part of this research was carried out as well as to the School of Mathematics of the Georgia Institute of Technology while being a Hale postdoctoral Fellow.

Both authors would like to thank the French Scientific Attachés Fabien Agenès (Los Angeles) and Nicolas Florsch (Atlanta) for their consular help.Without them, this research might not have existed. Lastly many thanks to Ruoting Gong, George Kerchev, Chen Xu and referees for their detailed reading and many comments which have led to numerous improvements on this manuscript.

Citation

Download Citation

Christian Houdré. Ümit Işlak. "A central limit theorem for the length of the longest common subsequences in random words." Electron. J. Probab. 28 1 - 24, 2023. https://doi.org/10.1214/22-EJP894

Information

Received: 19 May 2022; Accepted: 16 December 2022; Published: 2023
First available in Project Euclid: 4 January 2023

MathSciNet: MR4529088
MathSciNet: MR4529085
Digital Object Identifier: 10.1214/22-EJP894

Subjects:
Primary: 05A05 , 60C05 , 60F05 , 60F10

Keywords: central limit theorem , edit/Levenshtein distance , Last passage percolation , longest common subsequences , optimal alignments , Random permutations , random words , Stein’s method , supersequences , Tracy-Widom distribution , Ulam’s problem

Vol.28 • 2023
Back to Top