The Annals of Probability
- Ann. Probab.
- Volume 33, Number 6 (2005), 2094-2126.
Power laws for family sizes in a duplication model
Qian, Luscombe and Gerstein [J. Molecular Biol. 313 (2001) 673–681] introduced a model of the diversification of protein folds in a genome that we may formulate as follows. Consider a multitype Yule process starting with one individual in which there are no deaths and each individual gives birth to a new individual at rate 1. When a new individual is born, it has the same type as its parent with probability 1−r and is a new type, different from all previously observed types, with probability r. We refer to individuals with the same type as families and provide an approximation to the joint distribution of family sizes when the population size reaches N. We also show that if 1≪S≪N1−r, then the number of families of size at least S is approximately CNS−1/(1−r), while if N1−r≪S the distribution decays more rapidly than any power.
Ann. Probab., Volume 33, Number 6 (2005), 2094-2126.
First available in Project Euclid: 7 December 2005
Permanent link to this document
Digital Object Identifier
Mathematical Reviews number (MathSciNet)
Zentralblatt MATH identifier
Primary: 60J80: Branching processes (Galton-Watson, birth-and-death, etc.)
Secondary: 60J85: Applications of branching processes [See also 92Dxx] 92D15: Problems related to evolution 92D20: Protein sequences, DNA sequences
Durrett, Rick; Schweinsberg, Jason. Power laws for family sizes in a duplication model. Ann. Probab. 33 (2005), no. 6, 2094--2126. doi:10.1214/009117905000000369. https://projecteuclid.org/euclid.aop/1133965854