• Bernoulli
  • Volume 11, Number 6 (2005), 1093-1113.

Spectral decomposition of score functions in linkage analysis

Ola Hössjer

Full-text: Open access


We consider stochastic processes occurring in nonparametric linkage analysis for mapping disease susceptibility genes in the human genome. Under the null hypothesis that no disease gene is located in the chromosomal region of interest, we prove that the linkage process Z converges weakly to a mixture of Ornstein-Uhlenbeck processes as the number of families N tends to infinity. Under a sequence of contiguous alternatives, we prove weak convergence towards the same Gaussian process with a deterministic non-zero mean function added to it. The results are applied to power calculations for chromosome- and genome-wide scans, and are valid for arbitrary family structures. Our main tool is the inheritance vector process v, which is a stationary and continuous-time Markov process with state space the set of binary vectors w of given length. Certain score functions are expanded as a linear combination of an orthonormal system of basis functions which are eigenvectors of the intensity matrix of v.

Article information

Bernoulli, Volume 11, Number 6 (2005), 1093-1113.

First available in Project Euclid: 16 January 2006

Permanent link to this document

Digital Object Identifier

Mathematical Reviews number (MathSciNet)

Zentralblatt MATH identifier

continuous-time Markov process inheritance vectors invariance principle linkage analysis Ornstein-Uhlenbeck process spectral decomposition


Hössjer, Ola. Spectral decomposition of score functions in linkage analysis. Bernoulli 11 (2005), no. 6, 1093--1113. doi:10.3150/bj/1137421641.

Export citation


  • [1] Billingsley, P. (1968). Convergence of Probability Measures. New York: Wiley.
  • [2] Commenges, D. (1994) Robust genetic linkage analysis based on a score test of homogeneity: The weighted pairwise correlation statistic. Genetic Epidemiology, 11, 189-200.
  • [3] Diaconis, P. (1988) Group Representations in Probability and Statistics. Hayward, CA: Institute of Mathematical Statistics.
  • [4] Donnelly, P. (1983) The probability that some related individuals share some section of the genome identical by descent. Theoret. Popul. Biol., 23, 34-64.
  • [5] Dudoit, S. and Speed, T.P. (1999) A score test for linkage using identity by descent data from sibships. Ann. Statist., 27, 943-986.
  • [6] Dupuis, J. and Siegmund, D. (1999) Statistical methods for mapping quantitative trait loci from a dense set of markers. Genetics, 151 373-386.
  • [7] Feingold, E. and Siegmund, D. (1997) Strategies for mapping heterogeneous recessive traits by allelesharing methods. Amer. J. Hum. Genetics, 60, 965-978.
  • [8] Feingold, E., Brown, P.O. and Siegmund, D. (1993) Gaussian models for genetic linkage analysis using complete high-resolution maps of identity by descent. Amer. J. Hum. Genetics, 53, 234-251.
  • [9] Feingold, E., Song, K.K. and Weeks, D.E. (2000) Comparison of allele-sharing statistics for general pedigrees. Genetic Epidemiology, 19 (Suppl. 1), S92-S98.
  • [10] Haldane, J.B.S. (1919) The combination of linkage values and the calculation of distances between loci of unlinked factors. J. Genetics, 8, 299-309.
  • [11] Hössjer, O. (2003a) Asymptotic estimation theory of multipoint linkage analysis under perfect marker information. Ann. Statist, 31, 1075-1109.
  • [12] Hössjer, O. (2003b) Assessing accuracy in linkage analysis by means of confidence regions. Genetic Epidemiology, 25, 59-72.
  • [13] Hössjer, O. (2003c) Determining inheritance distributions via stochastic penetrances. J. Amer Statist Assoc., 98, 1035-1051.
  • [14] Hössjer, O. (2004) Information and effective number of meioses in linkage analysis. J. Math. Biol., 50(2), 208-232.
  • [15] Hössjer, O. (2005) Conditional likelihood score functions for mixed models in linkage analysis. Biostatistics, 6, 313-332. Supplementary material at
  • [16] Kong, A. and Wright, F. (1994) Asymptotic theory for gene mapping. Proc. Natl. Acad. Sci. USA, 91, 9705-9709.
  • [17] Kruglyak, L. and Lander, E.S. (1995) High-resolution gene mapping of complex traits. Amer. J. Hum. Genetics, 56, 1212-1223.
  • [18] Kruglyak, L. and Lander, E. (1998) Faster multipoint linkage analysis using Fourier transforms. J. Comput. Biol., 5(1), 1-7.
  • [19] Kruglyak, L., Daly, M.J., Reeve-Daly, M.P. and Lander, E.S. (1996) Parametric and nonparametric linkage analysis: A unified multipoint approach. Amer. J. Hum. Genetics, 58, 1347-1363.
  • [20] Lander, E. and Bolstein, D. (1989) Mapping Mendelian factors underlying quantitative traits using RFLP linkage maps. Genetics, 121, 185-199.
  • [21] Lander, E.L. and Kruglyak, L. (1995) Genetic dissection of complex traits: Guidelines for interpreting and reporting linkage results. Nature Genetics, 11, 241-247.
  • [22] McPeek, M.S. (1999) Optimal allele-sharing statistics for genetic mapping using affected relatives. Genetic Epidemiology, 16, 225-249.
  • [23] Ott, J. (1999) Analysis of Human Genetic Linkage, 3rd edn. Baltimore, MD: Johns Hopkins University Press.
  • [24] Sengul, H., Weeks, D.E. and Feingold, E. (2001) A survey of affected-sibship statistics for nonparametric linkage analysis. Amer. J. Hum. Genetics, 69, 179-190.
  • [25] Sham, P. (1998) Statistics in Human Genetics. London: Arnold.
  • [26] Sham, P., Zhao, J.H. and Curtis, D. (1997) Optimal weighting scheme for affected sib-pair analysis of sibship data. Ann. Hum. Genetics, 61, 61-69.
  • [27] Tang, H.-K. and Siegmund, D. (2001) Mapping quantitative trait loci in oligogenic models. Biostatistics, 2, 147-162.
  • [28] Whittemore A.S. and Halpern, J. (1994) A class of tests for linkage using affected pedigree members. Biometrics, 50, 118-127.