We review mathematically tractable models for connected networks on random points in the plane, emphasizing the class of proximity graphs, which deserves to be better known to applied probabilists and statisticians. We introduce and motivate a particular statistic R measuring the shortness of routes in a network. We illustrate, in part via Monte Carlo simulation, the trade-off between normalized network length and R in a one-parameter family of proximity graphs. How close this family comes to the optimal trade-off over all possible networks remains an intriguing open question.
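To make the proximity-graph class concrete: the β-skeletons are a standard one-parameter family of proximity graphs, and the Gabriel graph is its β = 1 member. The following brute-force sketch (our own illustrative code, not the authors'; the function name and O(n³) construction are ours) builds the Gabriel graph on random points and totals the network length.

```python
import numpy as np

def gabriel_graph(pts):
    """Brute-force Gabriel graph: points i and j are joined iff no other
    point lies inside the disc whose diameter is the segment (i, j);
    equivalently, for every other k: |ik|^2 + |jk|^2 >= |ij|^2."""
    n = len(pts)
    d2 = ((pts[:, None, :] - pts[None, :, :]) ** 2).sum(-1)  # pairwise squared distances
    edges = set()
    for i in range(n):
        for j in range(i + 1, n):
            if all(d2[i, k] + d2[j, k] >= d2[i, j]
                   for k in range(n) if k not in (i, j)):
                edges.add((i, j))
    return edges

# total network length on 50 uniform random points in the unit square
rng = np.random.default_rng(0)
pts = rng.random((50, 2))
edges = gabriel_graph(pts)
length = sum(float(np.hypot(*(pts[i] - pts[j]))) for i, j in edges)
```

Other members of the family change only the exclusion region tested in the `all(...)` condition, which is what makes a Monte Carlo comparison of length versus route efficiency across the family straightforward.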
The paper is a write-up of a talk developed by the first author during 2007–2009.
Statistical modeling is a powerful tool for developing and testing theories by way of causal explanation, prediction, and description. In many disciplines there is near-exclusive use of statistical modeling for causal explanation, along with the assumption that models with high explanatory power are inherently of high predictive power. Conflation of explanation and prediction is common, yet the distinction must be understood if scientific knowledge is to progress. While this distinction has been recognized in the philosophy of science, the statistical literature lacks a thorough discussion of the many differences that arise when modeling for an explanatory versus a predictive goal. The purpose of this article is to clarify the distinction between explanatory and predictive modeling, to discuss its sources, and to reveal the practical implications of the distinction for each step of the modeling process.
This article discusses the potential of graphics processing units (GPUs) in high-dimensional optimization problems. A single GPU card with hundreds of arithmetic cores can be inserted in a personal computer and dramatically accelerates many statistical algorithms. To exploit these devices fully, optimization algorithms should reduce to multiple parallel tasks, each accessing a limited amount of data. These criteria favor EM and MM algorithms that separate parameters and data. To a lesser extent, block relaxation and coordinate descent/ascent also qualify. We demonstrate the utility of GPUs in nonnegative matrix factorization, PET image reconstruction, and multidimensional scaling. Speedups of 100-fold can easily be attained. Over the next decade, GPUs will fundamentally alter the landscape of computational statistics. It is time for more statisticians to get on board.
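The separability that makes MM algorithms GPU-friendly is visible in the classic Lee–Seung multiplicative updates for nonnegative matrix factorization: every entry of the factors is updated by the same elementwise formula. A CPU NumPy sketch (ours, for illustration; a GPU version would run the identical updates through, e.g., CuPy or JAX):

```python
import numpy as np

def nmf(V, r, iters=200, eps=1e-9, seed=0):
    """Lee–Seung multiplicative updates for V ≈ W @ H under Frobenius loss.
    Each update is elementwise, so the work decomposes into many small
    independent tasks -- the property that lets MM algorithms of this kind
    exploit hundreds of GPU arithmetic cores."""
    rng = np.random.default_rng(seed)
    m, n = V.shape
    W = rng.random((m, r)) + eps
    H = rng.random((r, n)) + eps
    for _ in range(iters):
        H *= (W.T @ V) / (W.T @ W @ H + eps)  # MM step: never increases the loss
        W *= (V @ H.T) / (W @ H @ H.T + eps)
    return W, H
```

The `eps` guard keeps the multiplicative updates well defined when a denominator entry underflows to zero.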
Non-Gaussian outcomes are often modeled using members of the so-called exponential family. Well-known members are the Bernoulli model for binary data, leading to logistic regression, and the Poisson model for count data, leading to Poisson regression. Two of the main reasons for extending this family are (1) the occurrence of overdispersion, meaning that the variability in the data is not adequately described by the models, which often exhibit a prescribed mean–variance link, and (2) the accommodation of hierarchical structure in the data, stemming from clustering, which may in turn result from repeatedly measuring the outcome, from sampling several members of the same family, and so on. The first issue is dealt with through a variety of overdispersion models, such as the beta-binomial model for grouped binary data and the negative-binomial model for counts. Clustering is often accommodated through the inclusion of random subject-specific effects. Conventionally, though not always, such random effects are assumed to be normally distributed. Although both phenomena may occur simultaneously, models combining them are uncommon. This paper proposes a broad class of generalized linear models accommodating overdispersion and clustering through two separate sets of random effects. We place particular emphasis on so-called conjugate random effects at the level of the mean for the first aspect and normal random effects embedded within the linear predictor for the second aspect, even though our family is more general. The binary, count, and time-to-event cases are given particular emphasis. Apart from model formulation, we present an overview of estimation methods, and then settle for maximum likelihood estimation with analytic–numerical integration. Implications for the derivation of marginal correlation functions are discussed.
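A toy simulation (our own illustration, not the paper's model or estimator) shows how the two random-effect layers act in the count case: a mean-one gamma multiplier on the Poisson mean induces overdispersion, while a normal effect shared within each subject's linear predictor induces clustering.

```python
import numpy as np

rng = np.random.default_rng(42)

def simulate_counts(n_subjects=200, n_reps=5, mu=4.0, alpha=2.0, sigma=0.5):
    """Counts with two random-effect layers (parameter names are ours):
    theta_ij ~ Gamma with mean 1 ("conjugate" effect, creates overdispersion),
    b_i ~ Normal(0, sigma) shared within subject (creates clustering)."""
    b = rng.normal(0.0, sigma, size=n_subjects)                        # subject effect
    theta = rng.gamma(alpha, 1.0 / alpha, size=(n_subjects, n_reps))   # mean-1 gamma
    lam = mu * np.exp(b)[:, None] * theta
    return rng.poisson(lam)

y = simulate_counts()
# For a pure Poisson model the marginal mean and variance would be equal;
# here the gamma and normal layers push the variance above the mean.
print(y.mean(), y.var())
```

Integrating the gamma layer out analytically gives a negative-binomial marginal, which is the sense in which the conjugate effect generalizes the classical overdispersion models mentioned above.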
The methodology is applied to data from a study of epileptic seizures, a clinical trial in toenail infection (onychomycosis), and survival data in children with asthma.
The Bayesian measure of sample information about the parameter, known as Lindley’s measure, is widely used in problems such as the development of prior distributions, models for likelihood functions, and optimal designs. The predictive information is defined similarly and used for model selection and optimal designs, though to a lesser extent. The parameter and predictive information measures are proper utility functions and have also been used in combination. Yet the relationship between the two measures, and the effects of conditional dependence between the observable quantities on the Bayesian information measures, remain unexplored. We address both issues. The relationship between the two information measures is explored through the information provided by the sample about the parameter and prediction jointly. The role of dependence is explored along with the interplay between the information measures, prior, and sampling design. For a conditionally independent sequence of observable quantities, decompositions of the joint information characterize Lindley’s measure as the sample information about the parameter and prediction jointly, and the predictive information as part of it. For the conditionally dependent case, the joint information about parameter and prediction exceeds Lindley’s measure by an amount due to the dependence. More specific results are shown for normal linear models and a broad subfamily of the exponential family. Conditionally independent samples provide relatively little information for prediction, and the gap between the parameter and predictive information measures grows rapidly with the sample size. Three dependence structures are studied: the intraclass (IC) and serially correlated (SC) normal models, and order statistics. For the IC and SC models, the information about the mean parameter decreases and the predictive information increases with the correlation, but the joint information is not monotone and has a unique minimum.
Compensation of the loss of parameter information due to dependence requires larger samples. For the order statistics, the joint information exceeds Lindley’s measure by an amount which does not depend on the prior or the model for the data, but it is not monotone in the sample size and has a unique maximum.
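The relationship the abstract describes can be stated compactly (in our notation, not necessarily the authors') via the mutual-information chain rule:

```latex
% Lindley's measure: the mutual information between the sample X and parameter Theta
\[
  I(X;\Theta) \;=\; \mathbb{E}\!\left[\log\frac{p(\theta \mid x)}{p(\theta)}\right].
\]
% For a future observable Y, the joint information obeys the chain rule
\[
  I\bigl(X;(\Theta,Y)\bigr) \;=\; I(X;\Theta) \;+\; I(X;Y \mid \Theta).
\]
% If Y is conditionally independent of X given Theta, the last term vanishes and
% the joint information reduces to Lindley's measure; under conditional
% dependence, the joint information exceeds Lindley's measure by I(X;Y | Theta) > 0.
```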
We consider situations where data have been collected such that the sampling depends on the outcome of interest and possibly further covariates, as for instance in case-control studies. Graphical models represent assumptions about the conditional independencies among the variables. By including a node for the sampling indicator, assumptions about sampling processes can be made explicit. We demonstrate how to read off such graphs whether consistent estimation of the association between exposure and outcome is possible. Moreover, we give sufficient graphical conditions for testing and estimating the causal effect of exposure on outcome. The practical use is illustrated with a number of examples.
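Reading conditional independencies off such a graph relies on d-separation. A minimal, self-contained sketch of the standard moralization criterion (our own illustrative code; the paper's graphical conditions for consistent estimation are more specific):

```python
from itertools import combinations

def d_separated(dag, xs, ys, zs):
    """d-separation via the moralization criterion:
    (1) restrict to the ancestral subgraph of xs | ys | zs,
    (2) moralize (join co-parents of each node, drop edge directions),
    (3) xs and ys are d-separated by zs iff deleting zs disconnects them.
    `dag` maps each node to the set of its parents."""
    # 1. ancestral set (the query nodes and all their ancestors)
    keep, stack = set(), list(xs | ys | zs)
    while stack:
        v = stack.pop()
        if v not in keep:
            keep.add(v)
            stack.extend(dag.get(v, set()))
    # 2. moral (undirected) graph on the ancestral set
    adj = {v: set() for v in keep}
    for v in keep:
        pa = dag.get(v, set()) & keep
        for p in pa:
            adj[v].add(p); adj[p].add(v)
        for p, q in combinations(pa, 2):   # "marry" co-parents
            adj[p].add(q); adj[q].add(p)
    # 3. reachability from xs to ys avoiding zs
    seen, stack = set(), [v for v in xs if v not in zs]
    while stack:
        v = stack.pop()
        if v in seen:
            continue
        seen.add(v)
        if v in ys:
            return False
        stack.extend(w for w in adj[v] if w not in zs)
    return True
```

With a sampling indicator S as a common child (collider) of exposure X and outcome Y, X and Y are d-separated marginally but not once S is conditioned on, which is the graphical signature of selection bias in outcome-dependent sampling.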
A two-groups mixed-effects model for the comparison of (normalized) microarray data from two treatment groups is considered. Most competing parametric methods that have appeared in the literature are obtained as special cases of, or by minor modification of, the proposed model. Approximate maximum likelihood fitting is accomplished via a fast and scalable algorithm, which we call LEMMA (Laplace approximated EM Microarray Analysis). The posterior odds of treatment × gene interactions, derived from the model, involve shrinkage estimates of both the interactions and the gene-specific error variances. Genes are classified as being associated with treatment based on the posterior odds and the local false discovery rate (f.d.r.) with a fixed cutoff. Our model-based approach also allows one to declare the non-null status of a gene by controlling the false discovery rate (FDR). A detailed simulation study shows that the approach outperforms well-known competitors. We also apply the proposed methodology to two previously analyzed microarray examples. Extensions of the proposed method to paired treatments and multiple treatments are also discussed.
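For readers unfamiliar with the generic FDR-control step, here is the standard Benjamini–Hochberg step-up procedure (the paper's own machinery is model-based via posterior odds; this sketch is just the classical frequentist counterpart, in our own code):

```python
import numpy as np

def benjamini_hochberg(pvals, q=0.05):
    """Benjamini–Hochberg step-up procedure: with sorted p-values p_(1) <= ...
    <= p_(m), reject the k smallest, where k is the largest i such that
    p_(i) <= i * q / m. Controls the FDR at level q under independence."""
    p = np.asarray(pvals)
    m = len(p)
    order = np.argsort(p)
    thresh = q * np.arange(1, m + 1) / m
    below = p[order] <= thresh
    reject = np.zeros(m, dtype=bool)
    if below.any():
        k = np.max(np.nonzero(below)[0])   # largest index meeting the bound
        reject[order[: k + 1]] = True
    return reject
```

Note the "step-up" character: a p-value that would fail its own threshold can still be rejected if a later (larger) sorted p-value meets its bound.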
George C. Tiao was born in London in 1933. After graduating with a B.A. in Economics from National Taiwan University in 1955, he went to the US to obtain an M.B.A. from New York University in 1958 and a Ph.D. in Economics from the University of Wisconsin, Madison in 1962. From 1962 to 1982 he was, successively, Assistant Professor, Associate Professor, Professor, and Bascom Professor of Statistics and Business at the University of Wisconsin, Madison, and from 1973 to 1975 he was Chairman of the Department of Statistics. He moved to the Graduate School of Business at the University of Chicago in 1982 and is the W. Allen Wallis Professor of Econometrics and Statistics (emeritus).
George Tiao has played a leading role in the development of Bayesian statistics, time series analysis, and environmental statistics. He is co-author, with G. E. P. Box, of Bayesian Inference in Statistical Analysis, and is the developer of a model-based approach to seasonal adjustment (with S. C. Hillmer), of outlier analysis in time series (with I. Chang), and of new ways of vector ARMA model building (with R. S. Tsay). He is the author, co-author, or co-editor of 7 books and over 120 articles in refereed econometric, environmental, and statistical journals, and has been thesis advisor to over 25 students. He is a leading figure in the development of statistics in Taiwan and China, the Founding President of the International Chinese Statistical Association (1987–1988), and the Founding Chair Editor of the journal Statistica Sinica (1988–1993). He played a leading role over the 20-year period 1979–1999 in the organization of the annual NBER/NSF Time Series Workshop, and he was a founding member of the annual conference “Making Statistics More Effective in Schools of Business” (1986–2006). Among other honors, he was elected ASA Fellow (1973), IMS Fellow (1974), member of Academia Sinica, Taiwan (1976), and member of the ISI (1980), and was the recipient of the Distinguished Service Medal, DGBAS, Taiwan (1993), the Julius Shiskin Award (2001), the Wilks Memorial Medal Award (2001), and the Statistician of the Year Award of the ASA Chicago Chapter (2005). He received honorary doctorates in 2003 from the Universidad Carlos III de Madrid and National Tsinghua University, Hsinchu, Taiwan.