The Annals of Applied Statistics
- Ann. Appl. Stat.
- Volume 8, Number 2 (2014), 777-800.
Hypothesis setting and order statistic for robust genomic meta-analysis
Meta-analysis techniques have been widely developed and applied in genomic applications, especially for combining multiple transcriptomic studies. In this paper we propose an order statistic of $p$-values ($r$th ordered $p$-value, rOP) across combined studies as the test statistic. We illustrate different hypothesis settings that detect gene markers differentially expressed (DE) “in all studies,” “in the majority of studies” or “in one or more studies,” and specify rOP as a suitable method for detecting DE genes “in the majority of studies.” We develop methods to estimate the parameter $r$ in rOP for real applications. Statistical properties such as its asymptotic behavior and a one-sided testing correction for detecting markers of concordant expression changes are explored. Power calculation and simulation show better performance of rOP compared to classical Fisher’s method, Stouffer’s method, minimum $p$-value method and maximum $p$-value method under the focused hypothesis setting. Theoretically, rOP is found connected to the naïve vote counting method and can be viewed as a generalized form of vote counting with better statistical properties. The method is applied to three microarray meta-analysis examples including major depressive disorder, brain cancer and diabetes. The results demonstrate rOP as a more generalizable, robust and sensitive statistical framework to detect disease-related markers.
Ann. Appl. Stat., Volume 8, Number 2 (2014), 777-800.
First available in Project Euclid: 1 July 2014
Permanent link to this document
Digital Object Identifier
Mathematical Reviews number (MathSciNet)
Zentralblatt MATH identifier
Song, Chi; Tseng, George C. Hypothesis setting and order statistic for robust genomic meta-analysis. Ann. Appl. Stat. 8 (2014), no. 2, 777--800. doi:10.1214/13-AOAS683. https://projecteuclid.org/euclid.aoas/1404229514
- Supplementary material A: Supplement Text. Details of one-sided test modification to avoid discordant effect sizes.
- Supplementary material B: Supplement Theorems 1 and 2. Theorem 1—Asymptotic property of vote counting as $K\rightarrow\infty$. Theorem 2—Asymptotic property of rOP as $K\rightarrow\infty$.
- Supplementary material C: Supplement Tables 1 and 2. Table 1—Detail information of combined data sets. Table 2—FDRs for simulation analysis without correlated genes.
- Supplementary material D: Supplement Figures 1 to 7. Figure 1—Results of brain cancer data set using one-sided corrected rOP. Figure 2—Results of MDD data set. Figure 3—Results of diabetes data set. Figure 4—Permutation results of diabetes data set. Figure 5—Results of brain cancer and 1 random MDD data set. Figure 6—Simulation results without correlated genes. Figure 7—Mean rank of different methods for the top $U$ pathways.