Optimal Sample Size for Multiple Testing: The Case of Gene Expression Microarrays

Müller, Peter; Parmigiani, Giovanni; Robert, Christian P.; Rousseau, Judith (2004), Optimal Sample Size for Multiple Testing: The Case of Gene Expression Microarrays, Journal of the American Statistical Association, 99, 468, p. 990-1001. http://dx.doi.org/10.1198/016214504000001646

View/Open
2002-46.ps (1.021Mb)
optimal_muller.PDF (405.0Kb)
Type
Article accepted for publication or published
Date
2004
Journal name
Journal of the American Statistical Association
Volume
99
Number
468
Publisher
American Statistical Association
Pages
990-1001
Publication identifier
http://dx.doi.org/10.1198/016214504000001646
Author(s)
Müller, Peter
Parmigiani, Giovanni
Robert, Christian P.
Rousseau, Judith
Abstract (EN)
We consider the choice of an optimal sample size for multiple-comparison problems. The motivating application is the choice of the number of microarray experiments to be carried out when learning about differential gene expression. However, the approach is valid in any application that involves multiple comparisons in a large number of hypothesis tests. We discuss two decision problems in the context of this setup: the sample size selection and the decision about the multiple comparisons. We adopt a decision-theoretic approach, using loss functions that combine the competing goals of discovering as many differentially expressed genes as possible, while keeping the number of false discoveries manageable. For consistency, we use the same loss function for both decisions. The decision rule that emerges for the multiple-comparison problem takes the exact form of the rules proposed in the recent literature to control the posterior expected false-discovery rate. For the sample size selection, we combine the expected utility argument with an additional sensitivity analysis, reporting the conditional expected utilities and conditioning on assumed levels of the true differential expression. We recognize the resulting diagnostic as a form of statistical power facilitating interpretation and communication. As a sampling model for observed gene expression densities across genes and arrays, we use a variation of a hierarchical gamma/gamma model. But the discussion of the decision problem is independent of the chosen probability model. The approach is valid for any model that includes positive prior probabilities for the null hypotheses in the multiple comparisons and that allows for efficient marginal and posterior simulation, possibly by dependent Markov chain Monte Carlo simulation.
Subjects / Keywords
Genomic data analysis; False discovery rate; Multiple comparison
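
The abstract above states that the multiple-comparison decision rule takes the form of rules that control the posterior expected false-discovery rate. A minimal sketch of one such rule follows, assuming that the marginal posterior probabilities of differential expression (here called `v`) have already been computed from some model. The function name `posterior_fdr_rule`, the bound `alpha`, and the toy probabilities are illustrative assumptions, not taken from the paper, and the sketch does not reproduce the paper's loss functions or its sample size calculation.

```python
def posterior_fdr_rule(v, alpha=0.1):
    """Flag the largest top-ranked set of genes whose posterior expected
    false-discovery rate stays at or below alpha.

    v     : list of posterior probabilities that each gene is differentially
            expressed (assumed to be given; hypothetical input)
    alpha : bound on the posterior expected FDR (assumed value)
    """
    # Rank genes from most to least probably differentially expressed.
    order = sorted(range(len(v)), key=lambda i: v[i], reverse=True)

    flagged = [False] * len(v)
    expected_false = 0.0   # running sum of (1 - v_i) over flagged genes
    expected_fdr = 0.0     # posterior expected FDR of the current flagged set

    for k, i in enumerate(order, start=1):
        expected_false += 1.0 - v[i]
        # The running mean of (1 - v_i) is nondecreasing along the ranking,
        # so we can stop at the first violation of the bound.
        if expected_false / k > alpha:
            break
        flagged[i] = True
        expected_fdr = expected_false / k

    return flagged, expected_fdr


if __name__ == "__main__":
    # Toy posterior probabilities for five genes (made up for illustration).
    v = [0.99, 0.95, 0.60, 0.20, 0.05]
    flagged, fdr = posterior_fdr_rule(v, alpha=0.1)
    print(flagged, fdr)
```

With these toy values the rule flags only the first two genes: adding the third would raise the average posterior null probability of the flagged set above the bound of 0.1.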

Related items

Showing items related by title and author.

  • Identifying the Salient Genes in Microarray Data: A Novel Game Theoretic Model for the Co-Expression Network
    Neog Bora, Papori; Baruah, Vishwa Jyoti; Borkotokey, Surajit; Gogoi, Loyimee; Mahanta, Priyakshi; Sarmah, Ankumon; Kumar, Rajnish; Sun, Min Woo; Moretti, Stefano (2020) Article accepted for publication or published
  • Testing hypotheses via a mixture estimation model
    Kamary, Kaniav; Mengersen, Kerrie; Robert, Christian P.; Rousseau, Judith (2014) Working paper
  • Rethinking the Effective Sample Size
    Elvira, Víctor; Martino, Luca; Robert, Christian (2022) Article accepted for publication or published
  • Hidden Markov models for complex stochastic processes: A case study in electrophysiology.
    Mengersen, Kerrie; Rousseau, Judith; Silburn, Peter; Johnson, Helen; White, Nicole M. (2012) Book chapter
  • Using informative priors in the estimation of mixtures over time with application to aerosol particle size distributions
    Hussein, Tareq; Rousseau, Judith; Alston, Clair; Mengersen, Kerrie; Wraith, Darren (2014) Article accepted for publication or published