Limits...
Identifying significant temporal variation in time course microarray data without replicates.

Billups SC, Neville MC, Rudolph M, Porter W, Schedin P - BMC Bioinformatics (2009)

Bottom Line: An important component of time course microarray studies is the identification of genes that demonstrate significant time-dependent variation in their expression levels.Until recently, available methods for performing such significance tests required replicates of individual time points.These results were confirmed in follow-up laboratory experiments.

View Article: PubMed Central - HTML - PubMed

Affiliation: Department of Mathematical and Statistical Sciences, University of Colorado, Denver, CO, USA. Stephen.Billups@ucdenver.edu

ABSTRACT

Background: An important component of time course microarray studies is the identification of genes that demonstrate significant time-dependent variation in their expression levels. Until recently, available methods for performing such significance tests required replicates of individual time points. This paper describes a replicate-free method that was developed as part of a study of the estrous cycle in the rat mammary gland in which no replicate data was collected.

Results: A temporal test statistic is proposed that is based on the degree to which data are smoothed when fit by a spline function. An algorithm is presented that uses this test statistic together with a false discovery rate method to identify genes whose expression profiles exhibit significant temporal variation. The algorithm is tested on simulated data, and is compared with another recently published replicate-free method. The simulated data consists both of genes with known temporal dependencies, and genes from a distribution. The proposed algorithm identifies a larger percentage of the time-dependent genes for a given false discovery rate. Use of the algorithm in a study of the estrous cycle in the rat mammary gland resulted in the identification of genes exhibiting distinct circadian variation. These results were confirmed in follow-up laboratory experiments.

Conclusion: The proposed algorithm provides a new approach for identifying expression profiles with significant temporal variation without relying on replicates. When compared with a recently published algorithm on simulated data, the proposed algorithm appears to identify a larger percentage of time-dependent genes for a given false discovery rate. The development of the algorithm was instrumental in revealing the presence of circadian variation in the virgin rat mammary gland during the estrous cycle.

Show MeSH
True discoveries as function of fdr rates.
© Copyright Policy - open-access
Related In: Results  -  Collection

License
getmorefigures.php?uid=PMC2682797&req=5

Figure 4: True discoveries as function of fdr rates.

Mentions: The second set of tests analyzed 8 simulated data sets. In each data set, half of the genes were generated with time-dependent variation, and half were generated from the distribution. For each method, we determined for each gene the smallest false discovery rate that would result in that gene being selected. By sorting the genes according to these threshold rates, we calculated the number of true discoveries and the number of false discoveries that would result from every possible fdr rate. The results are shown in Figure 4.


Identifying significant temporal variation in time course microarray data without replicates.

Billups SC, Neville MC, Rudolph M, Porter W, Schedin P - BMC Bioinformatics (2009)

True discoveries as function of fdr rates.
© Copyright Policy - open-access
Related In: Results  -  Collection

License
Show All Figures
getmorefigures.php?uid=PMC2682797&req=5

Figure 4: True discoveries as function of fdr rates.
Mentions: The second set of tests analyzed 8 simulated data sets. In each data set, half of the genes were generated with time-dependent variation, and half were generated from the distribution. For each method, we determined for each gene the smallest false discovery rate that would result in that gene being selected. By sorting the genes according to these threshold rates, we calculated the number of true discoveries and the number of false discoveries that would result from every possible fdr rate. The results are shown in Figure 4.

Bottom Line: An important component of time course microarray studies is the identification of genes that demonstrate significant time-dependent variation in their expression levels.Until recently, available methods for performing such significance tests required replicates of individual time points.These results were confirmed in follow-up laboratory experiments.

View Article: PubMed Central - HTML - PubMed

Affiliation: Department of Mathematical and Statistical Sciences, University of Colorado, Denver, CO, USA. Stephen.Billups@ucdenver.edu

ABSTRACT

Background: An important component of time course microarray studies is the identification of genes that demonstrate significant time-dependent variation in their expression levels. Until recently, available methods for performing such significance tests required replicates of individual time points. This paper describes a replicate-free method that was developed as part of a study of the estrous cycle in the rat mammary gland in which no replicate data was collected.

Results: A temporal test statistic is proposed that is based on the degree to which data are smoothed when fit by a spline function. An algorithm is presented that uses this test statistic together with a false discovery rate method to identify genes whose expression profiles exhibit significant temporal variation. The algorithm is tested on simulated data, and is compared with another recently published replicate-free method. The simulated data consists both of genes with known temporal dependencies, and genes from a distribution. The proposed algorithm identifies a larger percentage of the time-dependent genes for a given false discovery rate. Use of the algorithm in a study of the estrous cycle in the rat mammary gland resulted in the identification of genes exhibiting distinct circadian variation. These results were confirmed in follow-up laboratory experiments.

Conclusion: The proposed algorithm provides a new approach for identifying expression profiles with significant temporal variation without relying on replicates. When compared with a recently published algorithm on simulated data, the proposed algorithm appears to identify a larger percentage of time-dependent genes for a given false discovery rate. The development of the algorithm was instrumental in revealing the presence of circadian variation in the virgin rat mammary gland during the estrous cycle.

Show MeSH