Abstract
Rapid development in genomics in recent years has allowed the simultaneous measurement of the expression levels of thousands of genes using DNA microarrays. This has offered tremendous potential for growth in our understanding of the pathophysiology of many diseases. When microarray studies also contain information about an outcome variable such as time to an event or death, one of the goals of an investigator is to understand how the expression levels of genes (covariates) relate to the time-to-event (referred to as survival time) in the course of a disease. In this article, we examine the problem of predicting the survival probability of patients when the number of covariates exceeds the number of observations, a setting typical of microarray gene expression data. This is an ill-conditioned problem further compounded by the presence of possibly censored survival times. We propose a model that combines the partial least squares approach for dimensionality reduction with the accelerated failure time model, a widely used log-linear model for linking censored survival time to covariates. We develop parametric methods to account for censoring as well as for predicting patient survival probabilities. We illustrate the applicability of our methods using cancer microarray data and explore the biological relevance of our results using pathway analysis. Finally, we evaluate the performance of our methods using extensive simulation studies.
| Original language | English |
|---|---|
| Title of host publication | 10th IEEE International Conference on Bioinformatics and Bioengineering 2010, BIBE 2010 |
| Pages | 26-31 |
| Number of pages | 6 |
| Volume | 2010 |
| Edition | 5521718 |
| DOIs | |
| State | Published - 2010 |
| Event | 10th IEEE International Conference on Bioinformatics and Bioengineering, BIBE-2010 - Philadelphia, PA, United States Duration: May 31 2010 → Jun 3 2010 |
Publication series
| Name | 10th IEEE International Conference on Bioinformatics and Bioengineering 2010, BIBE 2010 |
|---|
Conference
| Conference | 10th IEEE International Conference on Bioinformatics and Bioengineering, BIBE-2010 |
|---|---|
| Country/Territory | United States |
| City | Philadelphia, PA |
| Period | 05/31/10 → 06/3/10 |
UN SDGs
This output contributes to the following UN Sustainable Development Goals (SDGs)
-
SDG 3 Good Health and Well-being
Keywords
- Accelerated failure time
- Censored survival data
- Gene expression
- High-dimensional data
- Partial least squares
Fingerprint
Dive into the research topics of 'A supervised approach for predicting patient survival with gene expression data'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver