Abstract
DNA microarray technology has made it possible to simultaneously measure the expression levels of tens of thousands of genes. In this paper, we address the problem of class discovery which involves unsupervised clustering of tissue samples based on corresponding gene expression data. Non-negative matrix factorization (NMF) by multiplicative updates algorithm is a powerful method for decomposing the gene expression matrix V into two matrices with nonnegative entries, V - WH, where each column of W defines a metagene and each column of H represents the metagene expression pattern of the corresponding sample. We describe a method for class discovery and dimensionality reduction using NMF, based on various measures of distance between two non-negative matrices. Our approach provides a unique framework for class discovery. We demonstrate the applicability of this method using cancer microarray as well as simulated data.
Original language | English |
---|---|
Pages (from-to) | 457-467 |
Number of pages | 11 |
Journal | American Journal of Mathematical and Management Sciences |
Volume | 28 |
Issue number | 3-4 |
DOIs | |
State | Published - 2008 |
Keywords
- Class discovery
- Dimensionality reduction
- Gene expression matrix
- Kullback-Leibler divergence
- Nonnegative matrix factorization
- Renyi's divergence