TY - GEN
T1 - A machine learning approach to DNA microarray biclustering analysis
AU - Kung, S. Y.
AU - Mak, Man Wai
N1 - Copyright:
Copyright 2011 Elsevier B.V., All rights reserved.
PY - 2005
Y1 - 2005
N2 - Based on well-established machine learning techniques and neural networks, several biclustering algorithms can be developed for DNA microarray analysis. It has been recognized that genes (even though they may belong to the same gene group) may be co-expressed via a diversity of coherence models. One convincing argument is that a gene may participate in multiple pathways that may or may not be co-active under all conditions. It is biologically more meaningful to cluster both genes and conditions in gene expression data - leading to the so-called biclustering analysis. In addition, we have developed a set of systematic preprocessing methods to effectively comply with various coherence models. This paper will show that the proposed framework enjoys a vital advantage of ease of visualization and analysis. Because a gene may follow more than one coherence models, a multivariate biclustering analysis based on fusion of scores derived from different preprocessing methods appears to be very promising. This is evidenced by our simulation study. In summary, this paper shows that machine learning techniques offers a viable approach to identifying and classifying biologically relevant groups in genes and conditions.
AB - Based on well-established machine learning techniques and neural networks, several biclustering algorithms can be developed for DNA microarray analysis. It has been recognized that genes (even though they may belong to the same gene group) may be co-expressed via a diversity of coherence models. One convincing argument is that a gene may participate in multiple pathways that may or may not be co-active under all conditions. It is biologically more meaningful to cluster both genes and conditions in gene expression data - leading to the so-called biclustering analysis. In addition, we have developed a set of systematic preprocessing methods to effectively comply with various coherence models. This paper will show that the proposed framework enjoys a vital advantage of ease of visualization and analysis. Because a gene may follow more than one coherence models, a multivariate biclustering analysis based on fusion of scores derived from different preprocessing methods appears to be very promising. This is evidenced by our simulation study. In summary, this paper shows that machine learning techniques offers a viable approach to identifying and classifying biologically relevant groups in genes and conditions.
UR - http://www.scopus.com/inward/record.url?scp=33749057720&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=33749057720&partnerID=8YFLogxK
U2 - 10.1109/MLSP.2005.1532936
DO - 10.1109/MLSP.2005.1532936
M3 - Conference contribution
AN - SCOPUS:33749057720
SN - 0780395174
SN - 9780780395176
T3 - 2005 IEEE Workshop on Machine Learning for Signal Processing
SP - 399
EP - 404
BT - 2005 IEEE Workshop on Machine Learning for Signal Processing
T2 - 2005 IEEE Workshop on Machine Learning for Signal Processing
Y2 - 28 September 2005 through 30 September 2005
ER -