TY - GEN
T1 - Building a classifier for integrated microarray datasets through two-stage approach
AU - Yoon, Youngmi
AU - Lee, Jongchan
AU - Park, Sanghyun
PY - 2006
Y1 - 2006
N2 - Since microarray data acquire tens of thousands of gene expression values simultaneously, they could be very useful in identifying the phenotypes of diseases. However, the results of analyzing several microarray datasets which were independently carried out with the same biological objectives, could turn out to be different. One of the main reasons is attributable to the limited number of samples involved in one microarray experiment. In order to increase the classification accuracy, it is desirable to augment the sample size by integrating and maximizing the use of independently-conducted microarray datasets. In this paper, we propose a two-stage approach which firstly integrates individual microarray datasets to overcome the problem caused by limited number of samples, and identifies informative genes, secondly builds a classifier using only the informative genes. The classifier from large samples by integrating independent microarray datasets achieves high accuracy, sensitivity, and specificity on independent test sample dataset.
AB - Since microarray data acquire tens of thousands of gene expression values simultaneously, they could be very useful in identifying the phenotypes of diseases. However, the results of analyzing several microarray datasets which were independently carried out with the same biological objectives, could turn out to be different. One of the main reasons is attributable to the limited number of samples involved in one microarray experiment. In order to increase the classification accuracy, it is desirable to augment the sample size by integrating and maximizing the use of independently-conducted microarray datasets. In this paper, we propose a two-stage approach which firstly integrates individual microarray datasets to overcome the problem caused by limited number of samples, and identifies informative genes, secondly builds a classifier using only the informative genes. The classifier from large samples by integrating independent microarray datasets achieves high accuracy, sensitivity, and specificity on independent test sample dataset.
UR - http://www.scopus.com/inward/record.url?scp=34547441268&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=34547441268&partnerID=8YFLogxK
U2 - 10.1109/BIBE.2006.253321
DO - 10.1109/BIBE.2006.253321
M3 - Conference contribution
AN - SCOPUS:34547441268
SN - 0769527272
SN - 9780769527277
T3 - Proceedings - Sixth IEEE Symposium on BioInformatics and BioEngineering, BIBE 2006
SP - 94
EP - 102
BT - Proceedings - Sixth IEEE Symposium on BioInformatics and BioEngineering, BIBE 2006
T2 - 6th IEEE Symposium on BioInformatics and BioEngineering, BIBE 2006
Y2 - 16 October 2006 through 18 October 2006
ER -