TY - GEN
T1 - When discriminative K-means meets Grassmann manifold
T2 - 3rd IEEE EMBS International Conference on Biomedical and Health Informatics, BHI 2016
AU - Li, Danping
AU - Wang, Lei
AU - Xue, Zhong
AU - Wong, Stephen T.C.
N1 - Funding Information:
This work is supported in part by National Natural Science Foundation of China (Grant No. 61203137, 61401328), Natural Science Foundation of Shaanxi Province (Grant No. 2014JQ8306, 2015JM6279), the Fundamental Research Funds for the Central Universities (Grant No. 72135832, 72135942), and a John S. Dunn Research Foundation grant.
Publisher Copyright:
© 2016 IEEE.
PY - 2016/4/18
Y1 - 2016/4/18
N2 - Understanding the role of genetics in diseases is a challenging process that has multiple applications within functional genomics and precision medicine. In this paper, we present a general clustering method to identify disease genes under a multi-view setting. First, by incorporating the graph Laplacian of spectral clustering (SC) into the discriminative K-means, we obtain a single-view subspace representation, which is endowed with both discriminant power and geometrical structure information of that data layer. Then, integrating these individual subspaces together on the Grassmann manifold, we can further find a unified low-dimensional representation under the multi-view SC framework. The proposed two-stage method generalizes the single-view discriminative K-means and the multi-view Grassmann clustering, and can directly handle the case where both attribute-based data and interaction-based networks are available, which is extremely useful in biological research. As a case study of disease gene identification, we apply this method to a benchmark dataset that contains nine gene-by-term text profiles. Experimental results show that our method provides competitive results compared to the state-of-art clustering methods, including a similar one that fuses multiple kernels and Laplacians.
AB - Understanding the role of genetics in diseases is a challenging process that has multiple applications within functional genomics and precision medicine. In this paper, we present a general clustering method to identify disease genes under a multi-view setting. First, by incorporating the graph Laplacian of spectral clustering (SC) into the discriminative K-means, we obtain a single-view subspace representation, which is endowed with both discriminant power and geometrical structure information of that data layer. Then, integrating these individual subspaces together on the Grassmann manifold, we can further find a unified low-dimensional representation under the multi-view SC framework. The proposed two-stage method generalizes the single-view discriminative K-means and the multi-view Grassmann clustering, and can directly handle the case where both attribute-based data and interaction-based networks are available, which is extremely useful in biological research. As a case study of disease gene identification, we apply this method to a benchmark dataset that contains nine gene-by-term text profiles. Experimental results show that our method provides competitive results compared to the state-of-art clustering methods, including a similar one that fuses multiple kernels and Laplacians.
UR - http://www.scopus.com/inward/record.url?scp=84968619396&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84968619396&partnerID=8YFLogxK
U2 - 10.1109/BHI.2016.7455910
DO - 10.1109/BHI.2016.7455910
M3 - Conference contribution
AN - SCOPUS:84968619396
T3 - 3rd IEEE EMBS International Conference on Biomedical and Health Informatics, BHI 2016
SP - 364
EP - 367
BT - 3rd IEEE EMBS International Conference on Biomedical and Health Informatics, BHI 2016
PB - Institute of Electrical and Electronics Engineers Inc.
Y2 - 24 February 2016 through 27 February 2016
ER -