Principal Direction Divisive Partitioning initialisation of K-means Clustering allows identifying the most salient genes in discriminating among Leukemias
Diego Liberati
2017
Abstract
This paper attempts to cluster leukemia patients described by gene expression data and to discover the most discriminating genes responsible for the clustering. A combined approach of Principal Direction Divisive Partitioning (PDDP) and bisect K-means is applied to the investigated leukemia dataset. Both unsupervised and supervised methods are considered in order to obtain an optimal result. The combination of PDDP and bisect K-means successfully clusters leukemia patients and efficiently discovers salient genes able to discriminate between the clusters. The combined approach works well for the automatic clustering of leukemia patients based solely on gene expression information, and it has great potential for solving similar problems, such as classifying pancreatic tumors. The identified salient genes may thus provide relevant information for discriminating among leukemias.
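The idea of using a PDDP split to initialise a two-cluster K-means step can be illustrated with a minimal sketch. It is not the paper's implementation: the function names `pddp_split` and `refine_two_means`, the synthetic data, and in particular the ranking of genes by the absolute difference between cluster centroids are assumptions made only for this illustration, since the abstract does not state how salient genes are scored.

```python
import numpy as np


def pddp_split(X):
    """Split samples by the sign of their projection on the first principal direction."""
    centered = X - X.mean(axis=0)
    # The leading right singular vector of the centered matrix is the
    # first principal direction of the data.
    _, _, vt = np.linalg.svd(centered, full_matrices=False)
    projection = centered @ vt[0]
    return (projection > 0).astype(int)


def refine_two_means(X, labels, max_iter=100):
    """Refine a two-way split with K-means (k=2), starting from the given labels."""
    labels = labels.copy()
    for _ in range(max_iter):
        centroids = np.stack([X[labels == k].mean(axis=0) for k in (0, 1)])
        distances = np.linalg.norm(X[:, None, :] - centroids[None, :, :], axis=2)
        new_labels = distances.argmin(axis=1)
        # Stop on convergence, or if a cluster would become empty.
        if np.array_equal(new_labels, labels) or np.unique(new_labels).size < 2:
            break
        labels = new_labels
    centroids = np.stack([X[labels == k].mean(axis=0) for k in (0, 1)])
    return labels, centroids


# Synthetic stand-in for an expression matrix (assumption, not real data):
# 20 "patients" x 30 "genes", where only genes 0-2 differ between two groups.
rng = np.random.default_rng(0)
X = rng.normal(size=(20, 30))
X[10:, :3] += 6.0

labels, centroids = refine_two_means(X, pddp_split(X))

# Hypothetical saliency score: absolute centroid difference per gene.
gene_scores = np.abs(centroids[0] - centroids[1])
top_genes = np.argsort(gene_scores)[::-1][:3]

print("cluster labels:", labels)
print("highest-scoring genes:", sorted(top_genes.tolist()))
```

Because the PDDP split is deterministic, the K-means refinement starts from the same centroids on every run, which avoids the dependence on random initialisation that plain K-means has. Applying the same two steps recursively to each resulting cluster would give a divisive hierarchy of more than two groups.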