The Mathematical Institute, University of Oxford, Eprints Archive

DifFUZZY: A fuzzy spectral clustering algorithm for complex data sets

Cominetti, Ornella and Matzavinos, Anastasios and Samarasinghe, Sandhya and Kulasiri, Don and Maini, Philip K. and Liu,, Sijia and Erban, Radek (2009) DifFUZZY: A fuzzy spectral clustering algorithm for complex data sets. International Journal of Computational Intelligence in Bioinformatics and Systems Biology . (Submitted)



Motivation: Soft (fuzzy) clustering techniques are often used in the study of high-dimensional data sets, such as microarray and other high-throughput bioinformatics data. The most widely used method is the Fuzzy C-means algorithm (FCM), but it can present difficulties when dealing with some data sets.

Results: A spectral fuzzy clustering algorithm, DifFUZZY, applicable to a larger class of clustering problems than other fuzzy clustering algorithms is developed. Examples of data sets (synthetic and real)for which this method outperforms other frequently used algorithms are presented, including two benchmark biological data sets, a genetic expression data set and a data set that contains taxonomic measurements. This method is better than traditional fuzzy clustering algorithms at handling data sets that are “curved”, elongated or those which contain clusters of different dispersion.

Item Type:Article
Subjects:D - G > General
Research Groups:Oxford Centre for Collaborative Applied Mathematics
ID Code:953
Deposited By: Ruby Hawkins
Deposited On:02 Sep 2010 09:47
Last Modified:29 May 2015 18:38

Repository Staff Only: item control page