Loading...

A soft computing model for data clustering and application to gene grouping

Abstract: Data clustering aims at discovering groups and identifying patterns in data. A large number of clustering algorithms and their variations exist in literature. In this work, we consider data that has a natural ordering based on some criterion. The problem can be stated as clustering of sequ...

Full description

Bibliographic Details
Main Author: Jacob, Elizabeth
Corporate Author: Mahatma Gandhi University, School of Computer Sciences
Format: Ph.D Thesis
Published: 2013
Subjects:
Online Access:http://hdl.handle.net/10603/6802
LEADER 02488nam a22001577a 4500
100 |a Jacob, Elizabeth.  |9 31701 
245 |a A soft computing model for data clustering and application to gene grouping 
260 |c 2013 
300 |a 143 Pages. 
520 |a Abstract: Data clustering aims at discovering groups and identifying patterns in data. A large number of clustering algorithms and their variations exist in literature. In this work, we consider data that has a natural ordering based on some criterion. The problem can be stated as clustering of sequential data based on multiple features. It belongs to the class of grouping problems. When pre-ordered data is clustered, it results in contiguous blocks. In the general clustering problem, an all-against-all comparison of data objects is required. However, in sequential data clustering, the data objects are position dependent which imposes the condition that only data objects appearing close together in the data stream will belong to the same cluster, thus avoiding an all-against-all comparison. The classical approach to data clustering has given rise to a large number of algorithms that mainly fall into the hierarchical and partitional categories. Soft Computing paradigms of genetic algorithms, fuzzy logic and artificial neural networks have also contributed towards data clustering. Hybrid algorithms mix different computing families to evolve algorithms that perform better than their constitutive elements. The proposed soft computing model belongs to the class of hybrid algorithms. It draws upon the capabilities of genetic algorithms and fuzzy logic to design a methodology to partition the data set into clusters based on the contribution of a set of factors that are known to have some influence in the formation of clusters. The model consists of a fuzzy guided genetic algorithm based on multiple criteria/features. The model has been successfully applied to the problem of gene grouping in the area of bioinformatics. An organism s genome consists of a sequence of genes. The algorithm attempts to discover groups of related genes that lie adjacent on the genome. 
650 |a Soft computing,  |a Data clustering,  |a Gene grouping.  |9 31702 
710 |a Mahatma Gandhi University,  |b School of Computer Sciences  |f January, 2013  |k Hard bound  |9 31703 
856 |u http://hdl.handle.net/10603/6802 
942 |c THES 
999 |c 85422  |d 85422 
952 |0 0  |1 0  |4 0  |7 0  |9 77766  |a SOCS  |b SOCS  |d 2016-04-27  |l 0  |p SOCST12  |r 2016-04-27  |w 2016-04-27  |y THES