Google scholar arxiv informatics ads IJAIS publications are indexed with Google Scholar, NASA ADS, Informatics et. al.

Call for Paper

-

April Edition 2021

International Journal of Applied Information Systems solicits high quality original research papers for the April 2021 Edition of the journal. The last date of research paper submission is March 15, 2021.

Kohonen Self Organizing Map with Modified K-means clustering For High Dimensional Data Set

Madhusmita Mishra, H. S. Behera Published in

International Journal of Applied Information Systems
Year of Publication 2012
© 2010 by IJAIS Journal
10.5120/ijais12-450310
Download full text
  1. Madhusmita Mishra and H s Behera. Article: Kohonen Self Organizing Map with Modified K-means clustering For High Dimensional Data Set. International Journal of Applied Information Systems 2(3):34-39, May 2012. BibTeX

    @article{key:article,
    	author = "Madhusmita Mishra and H.s. Behera",
    	title = "Article: Kohonen Self Organizing Map with Modified K-means clustering For High Dimensional Data Set",
    	journal = "International Journal of Applied Information Systems",
    	year = 2012,
    	volume = 2,
    	number = 3,
    	pages = "34-39",
    	month = "May",
    	note = "Published by Foundation of Computer Science, New York, USA"
    }
    

Abstract

Since it was first proposed, it is amazing to notice how K-Means algorithm has survive over the years. It has been one among the well known algorithms for data clustering in the field of data mining. Day in and day out new algorithms are evolving for data clustering purposes but none can be as fast and accurate as the K-Means algorithm. But in spite of its huge speed, accuracy and simplicity K-Means has suffered from some of its own problem. Such as, the exact number of cluster is not known prior to clustering. The other thing that is causing problem is that it is quite sensitive to initial centroids. Not just that, K-Means fails to give optimum result when it comes to clustering high dimensional data set because its complexity tends to make things more complicated when more number of dimensions are added. In Data Mining this problem is known as "Curse of High Dimensionality". Here in our paper we proposed a new Modified K-Means algorithm that will overcome the problem faced by the standard K-Means algorithm. We proposed the use of Kohonen Self Organizing Map (KSOM) so as to visualize exact number of clusters before clustering and genetic algorithm is applied for initialization. The Kohonen Self Organizing Map (KSOM) with Modified K-Means algorithm is tested on an iris data set and its performance is compared with other clustering algorithm and is found out to be more accurate, with less number of classification and quantization errors and can be applied even for high dimensional dataset.

Reference

  1. Dash, R. et. al , "A Hybridized k-Means Clustering Algorithm for High Dimensional Dataset", International Journal of Engineering, Science and Technology, vol. 2, No. 2, pp. 59-66,2010.
  2. Behera, H. S. et al, "An improved hybridized k-means clustering algorithm(IHKMCA) for high dimensional dataset and it's performance analysis" International journal of Computer science & Engineering,Vol-3 no-2,pp 1183-1190,2011.
  3. Vesanto, J. and Alhoniemi, E. , "Clustering of the Self-Organizing Map", IEEE Transactions on Neural Networks, Vol. 11, No. 3, May 2000, pp. 586-600.
  4. Vesanto J. , "SOM-based data visualization methods", Intell, Data Analysis, vol. 3, No. 2, pp. 111-126, 1999.
  5. M. N. M and Moheb, E. , "Hybrid Self Organizing Map for Overlapping Clusters", International Journal of Signal Processing, Image Processing and Pattern Recognition,pp-11-20.
  6. Bohling, J. , "Dimension Reduction And Cluster Analysis", EECS 833, 6 March 2006.
  7. Yedla, M. et al, "Enhancing K means algorithm with improved initial center", (IJCSIT) International Journal of Computer Science and Information Technologies, Vol. 1 (2) , pp- 121-125,2010.
  8. Fahim A. M. , et al, "An efficient k-means with good initial starting points", Georgian Electronic Scientific Journal: Computer Science and Telecommunications, Vol. 2, No. 19, pp. 47-57,2009.
  9. Zhang, C. , Xia, S. , et al, "K-means Clustering Algorithm with Improved Initial Center," Second International Workshop on Knowledge Discovery and Data Mining, wkdd, pp. 790-792,2009.
  10. Bashar Al Shboul et. al "Initializing K-Means Clustering Algorithm by using Genetic Algorithm" , World Academy of Science, Engineering and Technology 54 2009.

Keywords

K-means, Kohonen Self Organizing Map, Genetic Algorithm, Curse Of Dimensionality, Classification Error