首页 | 本学科首页   官方微博 | 高级检索  
     检索      

An efficient enhanced k-means clustering algorithm
作者姓名:FAHIM  A.M  SALEM  A.M.  TORKEY  F.A.  RAMADAN  M.A.
作者单位:[1]Department of Mathematics, Faculty of Education, Suez Canal University, Suez city, Egypt [2]Department of Computer Science, Faculty of Computers & Information, Ain Shams University Cairo city, Egypt [3]Department of Computer Science, Faculty of Computers & Information, Minufiya University, Shbeen El Koom City, Egypt) [4]Department of Mathematics, Faculty of Science, Minufiya University, Shbeen El Koom City, Egypt
摘    要:INTRODUCTION The huge amount of data collected and stored in databases increases the need for effective analysis methods to use the information contained implicitly there. One of the primary data analysis tasks is cluster analysis, intended to help a user understand the natural grouping or structure in a dataset. Therefore, the development of improved clustering algorithms has received much attention. The goal of a clustering algorithm is to group the objects of a database into a set of m…

关 键 词:聚类算法  聚合分析  数据分析  数据点
收稿时间:2006-03-15
修稿时间:11 May 2006

An efficient enhanced k-means clustering algorithm
FAHIM A.M SALEM A.M. TORKEY F.A. RAMADAN M.A..An efficient enhanced k-means clustering algorithm[J].Journal of Zhejiang University Science,2006,7(10):1626-1633.
Authors:A M Fahim  A M Salem  F A Torkey  M A Ramadan
Institution:(1) Department of Mathematics, Faculty of Education, Suez Canal University, Suez city, Egypt;(2) Department of Computer Science, Faculty of Computers & Information, Ain Shams University, Cairo city, Egypt;(3) Department of Computer Science, Faculty of Computers & Information, Minufiya University, Shbeen El Koom City, Egypt;(4) Department of Mathematics, Faculty of Science, Minufiya University, Shbeen El Koom City, Egypt
Abstract:In k-means clustering, we are given a set of n data points in d-dimensional space R^d and an integer k and the problem is to determine a set of k points in R^d, called centers, so as to minimize the mean squared distance from each data point to its nearest center. In this paper, we present a simple and efficient clustering algorithm based on the k-means algorithm, which we call enhanced k-means algorithm. This algorithm is easy to implement, requiring a simple data structure to keep some information in each iteration to be used in the next iteration. Our experimental results demonstrated that our scheme can improve the computational speed of the k-means algorithm by the magnitude in the total number of distance calculations and the overall time of computation.
Keywords:Clustering algorithms  Cluster analysis  k-means algorithm  Data analysis
本文献已被 CNKI 维普 万方数据 SpringerLink 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号