A Modified Version of the K-means Clustering Algorithm

Article ID

CSTSDE333S8

A Modified Version of the K-means Clustering Algorithm

Juhi Katara
Juhi Katara College of Technology and Engineering, MPUAT
Naveen Choudhary
Naveen Choudhary
DOI

Abstract

Clustering is a technique in data mining which divides given data set into small clusters based on their similarity. K-means clustering algorithm is a popular, unsupervised and iterative clustering algorithm which divides given dataset into k clusters. But there are some drawbacks of traditional k-means clustering algorithm such as it takes more time to run as it has to calculate distance between each data object and all centroids in each iteration. Accuracy of final clustering result is mainly depends on correctness of the initial centroids, which are selected randomly. This paper proposes a methodology which finds better initial centroids further this method is combined with existing improved method for assigning data objects to clusters which requires two simple data structures to store information about each iteration, which is to be used in the next iteration. Proposed algorithm is compared in terms of time and accuracy with traditional k-means clustering algorithm as well as with a popular improved k-means clustering algorithm.

A Modified Version of the K-means Clustering Algorithm

Clustering is a technique in data mining which divides given data set into small clusters based on their similarity. K-means clustering algorithm is a popular, unsupervised and iterative clustering algorithm which divides given dataset into k clusters. But there are some drawbacks of traditional k-means clustering algorithm such as it takes more time to run as it has to calculate distance between each data object and all centroids in each iteration. Accuracy of final clustering result is mainly depends on correctness of the initial centroids, which are selected randomly. This paper proposes a methodology which finds better initial centroids further this method is combined with existing improved method for assigning data objects to clusters which requires two simple data structures to store information about each iteration, which is to be used in the next iteration. Proposed algorithm is compared in terms of time and accuracy with traditional k-means clustering algorithm as well as with a popular improved k-means clustering algorithm.

Juhi Katara
Juhi Katara College of Technology and Engineering, MPUAT
Naveen Choudhary
Naveen Choudhary

No Figures found in article.

Juhi Katara. 2015. “. Global Journal of Computer Science and Technology – C: Software & Data Engineering GJCST-C Volume 15 (GJCST Volume 15 Issue C7): .

Download Citation

Journal Specifications

Crossref Journal DOI 10.17406/gjcst

Print ISSN 0975-4350

e-ISSN 0975-4172

Classification
GJCST-C Classification: B.2.4, B.7.1
Keywords
Article Matrices
Total Views: 8242
Total Downloads: 2142
2026 Trends
Research Identity (RIN)
Related Research
Our website is actively being updated, and changes may occur frequently. Please clear your browser cache if needed. For feedback or error reporting, please email [email protected]

Request Access

Please fill out the form below to request access to this research paper. Your request will be reviewed by the editorial or author team.
X

Quote and Order Details

Contact Person

Invoice Address

Notes or Comments

This is the heading

Lorem ipsum dolor sit amet, consectetur adipiscing elit. Ut elit tellus, luctus nec ullamcorper mattis, pulvinar dapibus leo.

High-quality academic research articles on global topics and journals.

A Modified Version of the K-means Clustering Algorithm

Juhi Katara
Juhi Katara College of Technology and Engineering, MPUAT
Naveen Choudhary
Naveen Choudhary

Research Journals