TY - GEN
T1 - Performance Analysis of Clustering Using Modified Grey Wolf Optimization
AU - Shafeeq, B. M. Ahamed
AU - Ansari, Zahid Ahmed
N1 - Publisher Copyright: © The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2023.
PY - 2023
Y1 - 2023
AB - Data clustering is a widely used technique in academia and industry for analysing large volumes of data with unknown patterns. Data clustering approaches that draw inspiration from biology are increasingly common. In this research, we propose a parallelized, automated data clustering method using a modified Grey Wolf Optimization technique based on the hunting style of the grey wolf, which involves tracking, chasing and approaching the prey. It finds the optimal solution from the generated ‘N’ solutions. However, the nature of the massive data available in repositories is unknown, so it is a tedious task to guess the right number of clusters. By repeating the procedure with clusters K = 2 to N, the suggested technique determines the best number of clusters. The ideal number of clusters is detected based on the best values of the Silhouette index, Davies-Bouldin index and Calinski-Harabasz index. This research aims to propose a more efficient intelligent clustering framework. The suggested approach operates in both scenarios, i.e. with a predetermined number of clusters and with an uncertain number of clusters: the user can either fix the number of clusters or let the system identify the optimal number. The proposed method parallelizes and automates cluster analysis to determine the best clusters and form natural clusters.
UR - https://www.scopus.com/pages/publications/85174524248
UR - https://www.scopus.com/pages/publications/85174524248#tab=citedBy
U2 - 10.1007/978-981-99-4284-8_1
DO - 10.1007/978-981-99-4284-8_1
M3 - Conference contribution
AN - SCOPUS:85174524248
SN - 9789819942831
T3 - Lecture Notes in Networks and Systems
SP - 3
EP - 13
BT - Advanced Computational and Communication Paradigms - Proceedings of ICACCP 2023
A2 - Borah, Samarjeet
A2 - Gandhi, Tapan K.
A2 - Piuri, Vincenzo
PB - Springer Science and Business Media Deutschland GmbH
T2 - 4th International Conference on Advanced Computational and Communication Paradigms, ICACCP 2023
Y2 - 16 February 2023 through 18 February 2023
ER -