TY - GEN
T1 - A FSM based approach for efficient implementation of K-means algorithm
AU - Ratnakumar, Rahul
AU - Nanda, Satyasai Jagannath
N1 - Publisher Copyright:
© 2016 IEEE.
PY - 2017/10/10
Y1 - 2017/10/10
N2 - After Fifty years of it's existence the K-means clustering is still popular among researchers due to lower computational complexity. Real time embedded applications require hardwiring of unsupervised learning algorithms like K-means within System-on-Chip for prompt processing in applications like image segmentation, pattern classification, speech recognition etc. This requirement is a must while analyzing Big Datasets. In this manuscript a FSM based architecture is developed for the efficient implementation of K-means algorithm. The proposed architecture has lower computational requirement due to the introduction of concepts like simplified Convergence Checker as well as Fibonacci linear feedback shift register for centroid initialization. To reduce hardware further, Manhattan distance is used as the distance metric instead of the conventional Euclidean distance. Benchmark IRIS flower dataset is used for testing the clustering performance of the proposed architecture. Results obtained after synthesis in Xilinx FPGA Artix7, reveals that the hardware performance is better than previous works, with respect to power (82mW), number of gates, area etc. and has good system clock frequency of 162MHz (6.1592ns), without using any DSP Blocks.
AB - After Fifty years of it's existence the K-means clustering is still popular among researchers due to lower computational complexity. Real time embedded applications require hardwiring of unsupervised learning algorithms like K-means within System-on-Chip for prompt processing in applications like image segmentation, pattern classification, speech recognition etc. This requirement is a must while analyzing Big Datasets. In this manuscript a FSM based architecture is developed for the efficient implementation of K-means algorithm. The proposed architecture has lower computational requirement due to the introduction of concepts like simplified Convergence Checker as well as Fibonacci linear feedback shift register for centroid initialization. To reduce hardware further, Manhattan distance is used as the distance metric instead of the conventional Euclidean distance. Benchmark IRIS flower dataset is used for testing the clustering performance of the proposed architecture. Results obtained after synthesis in Xilinx FPGA Artix7, reveals that the hardware performance is better than previous works, with respect to power (82mW), number of gates, area etc. and has good system clock frequency of 162MHz (6.1592ns), without using any DSP Blocks.
UR - https://www.scopus.com/pages/publications/85034757742
UR - https://www.scopus.com/pages/publications/85034757742#tab=citedBy
U2 - 10.1109/ISVDAT.2016.8064848
DO - 10.1109/ISVDAT.2016.8064848
M3 - Conference contribution
AN - SCOPUS:85034757742
T3 - 2016 20th International Symposium on VLSI Design and Test, VDAT 2016
BT - 2016 20th International Symposium on VLSI Design and Test, VDAT 2016
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 20th International Symposium on VLSI Design and Test, VDAT 2016
Y2 - 24 May 2016 through 27 May 2016
ER -