TY - GEN
T1 - Optimizing Feature Selection in Big Data
T2 - 2024 IEEE International Conference on Computing, Semiconductor, Mechatronics, Intelligent Systems and Communications, COSMIC 2024
AU - Hada, Aman Singh
AU - Sahoo, Gyanaballav Samir
AU - Vamsi, Chinnapareddy Krishna
AU - Hegde, Anusha
AU - Bhowmik, Biswajit
N1 - Publisher Copyright:
© 2024 IEEE.
PY - 2024
Y1 - 2024
N2 - The exponential growth of big data presents both immense opportunities and significant challenges. While vast datasets hold the key to unlocking groundbreaking insights, efficiently extracting value requires sophisticated feature selection techniques. Traditional methods often struggle with the sheer volume and complexity of big data. This paper addresses this challenge by proposing a novel hybrid feature selection algorithm by leveraging Apache PySpark's distributed computing power. Combining a robust feature selection technique with a novel weighting scheme, our method outperforms existing hypercuboid and fuzzy Rough Set methods. The hybrid approach achieves superior accuracy of 72.1% with a reduced feature set, demonstrating its effectiveness in identifying salient features for big data analysis.
AB - The exponential growth of big data presents both immense opportunities and significant challenges. While vast datasets hold the key to unlocking groundbreaking insights, efficiently extracting value requires sophisticated feature selection techniques. Traditional methods often struggle with the sheer volume and complexity of big data. This paper addresses this challenge by proposing a novel hybrid feature selection algorithm by leveraging Apache PySpark's distributed computing power. Combining a robust feature selection technique with a novel weighting scheme, our method outperforms existing hypercuboid and fuzzy Rough Set methods. The hybrid approach achieves superior accuracy of 72.1% with a reduced feature set, demonstrating its effectiveness in identifying salient features for big data analysis.
UR - https://www.scopus.com/pages/publications/85219118098
UR - https://www.scopus.com/pages/publications/85219118098#tab=citedBy
U2 - 10.1109/COSMIC63293.2024.10871408
DO - 10.1109/COSMIC63293.2024.10871408
M3 - Conference contribution
AN - SCOPUS:85219118098
T3 - COSMIC 2024 - IEEE International Conference on Computing, Semiconductor, Mechatronics, Intelligent Systems and Communications, Proceedings
SP - 195
EP - 199
BT - COSMIC 2024 - IEEE International Conference on Computing, Semiconductor, Mechatronics, Intelligent Systems and Communications, Proceedings
PB - Institute of Electrical and Electronics Engineers Inc.
Y2 - 22 November 2024 through 23 November 2024
ER -