TY - GEN
T1 - Feature Selection and Modeling using Statistical and Machine learning Methods
AU - D'souza, Sofia
AU - Prema, K. V.
AU - Balaji, S.
N1 - Publisher Copyright:
© 2020 IEEE.
Copyright:
Copyright 2021 Elsevier B.V., All rights reserved.
PY - 2020/10/30
Y1 - 2020/10/30
N2 - Feature selection is a necessary step in machine learning regression problems that aims to find relevant and reduced set of features. In this research, we assessed the performance of three different learning models on a Quantitative structure activity relationship (QSAR) dataset. Learning models were developed from a pool of features selected by three different variable selection techniques. The results indicate that the final learning models built using statistically significant features exhibit improved predictive performance. Further, Partial least squares (PLS) learning model has shown better predictive performance compared to other learning models on the external test set.
AB - Feature selection is a necessary step in machine learning regression problems that aims to find relevant and reduced set of features. In this research, we assessed the performance of three different learning models on a Quantitative structure activity relationship (QSAR) dataset. Learning models were developed from a pool of features selected by three different variable selection techniques. The results indicate that the final learning models built using statistically significant features exhibit improved predictive performance. Further, Partial least squares (PLS) learning model has shown better predictive performance compared to other learning models on the external test set.
UR - http://www.scopus.com/inward/record.url?scp=85099717438&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85099717438&partnerID=8YFLogxK
U2 - 10.1109/DISCOVER50404.2020.9278093
DO - 10.1109/DISCOVER50404.2020.9278093
M3 - Conference contribution
AN - SCOPUS:85099717438
T3 - 2020 IEEE International Conference on Distributed Computing, VLSI, Electrical Circuits and Robotics, DISCOVER 2020 - Proceedings
SP - 18
EP - 22
BT - 2020 IEEE International Conference on Distributed Computing, VLSI, Electrical Circuits and Robotics, DISCOVER 2020 - Proceedings
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2020 IEEE International Conference on Distributed Computing, VLSI, Electrical Circuits and Robotics, DISCOVER 2020
Y2 - 30 October 2020 through 31 October 2020
ER -