TY - GEN
T1 - Effect of Dimensionality Reduction on Classification Accuracy for Protein–Protein Interaction Prediction
AU - Mahapatra, Satyajit
AU - Kumar, Anish
AU - Sharma, Animesh
AU - Sahu, Sitanshu Sekhar
N1 - Publisher Copyright:
© 2020, Springer Nature Singapore Pte Ltd.
PY - 2020
Y1 - 2020
N2 - “Large Dimension” of features derived from protein sequences is a major problem in protein–protein interaction (PPI) prediction. Thus, reduction of the feature dimension may increase the classification accuracy. In this paper, particle swarm optimization (PSO) and principal component analysis (PCA) have been used for dimensionality reduction of PPI sequence features. The performance of the algorithm has been assessed using the intraspecies E coli protein–protein interaction database, containing an equal number of positive and negative interacting pairs. Standard sequence-based features such as amino acid composition (AAC), dipeptide composition (Dipep), and conjoint triad composition (CTD) are extracted. From the results, it is seen that the PSO-based dimensionality reduction method provides steady and better performance in terms of accuracy when applied to the features.
AB - “Large Dimension” of features derived from protein sequences is a major problem in protein–protein interaction (PPI) prediction. Thus, reduction of the feature dimension may increase the classification accuracy. In this paper, particle swarm optimization (PSO) and principal component analysis (PCA) have been used for dimensionality reduction of PPI sequence features. The performance of the algorithm has been assessed using the intraspecies E coli protein–protein interaction database, containing an equal number of positive and negative interacting pairs. Standard sequence-based features such as amino acid composition (AAC), dipeptide composition (Dipep), and conjoint triad composition (CTD) are extracted. From the results, it is seen that the PSO-based dimensionality reduction method provides steady and better performance in terms of accuracy when applied to the features.
UR - https://www.scopus.com/pages/publications/85081167275
UR - https://www.scopus.com/pages/publications/85081167275#tab=citedBy
U2 - 10.1007/978-981-15-1081-6_1
DO - 10.1007/978-981-15-1081-6_1
M3 - Conference contribution
AN - SCOPUS:85081167275
SN - 9789811510809
T3 - Advances in Intelligent Systems and Computing
SP - 3
EP - 12
BT - Advanced Computing and Intelligent Engineering - Proceedings of ICACIE 2018
A2 - Pati, Bibudhendu
A2 - Panigrahi, Chhabi Rani
A2 - Buyya, Rajkumar
A2 - Li, Kuan-Ching
PB - Springer
T2 - 3rd International Conference on Advanced Computing and Intelligent Engineering, ICACIE 2018
Y2 - 22 December 2018 through 24 December 2018
ER -