TY - JOUR
T1 - A unified model of explainable regression and AES-based data security for soybean yield prediction
AU - Prabhu, Omkar
AU - Samanth, Snehal
AU - Chadaga, Krishnaraj
N1 - Publisher Copyright:
© 2025 The Author(s). Published by Elsevier B.V. This is an open access article under the CC BY-NC-ND license. http://creativecommons.org/licenses/by-nc-nd/4.0/
PY - 2025/12
Y1 - 2025/12
N2 - Accurate and interpretable crop yield forecasting is critical for enhancing agricultural productivity and ensuring food security. Soybean, a globally essential crop for protein, oil, and biofuel production, plays a pivotal role in food systems and agro-economies. Therefore, reliable yield prediction is crucial for strategic planning and resource allocation. This study proposes a comprehensive framework that integrates machine learning, explainable AI (XAI), and data encryption for robust and transparent soybean yield prediction. Twelve state-of-the-art regression models are systematically evaluated and optimized using Grid Search, Random Search, and Bayesian Optimization, with K-Nearest Neighbors (KNN) emerging as the top performer (R2= 0.8636). To enhance model transparency, eight XAI techniques including permutation importance, partial dependence plots (PDP), individual conditional expectation (ICE), SHAP, LIME, counterfactual explanations, residual analysis, and surrogate modeling are applied, highlighting ‘Cultivar’ and ‘NS’ as the most influential features. Furthermore, AES encryption in CBC mode is applied using AES-128, AES-192, and AES-256 to secure the dataset, with AES-256 demonstrating the highest confusion (50.02%) and diffusion (49.99%) while preserving decryption integrity. This unified approach delivers not only high prediction accuracy and interpretability but also ensures data security, establishing a strong foundation for secure, trustworthy, and scalable smart agriculture systems.
AB - Accurate and interpretable crop yield forecasting is critical for enhancing agricultural productivity and ensuring food security. Soybean, a globally essential crop for protein, oil, and biofuel production, plays a pivotal role in food systems and agro-economies. Therefore, reliable yield prediction is crucial for strategic planning and resource allocation. This study proposes a comprehensive framework that integrates machine learning, explainable AI (XAI), and data encryption for robust and transparent soybean yield prediction. Twelve state-of-the-art regression models are systematically evaluated and optimized using Grid Search, Random Search, and Bayesian Optimization, with K-Nearest Neighbors (KNN) emerging as the top performer (R2= 0.8636). To enhance model transparency, eight XAI techniques including permutation importance, partial dependence plots (PDP), individual conditional expectation (ICE), SHAP, LIME, counterfactual explanations, residual analysis, and surrogate modeling are applied, highlighting ‘Cultivar’ and ‘NS’ as the most influential features. Furthermore, AES encryption in CBC mode is applied using AES-128, AES-192, and AES-256 to secure the dataset, with AES-256 demonstrating the highest confusion (50.02%) and diffusion (49.99%) while preserving decryption integrity. This unified approach delivers not only high prediction accuracy and interpretability but also ensures data security, establishing a strong foundation for secure, trustworthy, and scalable smart agriculture systems.
UR - https://www.scopus.com/pages/publications/105019204064
UR - https://www.scopus.com/inward/citedby.url?scp=105019204064&partnerID=8YFLogxK
U2 - 10.1016/j.atech.2025.101506
DO - 10.1016/j.atech.2025.101506
M3 - Article
AN - SCOPUS:105019204064
SN - 2772-3755
VL - 12
JO - Smart Agricultural Technology
JF - Smart Agricultural Technology
M1 - 101506
ER -