TY - JOUR
T1 - Performance evaluation of efficient interpretable CNN-transformer model for redshift prediction
AU - Sen, Snigdha
AU - Pandit, Ambuj Kumar
AU - Chakraborty, Pavan
AU - Singh, Krishna Pratap
N1 - Publisher Copyright:
© The Author(s), under exclusive licence to Springer-Verlag London Ltd., part of Springer Nature 2025.
PY - 2025/4
Y1 - 2025/4
N2 - The popularity of deep learning models precisely transformer-based models has immensely increased in image classification tasks recently. In this article, we investigate the performance of a transformer based model on redshift estimation, a regression task from cosmology. Primarily estimation of redshift through galaxy images is a crucial application to comprehend the structure of our universe. Here, we propose a new variant of deep learning framework consisting of a Convolution Neural network (CNN) coupled with a inception and attention mechanism to estimate redshift from images and photometric features acquired through Sloan Digital Sky Survey (SDSS). The proposed architecture combines the inception block layer together with transformer layers to improve both generalization and capacity of the model and designed specifically to handle mixed input data (image and magnitude features) together in parallel. Extensive experiment shows that model achieves slightly better results in terms of Mean Squared Error (0.026), bias (-0.0002), and competetive Normalized Mean Absolute Deviation (0.013), than contemporary models when tested on our dataset. We empirically show that the proposed model leverages the data better than non-transformer-based models. Furthermore, to gain better insight about inner working procedure of the proposed framework, we applied Saliency map and SHapley Additive exPlanations (SHAP) to explore the essential information in activation maps.
AB - The popularity of deep learning models precisely transformer-based models has immensely increased in image classification tasks recently. In this article, we investigate the performance of a transformer based model on redshift estimation, a regression task from cosmology. Primarily estimation of redshift through galaxy images is a crucial application to comprehend the structure of our universe. Here, we propose a new variant of deep learning framework consisting of a Convolution Neural network (CNN) coupled with a inception and attention mechanism to estimate redshift from images and photometric features acquired through Sloan Digital Sky Survey (SDSS). The proposed architecture combines the inception block layer together with transformer layers to improve both generalization and capacity of the model and designed specifically to handle mixed input data (image and magnitude features) together in parallel. Extensive experiment shows that model achieves slightly better results in terms of Mean Squared Error (0.026), bias (-0.0002), and competetive Normalized Mean Absolute Deviation (0.013), than contemporary models when tested on our dataset. We empirically show that the proposed model leverages the data better than non-transformer-based models. Furthermore, to gain better insight about inner working procedure of the proposed framework, we applied Saliency map and SHapley Additive exPlanations (SHAP) to explore the essential information in activation maps.
UR - https://www.scopus.com/pages/publications/85218418877
UR - https://www.scopus.com/pages/publications/85218418877#tab=citedBy
U2 - 10.1007/s11760-025-03855-9
DO - 10.1007/s11760-025-03855-9
M3 - Article
AN - SCOPUS:85218418877
SN - 1863-1703
VL - 19
JO - Signal, Image and Video Processing
JF - Signal, Image and Video Processing
IS - 4
M1 - 326
ER -