TY - GEN
T1 - Autoencoders for Insider Threat Detection of Healthcare Records
AU - Deshpande, Ankita R.
AU - Ignisha Rajathi, G.
AU - Mohanalin, J.
AU - Yashaswini, K. A.
AU - Devadas, Raghavendra M.
AU - Rama Krishna, K.
AU - Sandhya, Kaipa
AU - Hiremani, Vani
N1 - Publisher Copyright:
© The Author(s), under exclusive license to Springer Nature Switzerland AG 2026.
PY - 2026
Y1 - 2026
N2 - Digitization of healthcare records has introduced challenges to data privacy, security and compliance. Sensitive information contained in the records make them prime targets for cyber threats, unauthorized access and data breaches. Often these acts are committed by malicious insider actors who have access to the records. This paper explores a possible way of identifying the insider threats by analyzing the activity of an actor over a period of time. We propose using Machine Learning, specifically Autoencoders and Variational Autoencoders to detect abnormal activity. We have used the Computer Emergency Response Team Insider Threat Dataset as a test dataset to train the models. We convert the textual data into vector form and use Long short-term memory to extract, reduce the number of features and reconstruct the inputs. We evaluate the performance and accuracy of Autoencoders and Variational Autoencoders by comparing the time taken to train the model on the sample dataset and the full dataset, and the reconstruction errors of the two methods. We conclude that the Variational Autoencoder is better suited for threat detection as it trains faster and has lesser reconstruction errors.
AB - Digitization of healthcare records has introduced challenges to data privacy, security and compliance. Sensitive information contained in the records make them prime targets for cyber threats, unauthorized access and data breaches. Often these acts are committed by malicious insider actors who have access to the records. This paper explores a possible way of identifying the insider threats by analyzing the activity of an actor over a period of time. We propose using Machine Learning, specifically Autoencoders and Variational Autoencoders to detect abnormal activity. We have used the Computer Emergency Response Team Insider Threat Dataset as a test dataset to train the models. We convert the textual data into vector form and use Long short-term memory to extract, reduce the number of features and reconstruct the inputs. We evaluate the performance and accuracy of Autoencoders and Variational Autoencoders by comparing the time taken to train the model on the sample dataset and the full dataset, and the reconstruction errors of the two methods. We conclude that the Variational Autoencoder is better suited for threat detection as it trains faster and has lesser reconstruction errors.
UR - https://www.scopus.com/pages/publications/105028363896
UR - https://www.scopus.com/pages/publications/105028363896#tab=citedBy
U2 - 10.1007/978-3-032-08246-6_35
DO - 10.1007/978-3-032-08246-6_35
M3 - Conference contribution
AN - SCOPUS:105028363896
SN - 9783032082459
T3 - Lecture Notes in Networks and Systems
SP - 434
EP - 445
BT - Smart Computing Paradigms
A2 - Bhateja, Vikrant
A2 - Hoong, Angela Lee Siew
A2 - William, Yeoh Ging Sun
A2 - Rana, Muhammad Ehsan
PB - Springer Science and Business Media Deutschland GmbH
T2 - 7th International Conference on Smart Computing and Informatics, SCI 2025
Y2 - 8 April 2025 through 9 April 2025
ER -