Enhancing healthcare data integrity: fraud detection using unsupervised learning techniques

Maithri Bairy, Balachandra Muniyal*, Nisha P. Shetty

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

1 Citation (Scopus)

Abstract

Data in healthcare forms the backbone of any treatment and decision-making for patients. However, the data from a healthcare institution can sometimes be prone to abnormalities, thereby putting treatment and patient safety in jeopardy. This paper points to the dire need for reliable anomaly detection systems in the healthcare industry. It employs various unsupervised learning methods, including Isolation Forest, Local Outlier Factor (LOF), K-Nearest Neighbours (KNN), and autoencoder models for detecting abnormalities in healthcare data with better accuracy. Anomaly detection capabilities also allow health providers to reduce risks and provide some assurance of the integrity of the data, as these capabilities are more likely to indicate unusual patient profiles or incorrect test results. Isolation Forest, LOF, and KNN are the preliminary methods for performing anomaly detection in this work, with Isolation Forest yielding the best results. Then, autoencoder models that learn subtle variations and complex patterns in data are employed. This paper seeks to enhance anomaly detection in terms of accuracy and reliability to ensure better healthcare data quality and patient safety.

Original languageEnglish
Pages (from-to)1006-1019
Number of pages14
JournalInternational Journal of Computers and Applications
Volume46
Issue number11
DOIs
Publication statusPublished - 2024

All Science Journal Classification (ASJC) codes

  • Software
  • Hardware and Architecture
  • Computer Science Applications
  • Computer Graphics and Computer-Aided Design

Fingerprint

Dive into the research topics of 'Enhancing healthcare data integrity: fraud detection using unsupervised learning techniques'. Together they form a unique fingerprint.

Cite this