TY - GEN
T1 - Phenotyping Health Insurance Claims Fraud with Unsupervised Anomaly Detection Methods
AU - Seshagiri, Supriya
AU - Prema, K. V.
N1 - Publisher Copyright:
© The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2025.
PY - 2025
Y1 - 2025
N2 - The Covid-19 pandemic has caused a surge in healthcare claims fraud due to increased digitization in the health insurance domain. The dependence on digital systems during the pandemic has led to greater accumulation of data which has in turn spurred newer and innovative frauds. Insurance frauds are a social menace which result in increasing healthcare costs for all, compromising quality of care, breaking the trust of the policyholders with the Insurer and creating legal and regulatory issues for the Insurer. This research aims to identify phenotypes or observable claim characteristics that indicate fraudulent behavior using unsupervised anomaly detection (AD) methods. The experiments are done on the CMS Medicare datasets. They identify unknown phenotypic patterns using the unsupervised AD models, one-class support vector machine (OC-SVM), Isolation Forest (IForest) and local outlier factor (LOF) to identify behavior patterns leading to fraud. These models are ensembled and the resulting outcome is analyzed for feature importance to derive leading characteristics for fraudulent behavior.
AB - The Covid-19 pandemic has caused a surge in healthcare claims fraud due to increased digitization in the health insurance domain. The dependence on digital systems during the pandemic has led to greater accumulation of data which has in turn spurred newer and innovative frauds. Insurance frauds are a social menace which result in increasing healthcare costs for all, compromising quality of care, breaking the trust of the policyholders with the Insurer and creating legal and regulatory issues for the Insurer. This research aims to identify phenotypes or observable claim characteristics that indicate fraudulent behavior using unsupervised anomaly detection (AD) methods. The experiments are done on the CMS Medicare datasets. They identify unknown phenotypic patterns using the unsupervised AD models, one-class support vector machine (OC-SVM), Isolation Forest (IForest) and local outlier factor (LOF) to identify behavior patterns leading to fraud. These models are ensembled and the resulting outcome is analyzed for feature importance to derive leading characteristics for fraudulent behavior.
UR - https://www.scopus.com/pages/publications/105007987652
UR - https://www.scopus.com/pages/publications/105007987652#tab=citedBy
U2 - 10.1007/978-981-96-1744-9_23
DO - 10.1007/978-981-96-1744-9_23
M3 - Conference contribution
AN - SCOPUS:105007987652
SN - 9789819617432
T3 - Lecture Notes in Networks and Systems
SP - 265
EP - 279
BT - Information Systems for Intelligent Systems - Proceedings of ISBM 2024
A2 - Iglesias, Andres
A2 - Shin, Jungpil
A2 - Patel, Bharat
A2 - Joshi, Amit
PB - Springer Science and Business Media Deutschland GmbH
T2 - 3rd World Conference on Information Systems for Business Management, ISBM 2024
Y2 - 12 September 2024 through 13 September 2024
ER -