TY - JOUR
T1 - Supervised models for loan fraud analysis using big data approach
AU - Attigeri, Girija
AU - Pai M M, Manohara
AU - Pai, Radhika M.
N1 - Publisher Copyright:
© 2021, International Association of Engineers. All rights reserved.
PY - 2021
Y1 - 2021
N2 - —Banking and Financial Institutions are facing the pressure of increased defaults by individuals and firms in the last few years repercussions due to fraudulent activities. It is not only adversely affecting banks but also other financial sectors which depend on them. This makes it imperative to study the ways to prevent them rather than curing the situations. However, banks face two challenges in identifying NPAs and Wilful defaults. The first one is the due diligence of firms/individuals before an extension of the loan. The second one is, need for the placement of automated safeguards to reduce frauds originating out from human behavior. The wilful defaults are committed mainly in loan and credit services for personal benefits and are getting converted into bad loans. Bad loans are the Non-Performing Assets (NPAs) and wilful defaults are a subset of these. Hence, it is very important to control NPAs. The objective of the paper is to design and evaluate machine learning based supervised models for NPA detection. To design models, the entire historical and current data needs to be considered, which requires, faster access to large volumes of heterogeneous data. Hence, the supervised models are implemented using big data techniques for fraud detection and analytics. The various supervised models namely Logistic Regression, Support Vector Machine, Random Forest, Neural Network, and Naive Bayes are designed for loan data and experimented using Map Reduce on Hadoop platform. These models are evaluated considering various performance metrics. The empirical result shows that the Neural Network model performs best considering precision, recall, relative commission error, and kappa statistics for NPA prediction. The best-performed model can be integrated into the existing loan management system for the early identification of NPA cases.
AB - —Banking and Financial Institutions are facing the pressure of increased defaults by individuals and firms in the last few years repercussions due to fraudulent activities. It is not only adversely affecting banks but also other financial sectors which depend on them. This makes it imperative to study the ways to prevent them rather than curing the situations. However, banks face two challenges in identifying NPAs and Wilful defaults. The first one is the due diligence of firms/individuals before an extension of the loan. The second one is, need for the placement of automated safeguards to reduce frauds originating out from human behavior. The wilful defaults are committed mainly in loan and credit services for personal benefits and are getting converted into bad loans. Bad loans are the Non-Performing Assets (NPAs) and wilful defaults are a subset of these. Hence, it is very important to control NPAs. The objective of the paper is to design and evaluate machine learning based supervised models for NPA detection. To design models, the entire historical and current data needs to be considered, which requires, faster access to large volumes of heterogeneous data. Hence, the supervised models are implemented using big data techniques for fraud detection and analytics. The various supervised models namely Logistic Regression, Support Vector Machine, Random Forest, Neural Network, and Naive Bayes are designed for loan data and experimented using Map Reduce on Hadoop platform. These models are evaluated considering various performance metrics. The empirical result shows that the Neural Network model performs best considering precision, recall, relative commission error, and kappa statistics for NPA prediction. The best-performed model can be integrated into the existing loan management system for the early identification of NPA cases.
UR - http://www.scopus.com/inward/record.url?scp=85119960144&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85119960144&partnerID=8YFLogxK
M3 - Article
AN - SCOPUS:85119960144
SN - 1816-093X
VL - 29
SP - 1422
EP - 1435
JO - Engineering Letters
JF - Engineering Letters
IS - 4
ER -