Detection of fraudulent credit card transactions: A comparative analysis of data sampling and classification techniques

Konduri Praveen Mahesh, Shaik Ashar Afrouz, Anu Shaju Areeckal

Research output: Contribution to journalConference articlepeer-review

4 Citations (Scopus)

Abstract

Every year there is an increasing loss of a huge amount of money due to fraudulent credit card transactions. Recently there is a focus on using machine learning algorithms to identify fraud transactions. The number of fraud cases to non-fraud transactions is very low. This creates a skewed or unbalanced data, which poses a challenge to training the machine learning models. The availability of a public dataset for this research problem is scarce. The dataset used for this work is obtained from Kaggle. In this paper, we explore different sampling techniques such as under-sampling, Synthetic Minority Oversampling Technique (SMOTE) and SMOTE-Tomek, to work on the unbalanced data. Classification models, such as k-Nearest Neighbour (KNN), logistic regression, random forest and Support Vector Machine (SVM), are trained on the sampled data to detect fraudulent credit card transactions. The performance of the various machine learning approaches are evaluated for its precision, recall and F1-score. The classification results obtained is promising and can be used for credit card fraud detection.

Original languageEnglish
Article number012072
JournalJournal of Physics: Conference Series
Volume2161
Issue number1
DOIs
Publication statusPublished - 11-01-2022
Event1st International Conference on Artificial Intelligence, Computational Electronics and Communication System, AICECS 2021 - Manipal, Virtual, India
Duration: 28-10-202130-10-2021

All Science Journal Classification (ASJC) codes

  • Physics and Astronomy(all)

Fingerprint

Dive into the research topics of 'Detection of fraudulent credit card transactions: A comparative analysis of data sampling and classification techniques'. Together they form a unique fingerprint.

Cite this