Every year there is an increasing loss of a huge amount of money due to fraudulent credit card transactions. Recently there is a focus on using machine learning algorithms to identify fraud transactions. The number of fraud cases to non-fraud transactions is very low. This creates a skewed or unbalanced data, which poses a challenge to training the machine learning models. The availability of a public dataset for this research problem is scarce. The dataset used for this work is obtained from Kaggle. In this paper, we explore different sampling techniques such as under-sampling, Synthetic Minority Oversampling Technique (SMOTE) and SMOTE-Tomek, to work on the unbalanced data. Classification models, such as k-Nearest Neighbour (KNN), logistic regression, random forest and Support Vector Machine (SVM), are trained on the sampled data to detect fraudulent credit card transactions. The performance of the various machine learning approaches are evaluated for its precision, recall and F1-score. The classification results obtained is promising and can be used for credit card fraud detection.
|Journal||Journal of Physics: Conference Series|
|Publication status||Published - 11-01-2022|
|Event||1st International Conference on Artificial Intelligence, Computational Electronics and Communication System, AICECS 2021 - Manipal, Virtual, India|
Duration: 28-10-2021 → 30-10-2021
All Science Journal Classification (ASJC) codes
- Physics and Astronomy(all)