Penerapan Modified ADASYN untuk Meningkatkan Akurasi Pendeteksian Pola Fraud pada Transaksi Kartu Kredit

Authors

  • Ebhen Haezer Sitohang Program Studi Informatika
  • Djoni Haryadi Setiabudi Program Studi Informatika
  • Stephanus Antonius Ananda Program Studi Informatika

Abstract

One of the most influential factors in accuracy is class imbalance, this is reviewed in a study conducted by Gameng et al. (2019). In the study of Bagga et al. (2020), the Pipelining method combined with ADASYN the accuracy can reach 0.99999. The problem in this study is that accuracy may not necessarily reach 0.99999 if using a dataset outside the dataset they are using and if using a Classificationnn algorithm other than pipelining. In a study conducted by Dornadula & Geetha (2019), the highest accuracy was only 0.9994. In the research conducted by Makki et al. (2019), the Classificationnn model that uses the class balancing method has lower accuracy.

In this thesis, Modified ADASYN is used because in the research of Gameng et al. (2019) its accuracy, precision and f1-score surpassed ADASYN and SMOTE. Pipelining method is used because in the study of Bagga et al. (2020), Pipelining can make Classificationnn accuracy up to 0.99999.

As a result of testing, this thesis concludes that Modified ADASYN has not been able to obtain an accuracy of 0.999999 on two different datasets. In this thesis, Modified ADASYN is able to increase the accuracy of K-NN to 0.9995148 and 0.97617554 using the first and second datasets. Modified ADASYN can outperform SMOTE, ADASYN, One-Class Classificationnn and Cost Sensitive. In this thesis, it is found that the optimal K value in Modified ADASYN can vary depending on many parameters and sample data.

References

[1] Bagga, S., Goyal, A., Gupta, N., & Goyal, A. 2020. Credit Card Fraud Detection using Pipeling and Ensemble Learning. Procedia Computer Science, 173, 104-112. DOI= https://doi.org/10.1016/j.procs.2020.06.014

[2] Dornadula, V. N., & Geetha, S. 2019. Credit card fraud detection using machine learning algorithms. Procedia Computer Science, 165, 631-641. DOI= https://doi.org/10.1016/j.procs.2020.01.057

[3] Fraugster. The State Of Credit Card Fraud 2021. 2021. Google Book Search. URI= https://fraugster.cdn.prismic.io/fraugster/9cedffd2-9339-4111-9b0d-d54a148de932_Fraugsters+State+Of+Credit+Card+Fraud+2021.pdf

[4] Gameng, H. A., Gerardo, B.B., & Medina, R.P. 2019. Modified Adaptive Synthetic SMOTE to Improve Classification Performance in Imbalanced Datasets. 2019 IEEE 6th International Conference on Engineering Technologies and Applied Sciences (ICETAS), pp. 1-5. DOI= https://doi.org/10.1109/ICETAS48360.2019.9117287.

[5] Han, J., Kamber, M., & Pei, J. 2011. Data Mining: Concepts and Techniques 3rd edition. Elsevier Science. URI= http://myweb.sabanciuniv.edu/rdehkharghani/files/2016/02/The-Morgan-Kaufmann-Series-in-Data-Management-Systems-Jiawei-Han-Micheline-Kamber-Jian-Pei-Data-Mining.-Concepts-and-Techniques-3rd-Edition-Morgan-Kaufmann-2011.pdf

[6] Johnson, J. M., & Khoshgoftaar, T.M. 2019. Survey on Deep Learning with Class Imbalance. Journal Of Big Data, 6:27. DOI= https://doi.org/10.1186/s40537-019-0192-5

[7] Kho, J. R., & Vea, L. A. 2017. Credit card fraud detection based on transaction behavior. TENCON 2017 - 2017 IEEE Region 10 Conference. DOI= https://doi.org/10.1109/TENCON.2017.8228165

[8] Makki, S., Assaghir, Z., Taher, Y., Haque, R., Hacid, M. S., & Zeineddine, H. 2019. An experimental study with imbalanced classification approaches for credit card fraud detection. IEEE Access, 7, 93010-93022. DOI= https://doi.org/10.1109/ACCESS.2019.2927266

[9] Sklearn.feature_selection.f_regression. 2020. sckit.(n. d). URI= https://scikit-learn.org/stable/modules/generated/sklearn.feature_selection.f_regression.html

[10] Sklearn.feature_selection.selectKBest. 2020.scikit. (n.d). URI= https://scikit-learn.org/stable/modules/generated/sklearn.feature_selection.SelectKBest.html

[11] Sklearn.pipeline.Pipeline. 2020. scikit. (n. d). URI= https://scikit-learn.org/stable/modules/generated/sklearn.pipeline.Pipeline.htmls

Downloads

Published

2021-10-13

Issue

Section

Articles