Penerapan Modified ADASYN untuk Meningkatkan Akurasi Pendeteksian Pola Fraud pada Transaksi Kartu Kredit

Ebhen Haezer Sitohang(1*), Djoni Haryadi Setiabudi(2), Stephanus Antonius Ananda(3),


(1) Program Studi Informatika
(2) Program Studi Informatika
(3) Program Studi Informatika
(*) Corresponding Author

Abstract


One of the most influential factors in accuracy is class imbalance, this is reviewed in a study conducted by Gameng et al. (2019). In the study of Bagga et al. (2020), the Pipelining method combined with ADASYN the accuracy can reach 0.99999. The problem in this study is that accuracy may not necessarily reach 0.99999 if using a dataset outside the dataset they are using and if using a Classificationnn algorithm other than pipelining. In a study conducted by Dornadula & Geetha (2019), the highest accuracy was only 0.9994. In the research conducted by Makki et al. (2019), the Classificationnn model that uses the class balancing method has lower accuracy.

In this thesis, Modified ADASYN is used because in the research of Gameng et al. (2019) its accuracy, precision and f1-score surpassed ADASYN and SMOTE. Pipelining method is used because in the study of Bagga et al. (2020), Pipelining can make Classificationnn accuracy up to 0.99999.

As a result of testing, this thesis concludes that Modified ADASYN has not been able to obtain an accuracy of 0.999999 on two different datasets. In this thesis, Modified ADASYN is able to increase the accuracy of K-NN to 0.9995148 and 0.97617554 using the first and second datasets. Modified ADASYN can outperform SMOTE, ADASYN, One-Class Classificationnn and Cost Sensitive. In this thesis, it is found that the optimal K value in Modified ADASYN can vary depending on many parameters and sample data.

Keywords


credit-card fraud; modified ADASYN; pipelining; imbalanced classification; class imbalance

Full Text:

PDF

References


Bagga, S., Goyal, A., Gupta, N., & Goyal, A. 2020. Credit Card Fraud Detection using Pipeling and Ensemble Learning. Procedia Computer Science, 173, 104-112. DOI= https://doi.org/10.1016/j.procs.2020.06.014

Dornadula, V. N., & Geetha, S. 2019. Credit card fraud detection using machine learning algorithms. Procedia Computer Science, 165, 631-641. DOI= https://doi.org/10.1016/j.procs.2020.01.057

Fraugster. The State Of Credit Card Fraud 2021. 2021. Google Book Search. URI= https://fraugster.cdn.prismic.io/fraugster/9cedffd2-9339-4111-9b0d-d54a148de932_Fraugsters+State+Of+Credit+Card+Fraud+2021.pdf

Gameng, H. A., Gerardo, B.B., & Medina, R.P. 2019. Modified Adaptive Synthetic SMOTE to Improve Classification Performance in Imbalanced Datasets. 2019 IEEE 6th International Conference on Engineering Technologies and Applied Sciences (ICETAS), pp. 1-5. DOI= https://doi.org/10.1109/ICETAS48360.2019.9117287.

Han, J., Kamber, M., & Pei, J. 2011. Data Mining: Concepts and Techniques 3rd edition. Elsevier Science. URI= http://myweb.sabanciuniv.edu/rdehkharghani/files/2016/02/The-Morgan-Kaufmann-Series-in-Data-Management-Systems-Jiawei-Han-Micheline-Kamber-Jian-Pei-Data-Mining.-Concepts-and-Techniques-3rd-Edition-Morgan-Kaufmann-2011.pdf

Johnson, J. M., & Khoshgoftaar, T.M. 2019. Survey on Deep Learning with Class Imbalance. Journal Of Big Data, 6:27. DOI= https://doi.org/10.1186/s40537-019-0192-5

Kho, J. R., & Vea, L. A. 2017. Credit card fraud detection based on transaction behavior. TENCON 2017 - 2017 IEEE Region 10 Conference. DOI= https://doi.org/10.1109/TENCON.2017.8228165

Makki, S., Assaghir, Z., Taher, Y., Haque, R., Hacid, M. S., & Zeineddine, H. 2019. An experimental study with imbalanced classification approaches for credit card fraud detection. IEEE Access, 7, 93010-93022. DOI= https://doi.org/10.1109/ACCESS.2019.2927266

Sklearn.feature_selection.f_regression. 2020. sckit.(n. d). URI= https://scikit-learn.org/stable/modules/generated/sklearn.feature_selection.f_regression.html

Sklearn.feature_selection.selectKBest. 2020.scikit. (n.d). URI= https://scikit-learn.org/stable/modules/generated/sklearn.feature_selection.SelectKBest.html

Sklearn.pipeline.Pipeline. 2020. scikit. (n. d). URI= https://scikit-learn.org/stable/modules/generated/sklearn.pipeline.Pipeline.htmls


Refbacks

  • There are currently no refbacks.


Jurnal telah terindeks oleh :