Prediksi Retensi Customer Berdasarkan Support Vector Machine dengan Preprocessing Menggunakan Hadoop

Giovanni Gabriela Gunadi, Silvia Rostianingsih, Andre Gunawan


Nowadays, many companies often offer a discount to customer. The goal is that the company can attract new customers. However, the offer given cannot be done randomly. Offers must be directed to the right customers so the company doesn’t need to spend much money. This study will analyze to determine whether there is an influence of the association rule on customer retention. The data used is the Acquire Valued Shopper Challenge taken from the Kaggle website. As the program used is java with the Hadoop framework that uses the association rule method. Tests carried out twice using different data. The objective to be achieved from testing is that the model created can predict customers who will retain if they get an offer. With the tests carried out, the best results to answer the objective is to use data in the second experiment with the ratio of training data to testing data is 7: 3 and gamma 0.1 with accuracy obtained is 58.21%.


Customer Retention; Hadoop; Support Vector Machine

Full Text:



Apache. 2015. Apache Hadoop 2.7.1 – MapReduce Tutorial. URI=

Apache. 2015. Apache Hadoop 2.7.1 – HDFS Architecture. URI=

Awad, M., & Khanna, R. 2015. Efficient Learning Machines. Springer Science.

Bhosale, H.S. & Gadekar, P. D. 2014. A Review Paper on Big Data and Hadoop. International Journal of Scientific and Research Publications, Volume 4, Issue 10, 1-6.

Kowalczyk, A. 2017. Support Vector Machines Succinctly. Syncfusion. [6] Permana, Y. 2016. Mengenal Big Data. URI=

SAS. 2019. Big Data: What it is and why it matters. URI= [8] Turkington, G. 2013. Hadoop Beginner’s Guide. Packt Publishing Ldt. [9] Maitrey, S. & Jha, C.K. 2015. MapReduce:Simplified Data Analysis of Big Data. Procedia Computer Science 57, 563 – 571.


  • There are currently no refbacks.

Jurnal telah terindeks oleh :