Aplikasi Pendeteksi Unsur Hinaan dalam Komentar di Media Sosial Berbahasa Indonesia

Yohanes Adam Sastrodikoro Wiryoadikusmo(1*), Henry Novianus Palit(2), Justinus Andjarwirawan(3),


(1) Program Studi Teknik Informatika
(2) Program Studi Teknik Informatika
(3) Program Studi Teknik Informatika
(*) Corresponding Author

Abstract


Communication in social media has became a lifestyle of Indonesian people. Most of the kind of communication between users are commentary text. Often there are comments that contain elements of insults directed at others. Detection of insult elements in comment text in social media is needed.
The first step in the method is to normalize the text so that the comment text is more organized and standardized. The replacement of the insult verbal variation into the original insult word is also done. Then do the stemming process to take the basic word of each word. By performing feature extraction of the text, obtained every feature of the text is then predicted using the SVM method.
The application used in this paper successfully detected insulting elements in the text of comments from social media using the three methods of SVM classification, Logistic Regression, and Naive Bayes.


Keywords


texts, social media comments, insult elements

Full Text:

PDF

References


Domingos, P. dan Pazzani, M. 1997. On the optimality of the simple bayesian classifier under zero-one loss. Machine Learning , 29 (2-3), 103-130.

Gunn, S., Brown, M., dan Bossley, K. 1997. Network performance assessment for neuro- fuzzy data modelling. Advances in Intelligent Data Analysis Reasoning about Data (pp. 313-323). Berlin: Springer.

Hajeer, S.I., Ismail, R.M., Badr, N.L., dan Tolba, M.F. 2017. A New Stemming Algorithm for Efficient Information Retrieval Systems and Web Search Engines. In: Hassanien A., Mostafa Fouad M., Manaf A., Zamani M., Ahmad R., Kacprzyk J. (eds) Multimedia Forensics and Security. Intelligent Systems Reference Library, vol 115. Springer, Cham.

Manning, D. M., Raghavan, P., dan Schutze, H. 2008. Introduction to Information Retrieval. Cambridge, United Kingdom: Cambridge University Press.

Nugroho, A., Witarto, A., dan Handoko, D. 2003. Support Vector Machine Teori dan Aplikasinya dalam Bioinformatika. Kuliah Umum IlmuKomputer.Com. URI= http://asnugroho.net/papers/ikcsvm.pdf.

Osborne, J. W. 2015. Best Practices in Logistic Regression. Thousand Oaks: SAGE Publications.

Powers, D.M.W. 2011. Evaluation from precisions, recall, and F-measure to ROC, informedness, markedness and correlation. International Journal of Machine Learning Technology, vol 2, 37-63.

Provost, F., & Fawcett, T. 2013. Data Science for Business. Sebastopol: O’Reilly Media, Inc.

Sammut, C., dan Webb, G. I. 2017. Encyclopedia of Machine Learning and Data Mining. New York: Springer US.

Vapnik, V. 1995. The Nature of Statistical Learning Theory. New York: Springer


Refbacks

  • There are currently no refbacks.


Jurnal telah terindeks oleh :