Penerapan Long-Short Term Memory dengan Word2Vec Model untuk Mendeteksi Hoax dan Clickbait News pada Berita Online di Indonesia

Soni Marko Nathanniel Tannady, Djoni Haryadi Setiabudi, Alvin Nathaniel Tjondrowiguno

Abstract


News has become information that is routinely consumed every day and can be accessed easily as technology develops. However, the easy access of readers to news also opens up space for some people to spread clickbait or hoax news to attract readers' attention for personal gain. To overcome this, one of the efforts that can be realized is with a detection model for clickbait and hoax news with machine learning methods. The method used is Long-Short Term Memory. However, with several additional applications such as adding a dropout layer, implementing a callback function and using k-fold cross validation to overcome the problem of overfitting the model which often occurs in related studies. The built model will be tested in a webpage application where users can detect news labels. On the best testing result, testing accuracy for the clickbait detection model are 72.93% and the hoax detection model are 79.17%.

Keywords


Machine Learning; Natural Language Processing; Long-Short Term Memory; Clickbait; Hoax; Clickbait

Full Text:

PDF

References


Adilah, R.Y. 2021, March 4. Setahun Pandemi,

Kemenkominfo Temukan 2.624 Hoaks Tentang Covid-19.

Merdeka.com. Retrieved from Merdeka.com:

https://www.merdeka.com/peristiwa/setahun-pandemi-keme

nkominfo-temukan-2624-hoaks-tentang-covid-19.html.

Arika, Y. 2021, April 24. Setahun Pandemi, Muncul Tren

Positif untuk Masa Depan Media. Kompas.id. Retrieved

from Kompas.id:

https://www.kompas.id/baca/dikbud/2021/04/24/setahun-pan

demi-muncul-tren-positif-untuk-masa-depan-media/.

Goyal, P., Pandey, S., & Jain, K. 2018. Deep learning for

natural language processing creating neural networks with

python. Apress. doi:

https://doi.org/10.1007/978-1-4842-3685-7.

Hadiyat, Y.D. 2019. Clickbait on Indonesia Online Media.

Jurnal Pekommas, Vol. 4 No. 1, April 2019: 1-10. doi:

30818/jpkm.2019.2040101.

Houdt, G.V., et al. 2020. A Review on the Long Short-Term

Memory Model. Artificial Intelligence Review 53(1). doi:

1007/s10462-020-09838-1.

Lararenjana, E. 2020, May 13. Mengenal Arti Hoax Atau

Berita Bohong, Ketahui Jenis dan Ciri-Cirinya.

Merdeka.com. Retrieved from Merdeka.com:

https://www.merdeka.com/jatim/mengenal-arti-hoax-atau-be

rita-bohong-dan-cara-tepat-menyikapinya-kln.html.

Mormol, P. 2019. ’I Urge You To See This...’. Clickbait as

One of the Dominant Features of Contemporary Online

Headlines. Social Communication. 5(2): 1-10. doi:

2478/sc-2019-0004.

NLP APIs. gensim.models.Word2Vec.accuracy. Retrieved

from NLP APIs:

https://tedboy.github.io/nlps/generated/generated/gensim.mo

dels.Word2Vec.accuracy.html.

Oshikawa, R. et al. 2020. A Survey on Natural Language

Processing for Fake News Detection. Proceedings of the

th Language Resources and Evaluation Conference

(LREC 2020) pp. 6086-6093. doi:

https://doi.org/10.48550/arXiv.1811.00770.

Pujahari, A & Sisodia, D.S. 2020, March 29. Clickbait

Detection using Multiple Categorization Techniques. Journal

of Information Science arXiv:2003.12961. doi:

https://doi.org/10.1177%2F0165551519871822.

Rahutomo, F., et al. 2018, “INDONESIAN HOAX NEWS

DETECTION DATASET”. Mendeley Data, V1. doi:

17632/p3hfgr5j3m.1.

Siregar, B. et al. 2021. Identification of Indonesian clickbait

news headlines with long short-term memory recurrent

neural network algorithm. J. Phys.: Conf. Ser. 1882 012129.

doi: 10.1088/1742-6596/1882/1/012129.

William, A. & Sari, Y. 2020. “CLICK-ID: A Novel Dataset

for Indonesian Clickbait Headlines”. Mendeley Data, V1.

doi: 10.17632/k42j7x2kpn.1.

Zhou, J., et al. (2021). Evaluating the Quality of Machine

Learning Explanations: A Survey on Methods and Metrics.

Electronics 10(5):593. doi: 10.3390/electronics10050593.


Refbacks

  • There are currently no refbacks.


Jurnal telah terindeks oleh :