Feature Selection pada Phishing Detection dengan Menggunakan Parallel Genetic Algorithm dan Ensemble Learning

Alles Sandro Oktavio Gandadireja(1*), Henry Novianus Palit(2), Alvin Nathaniel Tjondrowiguno(3),


(1) Program Studi Informatika
(2) Program Studi Informatika
(3) Program Studi Informatika
(*) Corresponding Author

Abstract


Phishing sites could become a threat, which retrieves personal information without the user knowing this action. Every site has numerous records, which will be converted to features. Not all features extracted are relevant. Feature selection becomes the main topic of this case. This research uses Genetic Algorithm, using Ensemble Learning as fitness function. This process requires a lot of time, parallelization then used to improve the execution time of the system. The results show that with feature selection, an improvement could be obtained. Parallelization also helps improving execution time up to 2 times faster. Using this system, it is possible to improve the effectiveness of phishing detection.


Keywords


Parallelization; Ensemble Learning; Genetic Algorithm; Feature Selection; Phishing

Full Text:

PDF

References


Ali, W. and Ahmed, A.A. 2019. Hybrid intelligent phishing website prediction using deep neural networks with genetic algorithm-based feature selection and weighting. IET Information Security. 13, 6, 659–669. DOI:https://doi.org/10.1049/iet-ifs.2019.0006.

Baykara, M. and Gürel, Z.Z. 2018. Detection of phishing attacks. 6th International Symposium on Digital Forensic and Security, ISDFS 2018 - Proceeding. 2018-Janua, 1–5. DOI:https://doi.org/10.1109/ISDFS.2018.8355389.

Cantú-Paz, E. 1999. A survey of parallel distributed genetic algorithms. Complexity. 4, 4, 31–52. DOI:https://doi.org/10.1002/(SICI)1099-0526(199903/04)4:43.3.CO;2-W.

Chawla, M. and Singh Chouhan, S. 2014. A Survey of Phishing Attack Techniques. International Journal of Computer Applications. 93, 3, 32–35. DOI:https://doi.org/10.5120/16197-5460.

Chen, Z. et al. 2016. A parallel genetic algorithm based feature selection and parameter optimization for support vector machine. Scientific Programming. 2016. DOI:https://doi.org/10.1155/2016/2739621.

Dhamija, R. and Tygar, J.D. 2005. The battle against phishing: Dynamic security skins. ACM International Conference Proceeding Series. 93, 77–88. DOI:https://doi.org/10.1145/1073001.1073009.

Haldurai, L. et al. 2016. A Study on Genetic Algorithm and its Applications. International Journal of Computer and Engineering. 4, 10, 139–143. DOI:https://doi.org/10.26438/ijcse/v7i4.184190.

Phishing Websites Data Set: 2015. https://archive.ics.uci.edu/ml/datasets/Phishing+Websites. Accessed: 2020-06-12.

Ubing, A.A. et al. 2019. Phishing website detection: An improved accuracy through feature selection and ensemble learning. International Journal of Advanced Computer Science and Applications. 10, 1, 252–257. DOI:https://doi.org/10.14569/IJACSA.2019.0100133.

Vayansky, I. and Kumar, S. 2018. Phishing – challenges and solutions. Computer Fraud and Security. 2018, 1, 15–20. DOI:https://doi.org/10.1016/S1361-3723(18)30007-1.

Zhou, Z.-H. 2012. Ensemble Methods: Foundations and Algorithms. Chapman & Hall/CRC Press.


Refbacks

  • There are currently no refbacks.


Jurnal telah terindeks oleh :