Perbandingan Performa Tools Web Scraping pada Website dengan Data Statis dan Dinamis

Authors

  • Michael Levi Program Studi Informatika
  • Henry Novianus Palit Program Studi Informatika
  • Silvia Rostianingsih Program Studi Informatika

Abstract

In scraping a website, the main concern is the type of website whether it is a static or dynamic website, and also the data structure of the website. With different website characteristics and diverse web scraping tools, it will make users quite difficult in choosing tools that suit their needs. The purpose of this research is to compare web scraping tools from different website characteristics, and to provide recommendations for web scraping tools for future research by knowing the right tools in handling each website's characteristics. Based on the results of tests that have been done, it can be concluded which tools are more effective and efficient in certain conditions.

References

[1] Ambre, A., Gaikwad, P., Pawar, K., & Patil, V. 2019. Web

and Android Application for Comparison of E-Commerce

Products. International Journal of Advanced Engineering,

Management and Science (IJAEMS) [Vol-5, Issue-4, Apr2019], 266-268. URI=

http://d.researchbib.com/f/3jnJcuMJ1mYzAioF91pTkiLJEsn

J1uM2ImY2ymp3IyK2McoTImYmHgFHcOEH1GYHSDHv

0lZQR5YGVgI2IvLJ5xYaOxMt.pdf.

[2] curl. command line tool and library. URI=

https://curl.haxx.se/.

[3] Draxl, V. 2018. BACHELOR PAPER Web Scraping Data

Extraction from websitesURI=

https://www.academia.edu/35901535/BACHELOR_PAPER

_Web_Scraping_Data_Extraction_from_websites.

[4] Irawan, B., Palit, H. N., & Andjarwirawan, J. 2018. Aplikasi

Android untuk Mencari Harga Tiket Pesawat Termurah dari

Beberapa Situs Travel di Indonesia. Jurnal Infra VOL 7, NO

2 (2019), 49-54. URI=

http://publication.petra.ac.id/index.php/teknikinformatika/article/view/8752/7900.

[5] json. Introducing JSON. URI= https://www.json.org/jsonen.html.

[6] MDN Web Docs. 2020. Document Object Model (DOM).

URI= https://developer.mozilla.org/enUS/docs/Web/API/Document_Object_Model.

[7] Mitchell, R. 2015. Web Scraping with Python. Sebastopol:

O'Reilly Media, Inc.

[8] MuleSoft. What is an API? (Application Programming

Interface). URI=

https://www.mulesoft.com/resources/api/what-is-an-api.

[9] Saurkar, A. V., Pathare, K. G., & Gode, S. A. 2018. An

Overview on Web Scraping Techniques and Tools.

International Journal on Future Revolution in Computer

Science & Communication Engineering Volume: 4 Issue: 4,

363-367. URI=

http://www.ijfrcsce.org/download/browse/Volume_4/April_1

8_Volume_4_Issue_4/1524638955_25-04-2018.pdf.

[10] Selenium. Selenium Projects.URI=

https://www.selenium.dev/projects/.

[11] TechTarget. 2005. XPath. URI=

https://whatis.techtarget.com/definition/XPath.

[12] what-is-web-scraping. 2019. URI=

https://hirinfotech.com/what-is-web-scraping/.

Downloads

Published

2020-10-03

Issue

Section

Articles