Perbandingan Performa Tools Web Scraping pada Website dengan Data Statis dan Dinamis

Michael Levi, Henry Novianus Palit, Silvia Rostianingsih


In scraping a website, the main concern is the type of website whether it is a static or dynamic website, and also the data structure of the website. With different website characteristics and diverse web scraping tools, it will make users quite difficult in choosing tools that suit their needs. The purpose of this research is to compare web scraping tools from different website characteristics, and to provide recommendations for web scraping tools for future research by knowing the right tools in handling each website's characteristics. Based on the results of tests that have been done, it can be concluded which tools are more effective and efficient in certain conditions.


Web Scraping; CURL; Scrapy; Cheerio; Headless Browser; Dynamic Web Content

Full Text:



Ambre, A., Gaikwad, P., Pawar, K., & Patil, V. 2019. Web

and Android Application for Comparison of E-Commerce

Products. International Journal of Advanced Engineering,

Management and Science (IJAEMS) [Vol-5, Issue-4, Apr2019], 266-268. URI=



curl. command line tool and library. URI=

Draxl, V. 2018. BACHELOR PAPER Web Scraping Data

Extraction from websitesURI=


Irawan, B., Palit, H. N., & Andjarwirawan, J. 2018. Aplikasi

Android untuk Mencari Harga Tiket Pesawat Termurah dari

Beberapa Situs Travel di Indonesia. Jurnal Infra VOL 7, NO

(2019), 49-54. URI=

json. Introducing JSON. URI=

MDN Web Docs. 2020. Document Object Model (DOM).


Mitchell, R. 2015. Web Scraping with Python. Sebastopol:

O'Reilly Media, Inc.

MuleSoft. What is an API? (Application Programming

Interface). URI=

Saurkar, A. V., Pathare, K. G., & Gode, S. A. 2018. An

Overview on Web Scraping Techniques and Tools.

International Journal on Future Revolution in Computer

Science & Communication Engineering Volume: 4 Issue: 4,

-367. URI=


Selenium. Selenium Projects.URI=

TechTarget. 2005. XPath. URI=

what-is-web-scraping. 2019. URI=


  • There are currently no refbacks.

Jurnal telah terindeks oleh :