Implementasi Hadoop: Studi Kasus Pengolahan Data Peminjaman Perpustakaan Universitas Kristen Petra

Kenny Basuki(1*), Henry Novianus Palit(2), Lily Puspa Dewi(3),


(1) Program Studi Teknik Informatika
(2) Program Studi Teknik Informatika
(3) Program Studi Teknik Informatika
(*) Corresponding Author

Abstract


than SQL is the general idea of this Hadoop implementation. The advancement of technology generates growing amount of data and demands a new method to process the big data. The performance of this hadoop implementation was also compared with that of SQL to prove hadoop’s novelty in processing big data. Moreover different hadoop’s implementations – such as various number of nodes, use of a combiner, and use of different block sizes – were evaluated.

Hadoop was implemented for five queries (or problems) in processing the library circulation data. Those five problems are finding the numbers of borrowing transactions categorized by the audio-video types, collection types, titles, locations, and users’ departments.

Some conclusions can be drawn based on the hadoop mapreduce implementation. Hadoop’s performance tops SQL’s when large data are being processed. The more the number of computer nodes, the faster the mapreduce application is to complete its execution. Use of a combiner can speed up the application’s execution. The arrangement with full data blocks can give better execution time than that with non-full data blocks does. In this hadoop implementation, the execution time using the block size of 128 MB is smaller than that of 28 MB and 512 MB.


Keywords


Hadoop, Big Data, Mapreduce

Full Text:

PDF

References


Coulouris, G., Dollimore, J., Kindberg, T., & Blair, G. 2012. Distributed Systems: Concepts and Design 5th edition. Pearson Education.

Holmes, A. 2012. Hadoop in Practice. New York: Manning Publications Co.

Perpustakaan Universitas Kristen Petra. Sejarah Perpustakaan. URI=http://library.petra.ac.id/index.php?r=site/sejarah_perpustakaan

Potts, A., & Friedel, J. D. 1996. Java Programming Language Handbook. Scottsdale: Keith Weiskamp.

Rouse, M. 2013. Big Data. URI=http://whatis.techtarget.com/definition/3Vs

Taggart, A. 2011. How Map and Reduce operations are actually carried out. URI=http://wiki.apache.org/hadoop/HadoopMapReduce

White, T. 2012. Hadoop: The Definitive Guide (3rd ed.). O’Reilly Media, Inc


Refbacks

  • There are currently no refbacks.


Jurnal telah terindeks oleh :