Penggunaan Convolutional Recurrent Neural Network dan RLSA untuk Mengambil Data pada Akta Kelahiran
Keywords:
Alih kode, Campur Kode, Konflik, Interpretasi, SosiolinguistikAbstract
Birth certificate is one of the documents that is mandatory for every citizen to have. This document records the information upon someone’s birth and an official acknowledgement of a country on that person’s existence. Birth certificate is a legal document and is an acceptable form of identification for other documents such as a diploma. As one of Indonesia’s learning institute, Petra Christian University needs its students birth certificate as a solid proof upon their identification and as a base to publish a diploma. The extraction of information is being done manually but with the rapid development of technology, it is now possible to obtain the information within a birth certificate automatically. Research about information extraction on Birth Certificate hasn’t been done yet before, but similar research with the object of Identity Card has been done using Template Matching with the accuracy of 17-39%. This research uses Run Length Smoothing Algorithm and Convolutional Recurrent Neural Network as its primary methods. Run Length Smoothing Algorithm is used to segment words in a birth certificate image. The word in an image will then be translated into a text in string form by Convolutional Recurrent Neural Network. To know which words that contain the wanted information, the sequence of the words and specific keywords are being used. The result of this research will be information upon the full name, birth date, place of birth and the gender of the birth certificate holder. The result from tests that were done is an accuracy of 12.936% upon finding the wanted information and 60.086% for words translation from image to string by CRNN.References
[1] Albelwi, S., & Mahmood, A. 2017. A Framework for Designing the Architectures of Deep Convolutional Neural Network. Entropy, 5.
[2] Bishop, C. 1998. Neural Networks and Pattern Recognition. London: Academic Press.
[3] Fausett, L. 1994. Fundamentals of Neural Networks. New Jersey: Prentice-Hall, Inc.
[4] Goodfellow, I., Bengio, Y., & Courville, A. 2016. Deep Learning. Cambridge: MIT Press.
[5] Ryan, M., & Hanafiah, N. 2015. An Examination of Character Recognation on ID Card using Template Matching Approach. Procedia Computer Science, 520-529.
[6] Shafait, F., Keysers, D., & Breuel, T. M. 2006. Performance Comparison of Six Algorithms for Page Segmentation. Dalam H. Banke, & A. L. Spitz, Document Anaysis Systems VII (hal. 368-379). New Zealand: Springer International Publishing.
[7] Shi, B., Bai, X., & Yao, C. 2016. An End-to-End Trainable Neural Network for Image-based Sequence Recognation and Its Application to Scene Text Recognation. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2298-2304.
[8] Skymind. 2018, Mei 31. A Beginner's Guide to Recurrent Network and LSTMs. Retrieved from Deeplearning4j: https://deeplearning4j.org/lstm.html#recurrent
[9] Sukabumi, D. K. 2018, September 18. Akta Kelahiran - Dukcapil Kabupaten Sukabumi. Retrieved from Dukcapilkabsukabumi: https://www.dukcapilkabsukabumi.org/pelayanan/akta-kelahiran/
[10] Wong, K. Y., Casey, R. G., & Wahl, F. M. 1982. Document Analysis System. IBM Journal of Research and Development, 26(6), 647-656.