A deep learning approach for recognizing ancient Tamil scripts from historical artifacts
A. Umamageswari1, S. Deepa2, L. Sherin Beevi3 and A. Sangari4
Assistant Professor, Department of Computer Science and Engineering,SRM Institute of Science and Technology, Ramapuram, Chennai,India2
Assistant Professor, Department of Computer Science and Engineering,R.M.D Engineering College, Kavaraipettai, Tamilnadu,India3
Associate Professor, Department of Electrical and Electronic Engineering,Rajalakshmi Engineering College, Thandalam, Tamilnadu,India4
Corresponding Author : A. Umamageswari
Recieved : 21-February-2024; Revised : 28-April-2025; Accepted : 12-May-2025
Abstract
The preservation and interpretation of ancient scripts are essential for uncovering the rich cultural heritage and historical knowledge of early civilizations. Tamil, one of the world’s oldest languages, contains a vast repository of information preserved in inscriptions, manuscripts, and other historical artifacts. This research proposes a novel methodology for the recognition and deciphering of ancient Tamil words from historical documents and artifacts using image processing techniques and deep learning (DL) algorithms. The proposed framework consists of three main stages: pre-processing, feature extraction, and DL-based recognition. A region-based convolutional neural network (RCNN) architecture is employed to automatically learn and identify the intricate patterns and structural elements of ancient Tamil characters. To enhance feature extraction for irregularly shaped characters, this work introduces Adaptive region of interest (ROI) pooling, which dynamically adjusts to variations in stroke patterns and inscription styles, thereby improving recognition accuracy. The RCNN is trained on a large annotated dataset of ancient Tamil word images, with labels verified by domain experts to ensure data accuracy and reliability. Extensive experiments were conducted on a diverse dataset comprising stone inscriptions, palm leaf manuscripts, and clay tablets. The proposed approach achieved a high recognition accuracy of 98.6%, demonstrating robust performance even under challenging conditions such as stylistic variations, surface degradation, and image noise. This method significantly contributes to the preservation of cultural heritage by enabling the digitization and accessibility of historical texts and inscriptions. By safeguarding valuable linguistic and cultural knowledge, the proposed system ensures its availability for future scholarly research and education.
Keywords
Ancient Tamil, Character recognition, Segmentation, RCNN, Deep learning, Computer vision.
Cite this article
Umamageswari A, Deepa S, Beevi LS, Sangari A. A deep learning approach for recognizing ancient Tamil scripts from historical artifacts. International Journal of Advanced Technology and Engineering Exploration. 2025;12(126):819-832. DOI : 10.19101/IJATEE.2024.111100193
