Handwritten English text and digit recognition using ResNet284
Manoj Kumar Sharma1
Corresponding Author : Manoj Kumar Sharma
Recieved : 08-June-2024; Revised : 26-October-2025; Accepted : 23-November-2025
Abstract
The recording of knowledge through writing has enabled future generations to understand their cultural legacy and build a better future. However, ancient records were often created on plant leaves or other fragile materials. Although written documentation of human civilization dates back nearly 5,000 years, the quality of these handwritten manuscripts deteriorates over time due to ink fading and the biodegradation of materials such as paper. As a result, the preservation of historical documents remains a significant challenge. For several decades, computer scientists have been developing algorithms to recognize and digitize handwritten content. In this work, a residual neural network (ResNet284) is proposed for the recognition of handwritten English text and digits. Experiments were conducted on six benchmark datasets: Institute of Automated Machine Analysis Dataset (IAM) Handwritten dataset (115,320 words, segmented into approximately 576,600 characters), Kaggle English Handwritten (370,000 samples), Kaggle Digits (3,410 samples), Modified National Institute of Standards and Technology (MNIST) Digits (70,000 samples), Extended MNIST (EMNIST), treated as a unified dataset with subsets including Digits (350,000 samples comprising Digits-I with 280,000 and Digits-II with 70,000), EMNIST English Handwritten (ByClass: 814,255 characters; ByMerge: 814,255 characters; Balanced: 131,600 characters; and Letters: 145,600 characters), and the Centre of Excellence for Document Analysis and Recognition (CEDAR) dataset (100,000 samples). The proposed model achieved a test accuracy of 99.87% with an error rate of 0.13% on the combined digits and characters dataset comprising 2,768,780 samples. For the individual datasets, the model also demonstrated consistently high performance: 98.89% accuracy (1.11% error) on the IAM Handwritten dataset, 98.94% accuracy (1.06% error) on the Kaggle English Handwritten dataset, 98.91% accuracy (1.09% error) on the Kaggle Digits dataset, and 98.95% accuracy (1.05% error) on the MNIST Digits dataset. Similarly, the EMNIST English Handwritten dataset yielded 98.97% accuracy (1.03% error), while the CEDAR dataset resulted in 98.42% accuracy (1.58% error). Additionally, the combined digits dataset achieved 99.42% accuracy (0.58% error), and the combined characters dataset recorded 99.01% accuracy (0.99% error), further demonstrating the robustness and generalization capability of the proposed approach
Keywords
Handwritten text recognition, Residual neural network (ResNet284), Document digitization, Character classification, Digit recognition.
Cite this article
Sharma MK. Handwritten English text and digit recognition using ResNet284. International Journal of Advanced Technology and Engineering Exploration. 2025;12(132):1722-1748. DOI : 10.19101/IJATEE.2024.111101131
