Hybrid non-maximum suppression-based Telugu scene text detection
Vishnuvardhan Atmakuri1, Vayunandan Kumar Konakalla2, M. Radha3, Sape Chittibabulu4, Nalluri Brahma Naidu5 and G. Satyanarayana6
Senior Software Engineer,Fidelity Investments, Dallas,Texas-75201,USA2
Assistant Professor, Department of Information Technology,Vallurupalli Nageswara Rao, Vignana Jyothi Institute of Engineering and Technology, Bachupally, Hyderabad,Telangana- 500118,India3
Assistant Professor, Department of Computer Science and Engineering,Aditya University, Surampalem,Andhra Pradesh-533437,India4
Assistant Professor, Department of Computer Science and Engineering,Vignan's Foundation for Science, Technology & Research, Guntur -Tenali Rd, Vadlamudi,Andhra Pradesh-522213,India5
Associate Professor, Department of Computer Science and Engineering,Vasireddy Venkatadri International Technological University Guntur,Andhra Pradesh-522508,India6
Corresponding Author : Vishnuvardhan Atmakuri
Recieved : 22-April-2025; Revised : 23-May-2026; Accepted : 26-May-2026
Abstract
Detecting text in natural scenes is crucial for bridging visual and linguistic information and supports applications such as multilingual information retrieval, assistive technologies, and scene understanding. Telugu script presents unique detection challenges due to its rounded glyph structures, complex ligatures, and frequent background interference. This paper proposes an enhanced Telugu scene text detection framework based on the efficient and accurate scene text detector (EAST), augmented with a hybrid non-maximum suppression (NMS) strategy that integrates Soft-NMS and adaptive NMS. The proposed hybrid NMS improves post-processing reliability by suppressing redundant detections while preserving overlapping true positives (TPs), resulting in a more balanced precision–recall (PR) performance. The proposed method is evaluated on the Indian Institute of Information Technology-Indic Language Scene Text (IIIT-ILST) dataset, achieving a precision of 0.75, recall of 0.80, and an F1-score of 0.77, thereby outperforming existing Telugu text detection approaches. Further analysis using PR curves at intersection over union (IoU) thresholds of 0.5 and 0.75, along with mean average precision (mAP) evaluation, confirms the robustness of the proposed model under varying localization constraints. Overall, the hybrid NMS-enhanced EAST method establishes a strong benchmark for Telugu scene text detection and provides a practical solution for language-specific scene understanding applications.
Keywords
Telugu scene text detection, Efficient and accurate scene text detector (EAST), Hybrid non-maximum suppression, Scene understanding, Indic language processing, Optical character recognition
Cite this article
Atmakuri V, Konakalla VK, Radha M, Chittibabulu S, Naidu NB, Satyanarayana G. Hybrid non-maximum suppression-based Telugu scene text detection. International Journal of Advanced Technology and Engineering Exploration. 2026;13(138):749-768. DOI : 10.19101/IJATEE.2025.121220525
