Conference Paper
2025

Intelligent Recognition of Bangla Handwritten Digits

Authors
D.M. Asadujjaman (Computer Science and Engineering)
Abstract
Optical Character Recognition (OCR) devices are essential for digitizing printed and handwritten materials because they transform text into machine-readable formats. Although OCR technologies have seen remarkable advancements in lan- guages such as English, Tamil, and the Bangla script continues to pose significant challenges due to its structural intricacy and the limited availability of high-quality datasets and advanced recog- nition models. Addressing these challenges, this study evaluates the performance of deep Convolutional Neural Network (CNN) architectures namely InceptionV3, ResNet50, and DenseNet121 for Bangla handwritten numeral recognition using the NumtaDB dataset, which is thorough and varied benchmark dataset es- pecially selected for Bangla handwritten digits. We also used an ensemble learning approach to further improve recognition performance by combining the outputs of separate models and utilizing their complementing advantages to increase generaliza- tion and accuracy. Among the models evaluated, InceptionV3 achieved the highest individual accuracy of 99.86% followed closely by ResNet50 and DenseNet121, which obtained 99.73% and 99.69% respectively. The ensemble approach outperformed all standalone models, achieving a superior test accuracy of 99.97%. These results underscore the effectiveness of CNN-based architecture for Bangla numeral recognition and demonstrate the considerable potential of ensemble learning in enhancing the accuracy, robustness, and reliability of OCR systems designed for complex scripts like Bangla.
Publication Details
Published In:
2025 IEEE International Conference on Quantum Photonics, Artificial Intelligence, and Networking(QPAIN)
Publication Year:
2025
Publication Date:
July 2025
Type:
Conference Paper
Total Authors:
1