TY - GEN
T1 - Exploring Advancements in Multilingual OCR Systems for Enhanced Document Analysis and Text Recognition
AU - Patel, Kinjal
AU - Parashar, Deepak
AU - Bahadure, Nilesh
AU - Shah, Bhoomi
AU - Kumar, Rohit
AU - Patni, Jagdish Chandra
N1 - Publisher Copyright: © 2025 IEEE.
PY - 2025
Y1 - 2025
N2 - Optical Character Recognition (OCR) systems use robust software to search for words in scanned multilingual Indian documents. Manually searching such documents is tedious and time-consuming. These documents suffer from improper layout and low print quality, and they contain intermixed text (machine-printed and handwritten). OCR is used to detect text in images; if the text is not clearly visible, the system detects the actual text and outputs it in visible form. The system improves text recognition accuracy and takes less time to identify the original text. It uses several methods to perform these tasks, including a Convolutional Neural Network (CNN), Byte Pair Encoding (BPE), and a language model (LM). Experimental results demonstrate significant advancement in text recognition performance and scalability. The system offers a comprehensive solution for multilingual OCR tasks.
AB - Optical Character Recognition (OCR) systems use robust software to search for words in scanned multilingual Indian documents. Manually searching such documents is tedious and time-consuming. These documents suffer from improper layout and low print quality, and they contain intermixed text (machine-printed and handwritten). OCR is used to detect text in images; if the text is not clearly visible, the system detects the actual text and outputs it in visible form. The system improves text recognition accuracy and takes less time to identify the original text. It uses several methods to perform these tasks, including a Convolutional Neural Network (CNN), Byte Pair Encoding (BPE), and a language model (LM). Experimental results demonstrate significant advancement in text recognition performance and scalability. The system offers a comprehensive solution for multilingual OCR tasks.
UR - https://www.scopus.com/pages/publications/105018461645
UR - https://www.scopus.com/pages/publications/105018461645#tab=citedBy
U2 - 10.1109/IC2E365635.2025.11167177
DO - 10.1109/IC2E365635.2025.11167177
M3 - Conference contribution
AN - SCOPUS:105018461645
T3 - 2025 IEEE International Conference on Computer, Electronics, Electrical Engineering and their Applications, IC2E3 2025
BT - 2025 IEEE International Conference on Computer, Electronics, Electrical Engineering and their Applications, IC2E3 2025
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2025 IEEE International Conference on Computer, Electronics, Electrical Engineering and their Applications, IC2E3 2025
Y2 - 15 May 2025 through 16 May 2025
ER -