Artificial intelligence and machine learning
Research Article
Recognition of digital sequences using convolutional neural networks
Igor Victorovich Vinokurov
Financial University under the Government of the Russian Federation, Moscow, Russia | |
igvvinokurov@fa.ru |
Abstract. The relevance of identifying tabular information and recognizing its contents for processing scanned documents is shown. The formation of a data set for training, validation and testing of a deep learning neural network (DNN) YOLOv5s for the detection of simple tables is described. The effectiveness of using this DNN when working with scanned documents is shown. Using the Keras Functional API, a convolutional neural network (CNN) was formed to recognize the main elements of tabular information — numbers, basic punctuation marks and Cyrillic letters. The results of a study of the work of this CNN are given. The implementation of the identification and recognition of tabular information on scanned documents in the developed IS updating information in databases for the Unified State Register of Real Estate system is described. (Linked article texts in Russian and in English).
Keywords: Convolutional Neural Networks, Deep Learning Neural Networks, CNN, DNN, YOLOv5s, Keras, Python
MSC-2020 68T20; 68T07, 68T45For citation: Igor V. Vinokurov. Recognition of digital sequences using convolutional neural networks. Program Systems: Theory and Applications, 2023, 14:3, pp. 3–36. (In Russ., in Engl.). https://psta.psiras.ru/2023/3_3-36.
Full text of article (PDF): https://psta.psiras.ru/read/psta2023_3_3-36.pdf.
The article was submitted 14.04.2023; approved after reviewing 04.07.2023; accepted for publication 04.07.2023; published online 13.08.2023.