a-review-on-handwritten-character-recognition-using-advanced-3kism6wv
a-review-on-handwritten-character-recognition-using-advanced-3kism6wv
Abstract
Due to its widespread use, handwriting recognition has drawn a lot of interest in the domains of pattern
recognition and machine learning. The application domain for optical character recognition (OCR) and
handwritten character recognition (HCR) is specific. For character recognition in a system for
handwriting recognition, several strategies have been proposed. Despite this, a substantial number of
studies and papers outline the methods for transforming the text of a paper document into a machine-
readable format. Character recognition (CR) technology may be crucial in the near future in order to
process and digitize existing paper documents in order to establish a paperless environment. This essay
offers a thorough analysis of the handwritten character recognition field.
1. Introduction
The subject of pattern recognition has many practical applications, but character recognition is one of its
most fundamental and difficult subfields. Due to the fact that it is a natural way for humans and
computers to interact, it has been a highly active area of research since the early days of computer
science. More specifically, character recognition is the process of identifying and extracting characters
from an input image and converting them into an editable machine-readable format, such as ASCII.
Handwriting recognition system is a method that enables a computer to read characters and other
symbols that are handwritten in human handwriting. Online and offline handwriting recognition are the
two categories used to categorize handwriting recognition [6]. Offline handwriting recognition is the
process of scanning handwriting and having the computer interpret it. Online handwriting recognition is
the process of identifying handwriting when it is being written on a touch pad with a stylus pen.
Character recognition systems are divided into two primary groups from the classifier's perspective:
segmentation free (global) and segmentation based (analytic). The holistic method to character
recognition without segmenting it into subunits or characters is often referred to as segmentation free.
Algorithms for deep learning and machine learning have been extensively used in previous literature.
Feature extraction is also incredibly important. For this objective, some common strategies include
graph-based features, histograms, mathematical transforms, and moment-based features. Pre-processing,
segmentation, representation, training, identification, and post-processing are a few of the essential
phases in handwritten character recognition. In terms of practical applications, numerous mobile apps
and web applications offer character recognition features to their users since end users continue to want
better services that are technically defined as accurate. Character recognition presents a number of
difficulties and obstacles, and our goal is to look at both existing and novel techniques in order to
resolve the issue at hand.
This study's contribution includes a comparison of various machine learning and deep learning methods
for recognizing handwritten characters based on the dataset and method employed. The following is how
the paper is organized: A detailed overview on the evolution of CR is provided in Section II.
Methodologies adopted in character recognition system are covered in Section III followed by the
discussion on various character recognition systems in Section IV. Conclusion and further work are
included in Section V.
feature extraction and classification. A feedback mechanism is employed in some approaches to update
the outcome of each stage, while others combine or remove some of the stages. The standard
methodology followed in CR system is seen in Fig. 1 followed by the detailed discussion.
While offline CR systems used optical scanners or cameras to collect data from documents, online
character recognition systems used digitizers that directly record the pattern based on the knowledge of
strokes, pen-up, pen-down and speed.
Pre-processing
Pre-processing is done to make the image better so that it can be utilized for additional processing [2], to
make the input data more consistent, and to make it better suited for the following stage of the
recognition system. This step involves a variety of techniques, including conversion from grayscale to
binary, noise removal, binarization, normalization, etc. The input data are subjected to a grayscale shift,
binary figuring, and subsequently an outcry reduction approach. After the grayscale and binary
conversion, the researcher segmented the data using edge detection under the assumption of the results
in [3]. When converting a grayscale image to a binary image, thresholding and Otsu's technique are
frequently utilized.
Segmentation
Segmentation is a technique used to divide an input text data picture into lines and individual characters.
The noise or outcast section of the data image moves. There are two types of segmentation: internal and
exterior. The sentences are divided into paragraphs, lines, and words using external segmentation. On
the other hand, internal segmentation is the division of the incoming text data into individual characters
[1]. There are various segmentation algorithms available. Histogram profiles and linked component
analysis are the fundamental techniques for line segmentation.
Feature Extraction
One of a recognition system's most crucial functions is image representation. The most straightforward
method is to input binary or grayscale images to a recognizer. In most recognition systems, a more
compact and characteristic representation is necessary in order to avoid the unnecessary complexity and
increase algorithm accuracy. Because of this, a set of traits that aid to distinguish each class from other
classes while being invariant to characteristic variations within the class are retrieved for each class [5].
The following is a breakdown of hundreds of document picture representation techniques into three main
categories.
It is a technique used to gather various, extremely relevant information about an object or a collection of
things so that, using the information gathered; we can twine the objects to discover new, untapped
material. The strength of the representation of the raw data is a characteristic. These are some of the
crucial feature extraction techniques that include zone-based, structural, mathematical, sliding window,
chain code histogram, gradient feature, and hybrid methods [4]. The 4-neighbourhood and the 8-
neighbourhood are the two main chain code pathways.
Classification
An undefined sample is assigned to the pre-determined class in the categorization. As the attributes are
extracted, the digits are classified and identified appropriately. The selection or approval process is
generally advantageous for the decision-making process because the new character fits the class or has a
similar appearance. This suggests that marking and assignment to marking occur during the
categorization stage. The effective extraction and selection of information's key points is a constant
requirement for classification information production. There are various order systems available, and
each one is primarily based on picture preparation and artificial reasoning techniques. Template
matching, Statistical techniques, structural techniques using neural networks, genetic approaches, and
fuzzy logic based on soft computing are a few examples of categorization methods based on image
processing.
In the handwritten character recognition systems (HCR), machine learning (ML) has used a variety of
techniques, including support vector machine, Naive Bayes, artificial neural networks (ANN), neuro-
fuzzy, decision trees, nearest neighbor algorithms, etc.
The human brain created the deep learning algorithm for using hierarchical level of artificial neural
networks to deal with machine learning processes. It has gained momentum as a result of various
hardware advancements and commutative researches on deep learning algorithms, including recurrent
neural network [18], convolutional neural network [19], Auto-encoder [21], deep neural network, deep
belief network [20], etc.
quantitative and structural information. For many patterns identification issues, NNs or HMMs combine
statistical and structural information quite successfully. Although they are somewhat robust to
deformation, their end purpose in the classification stage can result in significant non matching. In the
notion that an input plane is tested against a pattern restricted on and X-Y plane, template matching
methods deal with a character as a whole. In future, instead of just adding the data available, the design
of the training set should be approached methodically. Training sets should be large and contain random
examples, including ones with bad writing. To improve character outputs, deep learning may eventually
replace the handwritten character recognition system in future. The precision rate can be raised by using
several feature extraction techniques. Additionally, it should be noted that larger data sets tend to
perform better and provide the necessary precision.
6. References
1. Sing, A. Bist A wide scale survey on handwritten digit recognition using machine learning. Int. J.
Comput Sci. Eng. 124–134 (2019)
2. S.R. Patel, J. Jha, Handwritten character recognition using machine learning approach-a survey, in
International Conference on Electrical, Electronics, Signals, Communication and Optimization
(EESCO) (2015)
3. A. Tahir, A. Pervaiz, Hand written character recognitoin using SVM. Pacific Int. J. 39–43 (2020)
4. B.M.Vinijit, M.K. Bhojak, S. Kumar, G. Chalak, A review on hand-written character recognition me-
thods and techniques, In International Conference on Communication and Signal Processing (2020),
1224–1228
5. I. S. Oh, J. S. Lee, and C. Y. Suen, ―Analysis of class separation and combination of class-dependent
features for handwriting recognition‖, IEEE Trans. Pattern Anal. Machine Intell., vol. 21, pp. 1089–
1094, Oct. 1999.
6. Nafiz Arica and Fatos T. Yarman-Vural, ―‖An Overview of Character Recognition Focused on Off-
Line Handwriting‖, IEEE Transactions on Systems, Man, and Cybernetics—Part C: Applications and
Reviews, vol. 31, no. 2, pp.216-233, May 2001.
7. J. Mantas, ―An overview of character recognition methodologies", Pattern Recognition, vol. 19, no. 6,
pp. 425 - 430, 1986.
8. T. S. El-Sheikh and R. M. Guindi, ―Computer recognition of arabic cursive script,", Pattern Recogni-
tion, vol. 21, no. 4, pp. 293 - 302,1988.
9. C. Tappert, C. Suen, and T. Wakahara, ―The state of the art in online handwriting recognition", Pat-
tern Analysis and Machine Intelligence, IEEE Transactions on, vol. 12, pp. 787 -808, Aug. 1990.
10. Q. Tian, P. Zhang, T. Alexander, and Y. Kim, ―Survey: omnifont-printed character recognition,", in
Society of Photo-Optical Instrumentation Engineers (SPIE) Conference Series (K.-H. Tzou & T.
Koga, ed.), vol. 1606 of Society of Photo-Optical Instrumentation Engineers (SPIE) Conference Se-
ries, pp. 260-268, Nov. 1991.
11. Y. Ding, F. Kimura, Y. Miyake, and M. Shridhar, ―Evaluation and improvement of slant estimation
for handwritten words", in Document Analysis and Recognition, 1999. ICDAR '99. Proceedings of
the Fifth International Conference on, pp. 753 -756, Sept. 1999.
12. S. M. Lucas, E. Vidal, A. Amiri, S. Hanlon, and J.-C. Amengual, ―A comparison of syntactic and
statistical techniques for off-line ocr‖, in Proceedings of the Second International Colloquium on
Grammatical Inference and Applications, (London, UK), pp. 168-179, Springer-Verlag, 1994.
13. K.-F. Chan and D.-Y. Yeung, ―Recognizing on-line handwritten alphanumeric characters through
exible structural matching‖, Department of Computer Science, Volume 32, Issue 7 , Pages 1099–
1114, 1999.
14. S. Mori, C. Suen, and K. Yamamoto, ―Historical review of ocr research and development", Proceed-
ings of the IEEE, vol. 80, pp. 1029 - 1058, July 1992.
15. I. Bazzi, R. Schwartz, and J. Makhoul, ―An omnifont open-vocabulary ocr system for english and
arabic", Pattern Analysis and Machine Intelligence, IEEE Transactions on, vol. 21, pp. 495 -504,
June 1999.
16. J. Hu, S. G. Lim, and M. K. Brown, ―Writer independent on-line handwriting recognition using an
hmm approach", Pattern Recognition, vol. 33, no. 1, pp. 133 - 147, 2000.
17. R. Vaidya, D. Trivedi, S. Satra,M. Pimpale, Handwritten digit recognition using deep learning, in
2nd International Conference on Inventive Communication and Computational Technologies
(ICICCT) (2018), 772–775.
18. A. Sherstinsky, Fundamentals of recurrent neural network (RNN) and long short-term memory
(LSTM) network. Physica D: Nonlinear Phenomena, 404, 132306, 2020.
19. I. Banerjee, Y. Ling, M. C. Chen, S. A. Hasan, C. P. Langlotz, N. Moradzadeh, B. Chapman, T.
Amrhein, D. Mong,D. L. Rubin,O. Farri, M. P. Lungren, Comparative effectiveness of convolutional
neural network (CNN) and recurrent neural network (RNN) architectures for radiology text report
classification. Artificial intelligence in medicine, vol. 97, pp. 79-88, 2019.
20. Y. Chen, X. Zhao, X. Jia, Spectral–spatial classification of hyperspectral data based on deep belief
network. IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, vol. ,
no. 6, pp. 2381-2392, 2015.
21. J. Masci, U. Meier, D. Cireşan, J. Schmidhuber, Stacked convolutional auto-encoders for hierarchical
feature extraction. In International conference on artificial neural networks, pp. 52-59, (2011), Sprin-
ger, Berlin, Heidelberg.