Open Access   Article Go Back

A Study on Text Recognition using Image Processing with Datamining Techniques

U.Karthikeyan 1 , M. Vanitha2

Section:Research Paper, Product Type: Journal Paper
Volume-7 , Issue-2 , Page no. 783-787, Feb-2019

CrossRef-DOI:   https://doi.org/10.26438/ijcse/v7i2.783787

Online published on Feb 28, 2019

Copyright © U.Karthikeyan, M. Vanitha . This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

View this paper at   Google Scholar | DPI Digital Library

How to Cite this Paper

  • IEEE Citation
  • MLA Citation
  • APA Citation
  • BibTex Citation
  • RIS Citation

IEEE Style Citation: U.Karthikeyan, M. Vanitha, “A Study on Text Recognition using Image Processing with Datamining Techniques,” International Journal of Computer Sciences and Engineering, Vol.7, Issue.2, pp.783-787, 2019.

MLA Style Citation: U.Karthikeyan, M. Vanitha "A Study on Text Recognition using Image Processing with Datamining Techniques." International Journal of Computer Sciences and Engineering 7.2 (2019): 783-787.

APA Style Citation: U.Karthikeyan, M. Vanitha, (2019). A Study on Text Recognition using Image Processing with Datamining Techniques. International Journal of Computer Sciences and Engineering, 7(2), 783-787.

BibTex Style Citation:
@article{Vanitha_2019,
author = {U.Karthikeyan, M. Vanitha},
title = {A Study on Text Recognition using Image Processing with Datamining Techniques},
journal = {International Journal of Computer Sciences and Engineering},
issue_date = {2 2019},
volume = {7},
Issue = {2},
month = {2},
year = {2019},
issn = {2347-2693},
pages = {783-787},
url = {https://www.ijcseonline.org/full_paper_view.php?paper_id=3744},
doi = {https://doi.org/10.26438/ijcse/v7i2.783787}
publisher = {IJCSE, Indore, INDIA},
}

RIS Style Citation:
TY - JOUR
DO = {https://doi.org/10.26438/ijcse/v7i2.783787}
UR - https://www.ijcseonline.org/full_paper_view.php?paper_id=3744
TI - A Study on Text Recognition using Image Processing with Datamining Techniques
T2 - International Journal of Computer Sciences and Engineering
AU - U.Karthikeyan, M. Vanitha
PY - 2019
DA - 2019/02/28
PB - IJCSE, Indore, INDIA
SP - 783-787
IS - 2
VL - 7
SN - 2347-2693
ER -

VIEWS PDF XML
392 282 downloads 166 downloads
  
  
           

Abstract

Text recognition is a technique that recognizes text from the paper document in the desired format (such as .doc or .txt). The text recognition process involves several steps, including pre-processing, segmentation, feature extraction, classification, and post-processing. The preprocessing is performed as a binarized image to convert a grayscale image, and noise is reduced on the input image of the basic operation performed by removing the noise of the image signal. The segmentation phase is used to segment the image given online and segment each character of the segmentation line. Feature extraction is to compute the characteristics of the image document. This document describes techniques for converting the textual content of a paper document into a machine-readable format. This paper analyzes and compares the technical challenges, methods, and performance of text detection and recognition studies in color images. It summarizes the basic issues and lists the factors that should be considered when addressing them. The prior art is classified as step-by-step or integrated and highlights sub-problems including text localization, verification, segmentation and identification of text. This survey provides a basic comparison and analysis of the scope and challenges in the field of text recognition.

Key-Words / Index Term

Classification, Datamining, Segmentation, Text recognition

References

[1] C. Patel and A. Desai, “Segmentation of text lines into words for Gujarati handwritten text,” Proc. 2010 Int. Conf. Signal Image Process. ICSIP 2010, pp. 130–134, 2010.
[2] C. Patel and A. Desai, “Zone identification for Gujarati handwritten word,” Proc. - 2nd Int. Conf. Emerg. Appl. Inf. Technol. EAIT 2011, pp. 194–197, 2011.
[3] C. Patel and A. Desai, “Gujarati Handwritten Character Recognition Using Hybrid Method Based on Binary Tree-Classifier And K-Nearest Neighbour,” Int. J. Eng. Res. Technol., vol. 2, no. 6, pp. 2337–2345, 2013.
[4] A. Desai, “Segmentation of Characters from old Typewritten Documents using Radon Transform,” Int. J. Comput. Appl., vol. 37, no. 9, pp. 10–15, 2012.
[5] A. A. Desai, “Handwritten Gujarati Numeral Optical Character Recognition using Hybrid Feature Extraction Technique,” Int. Conf. Image Process. Comput. Vision, Pattern Recognition, IPCV, 2010.
[6] A. A. Desai, “Gujarati handwritten numeral optical character reorganization through neural network,” J. Pattern Recognit., vol. 43, no. 7, pp. 2582–2589, 2010.
[7] A. a. Desai, “Support vector machine for identification of handwritten Gujarati alphabets using hybrid feature space,” CSI Trans. ICT, vol. 2, no. January, pp. 235–241, 2015.
[8] Mayil S. and Vanitha M, “A Survey on privacy Preserving Data Mining Techniques”, International Journal of Computer Science and Information Technologies. Vol.5 (5), pp. 6054-6056. ISSN: 0975-9646, 2014.
[9] M. Maloo, K. V Kale, and I. Technology, “Support Vector Machine Based Gujarati Numeral Recognition,” Int. J. Comput. Sci. Eng. ({IJCSE}), {ISSN} 0975-3397, vol. 3, no. 7, pp. 2595–2600, 2011.
[10] M. B. Mendapara and M. M. Goswami, “Stroke identification in Gujarati text using directional feature,” Proceeding IEEE Int. Conf. Green Comput. Commun. Electr. Eng. ICGCCEE 2014, 2014.
[11] N. Rave and S. K. Mitra, “Feature extraction based on stroke orientation estimation technique for handwritten numeral,” in Eighth International Conference on Advances in Pattern Recognition (ICAPR), 2015.
[12] Manimaran R. and Vanitha M, “An Efficient Study on Usage of Data Mining Techniques for Predicting Diabetes”, International Journal of Advanced Research Trends in Engineering and Technology (IJARTET) Vol.3 (20), pp.268-272 ISSN: 2394-3785, 2016.
[13] A. N. Vyas and M. M. Goswami, “Classification of handwritten Gujarati numerals,” 2015 Int. Conf. Adv. Comput. Commun. Informatics, ICACCI 2015, pp. 1231–1237, 2015.
[14] Y. M. Prutha and S. G. Anuradha, “Morphological Image Processing Approach of Vehicle Detection for Real-Time Traffic Analysis,” Int. J. Comput. Sci. Int. J. Comput. Sci. Eng., vol. 3, no. 5, pp. 88–92, 2014.
[15] M. A. Abuzaraida, A. M. Zeki, and A. M. Zeki, “Online recognition system for handwritten hindi digits based on matching alignment algorithm,” in International Conference on Advanced Computer Science Applications and Technologies, ACSAT 2014, 2014, pp. 168–171.
[16] S. Belhe, C. Paulzagade, A. Deshmukh, S. Jetley, and K. Mehrotra, “Hindi handwritten word recognition using HMM and symbol tree,” Proceeding Work. Doc. Anal. Recognit. - DAR ’12, p. 9, 2012.
[17] S. Joseph and A. Hameed, “Online handwritten malayalam character recognition using LIBSVM in Matlab,” in National Conference on Communication, Signal Processing and Networking, NCCSN 2014, 2015, pp. 1–5.
[18] A. Arora and A. M. Namboodiri, “A hybrid model for recognition of online handwriting in Indian scripts,” in International Conference on Frontiers in Handwriting Recognition, ICFHR 2010, 2010, pp. 433–438.
[19] K. P. Primekumar and S. M. Idiculla, “On-line Malayalam Handwritten Character Recognition using HMM and SVM,” Int. Conf. Signal Process. , Image Process. Pattern Recognit. [ ICSIPR], pp. 1–5, 2013.
[20] A. Sampath, C. Tripti, and V. Govindaru, “Online Handwritten Character Recognition for Malayalam,” ACM Int. Conf. Proceeding Ser., pp. 661–664, 2012.
[21] G. S. Reddy, P. Sharma, S. R. M. Prasanna, C. Mahanta, and L. N. Sharma, “Combined online and offline assamese handwritten numeral recognizer,” in National Conference on Communications, NCC 2012, 2012.
[22] A. Bharath and S. Madhvanath, “HMM-based lexicon-driven and lexicon-free word recognition for online handwritten indic scripts,” IEEE Trans. Pattern Anal. Mach. Intell., vol. 34, no. 4, pp. 670–682, 2012.