Improving Existing Punjabi Morphological Analyzer using N-gram

S. K. Sharma

Dept. of Computer Science and Applications, DAV University, Jalandhar, India.

Correspondence should be addressed to: sanju3916@rediffmail.com.

Section: Research Paper, Product Type: Journal Paper
Volume-5, Issue-9, Page no. 171-174, Sep-2017

CrossRef-DOI: https://doi.org/10.26438/ijcse/v5i9.171174

Published online on Sep 30, 2017

Copyright © S. K. Sharma. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

How to Cite this Paper

IEEE Style Citation: S. K. Sharma, “Improving Existing Punjabi Morphological Analyzer using N-gram,” International Journal of Computer Sciences and Engineering, Vol.5, Issue.9, pp.171-174, 2017.

MLA Style Citation: S. K. Sharma "Improving Existing Punjabi Morphological Analyzer using N-gram." International Journal of Computer Sciences and Engineering 5.9 (2017): 171-174.

APA Style Citation: Sharma, S. K. (2017). Improving Existing Punjabi Morphological Analyzer using N-gram. International Journal of Computer Sciences and Engineering, 5(9), 171-174.

BibTex Style Citation:
@article{Sharma_2017,
author = {S. K. Sharma},
title = {Improving Existing Punjabi Morphological Analyzer using N-gram},
journal = {International Journal of Computer Sciences and Engineering},
issue_date = {9 2017},
volume = {5},
Issue = {9},
month = {9},
year = {2017},
issn = {2347-2693},
pages = {171-174},
url = {https://www.ijcseonline.org/full_paper_view.php?paper_id=1450},
doi = {https://doi.org/10.26438/ijcse/v5i9.171174},
publisher = {IJCSE, Indore, INDIA},
}

RIS Style Citation:
TY - JOUR
DO - https://doi.org/10.26438/ijcse/v5i9.171174
UR - https://www.ijcseonline.org/full_paper_view.php?paper_id=1450
TI - Improving Existing Punjabi Morphological Analyzer using N-gram
T2 - International Journal of Computer Sciences and Engineering
AU - S. K. Sharma
PY - 2017
DA - 2017/09/30
PB - IJCSE, Indore, INDIA
SP - 171-174
IS - 9
VL - 5
SN - 2347-2693
ER -

Abstract

Morphological analysis is an essential step for almost all natural language processing tasks, such as POS tagging, grammar checking, sentence simplification, treebank generation, and parsing. In this research article, the author uses an N-gram statistical technique to improve the existing Punjabi morphological analyzer. The main factor that reduces the accuracy of a morphological analyzer is the presence of unknown words, so the N-gram approach is used to detect the POS tag of an unknown word. The results show an average precision of 82.34, recall of 70.20, and F-measure of 75.74.
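The abstract does not spell out the exact N-gram model, so the following is only a minimal sketch of one way such a guess could work: the tag of an unknown word is taken to be the tag most often seen between the same left and right neighbouring tags in a tagged corpus, with a back-off to the left neighbour alone. The function names, the tag labels (NN, VB, JJ, PSP), the placeholder tokens, and the toy corpus are all hypothetical and are not taken from the paper.

```python
from collections import Counter, defaultdict


def train_tag_ngrams(tagged_sentences):
    """Collect tag counts per context from sentences of (word, tag) pairs."""
    # (previous tag, next tag) -> counts of the tag seen between them (trigram context)
    context_counts = defaultdict(Counter)
    # previous tag -> counts of the tag that follows it (bigram context)
    left_counts = defaultdict(Counter)
    for sentence in tagged_sentences:
        # Pad with sentence-boundary markers so edge words also get a context.
        tags = ["<s>"] + [tag for _, tag in sentence] + ["</s>"]
        for prev_tag, tag, next_tag in zip(tags, tags[1:], tags[2:]):
            context_counts[(prev_tag, next_tag)][tag] += 1
            left_counts[prev_tag][tag] += 1
    return context_counts, left_counts


def guess_tag(prev_tag, next_tag, context_counts, left_counts, default="UNK"):
    """Guess the tag of an unknown word from the tags of its neighbours."""
    # Try the trigram context first, then back off to the bigram context.
    candidates = (
        context_counts.get((prev_tag, next_tag)),
        left_counts.get(prev_tag),
    )
    for counter in candidates:
        if counter:
            return counter.most_common(1)[0][0]
    # No evidence at all: leave the word marked as unknown.
    return default


# Hypothetical toy corpus; w1..w12 stand in for real tokens.
corpus = [
    [("w1", "NN"), ("w2", "PSP"), ("w3", "NN"), ("w4", "VB")],
    [("w5", "JJ"), ("w6", "NN"), ("w7", "VB")],
    [("w8", "NN"), ("w9", "PSP"), ("w10", "JJ"), ("w11", "NN"), ("w12", "VB")],
]
context_counts, left_counts = train_tag_ngrams(corpus)

# An unknown word between an adjective and a verb.
print(guess_tag("JJ", "VB", context_counts, left_counts))
# An unseen pair of neighbours, resolved by backing off to the left tag only.
print(guess_tag("JJ", "PSP", context_counts, left_counts))
```

In a pipeline like the one the abstract describes, a guess of this kind would only be consulted for words the existing analyzer fails to recognise; known words would keep the analysis they already have.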

Keywords / Index Terms

Morphological analyzer, Morph, N-gram approach
