Open Access Article

POS Tagging for Marathi Language using Hidden Markov Model

Nita V. Patil (1)

  1. School of Computer Sciences, North Maharashtra University, Jalgaon, India.

Correspondence should be addressed to: nvpatil@nmu.ac.in.

Section: Research Paper, Product Type: Journal Paper
Volume 6, Issue 1, Page no. 409-412, Jan 2018

CrossRef-DOI: https://doi.org/10.26438/ijcse/v6i1.409412

Online published on Jan 31, 2018

Copyright © Nita V. Patil. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

How to Cite this Paper

IEEE Style Citation: Nita V. Patil, “POS Tagging for Marathi Language using Hidden Markov Model,” International Journal of Computer Sciences and Engineering, Vol.6, Issue.1, pp.409-412, 2018.

MLA Style Citation: Patil, Nita V. "POS Tagging for Marathi Language using Hidden Markov Model." International Journal of Computer Sciences and Engineering 6.1 (2018): 409-412.

APA Style Citation: Patil, N. V. (2018). POS Tagging for Marathi Language using Hidden Markov Model. International Journal of Computer Sciences and Engineering, 6(1), 409-412.

BibTex Style Citation:
@article{Patil_2018,
author = {Nita V. Patil},
title = {POS Tagging for Marathi Language using Hidden Markov Model},
journal = {International Journal of Computer Sciences and Engineering},
issue_date = {1 2018},
volume = {6},
Issue = {1},
month = {1},
year = {2018},
issn = {2347-2693},
pages = {409-412},
url = {https://www.ijcseonline.org/full_paper_view.php?paper_id=1694},
doi = {https://doi.org/10.26438/ijcse/v6i1.409412},
publisher = {IJCSE, Indore, INDIA},
}

RIS Style Citation:
TY - JOUR
DO - https://doi.org/10.26438/ijcse/v6i1.409412
UR - https://www.ijcseonline.org/full_paper_view.php?paper_id=1694
TI - POS Tagging for Marathi Language using Hidden Markov Model
T2 - International Journal of Computer Sciences and Engineering
AU - Nita V. Patil
PY - 2018
DA - 2018/01/31
PB - IJCSE, Indore, INDIA
SP - 409
EP - 412
IS - 1
VL - 6
SN - 2347-2693
ER -

Abstract

Part-of-speech (POS) tagging plays a significant role in almost every natural language processing task. This paper addresses the problem of POS tagging for the Marathi language. Marathi is a free-word-order, morphologically rich, and highly inflectional Indian language. A supervised learning method based on a Hidden Markov Model (HMM) is implemented to annotate Marathi text with POS tags. The training dataset consists of 12,000 Marathi sentences drawn from news in a popular Marathi newspaper. The tagging algorithm predicts the tag for the current word using the previous word-tag pair. The system reports 86.61% accuracy in assigning the correct POS tag to words.
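
To make the idea of predicting the current tag from the previous one concrete, the following is a minimal sketch of a first-order (bigram) HMM tagger with Viterbi decoding. It is not the paper's implementation: the abstract does not give the exact model, so the bigram assumption, the add-one smoothing, the function names (train_hmm, log_prob, viterbi), the romanized toy sentences, and the three-tag set (N, V, PRP) are all illustrative assumptions.

```python
import math
from collections import defaultdict

START = "<s>"  # hypothetical sentence-start pseudo-tag


def train_hmm(tagged_sentences):
    """Count tag-to-tag transitions and tag-to-word emissions."""
    trans = defaultdict(lambda: defaultdict(int))
    emit = defaultdict(lambda: defaultdict(int))
    vocab = set()
    for sentence in tagged_sentences:
        prev = START
        for word, tag in sentence:
            trans[prev][tag] += 1
            emit[tag][word] += 1
            vocab.add(word)
            prev = tag
    return trans, emit, sorted(emit), vocab


def log_prob(counts, context, event, num_outcomes):
    """Add-one smoothed log P(event | context)."""
    row = counts.get(context, {})
    total = sum(row.values())
    return math.log((row.get(event, 0) + 1) / (total + num_outcomes))


def viterbi(words, model):
    """Return the most probable tag sequence for a list of words."""
    if not words:
        return []
    trans, emit, tags, vocab = model
    n_tags = len(tags)
    n_words = len(vocab) + 1  # one extra outcome reserved for unknown words
    # best[i][t] = (log score of the best path ending in tag t, previous tag)
    best = [{}]
    for t in tags:
        score = (log_prob(trans, START, t, n_tags)
                 + log_prob(emit, t, words[0], n_words))
        best[0][t] = (score, None)
    for i in range(1, len(words)):
        best.append({})
        for t in tags:
            e = log_prob(emit, t, words[i], n_words)
            best[i][t] = max(
                (best[i - 1][p][0] + log_prob(trans, p, t, n_tags) + e, p)
                for p in tags
            )
    # Follow the back-pointers from the best final tag.
    tag = max(tags, key=lambda t: best[-1][t][0])
    path = [tag]
    for i in range(len(words) - 1, 0, -1):
        tag = best[i][tag][1]
        path.append(tag)
    return path[::-1]


# Toy training data (illustrative only, not from the paper's corpus).
corpus = [
    [("mulga", "N"), ("amba", "N"), ("khato", "V")],
    [("mulgi", "N"), ("pustak", "N"), ("vachte", "V")],
    [("to", "PRP"), ("amba", "N"), ("khato", "V")],
]
model = train_hmm(corpus)
print(viterbi(["mulgi", "amba", "khato"], model))
```

On this toy corpus the sketch tags the unseen sentence "mulgi amba khato" as N, N, V. Log probabilities are used so that scores for long sentences do not underflow, and add-one smoothing keeps unseen words and unseen tag transitions from receiving zero probability.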

Keywords / Index Terms

Marathi, HMM, POS, Part of Speech, Tagset, Supervised learning
