Open Access   Article Go Back

Strategy for Hindi Text Summarization using Content Based Indexing Approach

S. Sargule1 , R.M. Kagalkar2

Section:Research Paper, Product Type: Journal Paper
Volume-4 , Issue-9 , Page no. 36-42, Sep-2016

Online published on Sep 30, 2016

Copyright © S. Sargule, R.M. Kagalkar . This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

View this paper at   Google Scholar | DPI Digital Library

How to Cite this Paper

  • IEEE Citation
  • MLA Citation
  • APA Citation
  • BibTex Citation
  • RIS Citation

IEEE Style Citation: S. Sargule, R.M. Kagalkar, “Strategy for Hindi Text Summarization using Content Based Indexing Approach,” International Journal of Computer Sciences and Engineering, Vol.4, Issue.9, pp.36-42, 2016.

MLA Style Citation: S. Sargule, R.M. Kagalkar "Strategy for Hindi Text Summarization using Content Based Indexing Approach." International Journal of Computer Sciences and Engineering 4.9 (2016): 36-42.

APA Style Citation: S. Sargule, R.M. Kagalkar, (2016). Strategy for Hindi Text Summarization using Content Based Indexing Approach. International Journal of Computer Sciences and Engineering, 4(9), 36-42.

BibTex Style Citation:
@article{Sargule_2016,
author = {S. Sargule, R.M. Kagalkar},
title = {Strategy for Hindi Text Summarization using Content Based Indexing Approach},
journal = {International Journal of Computer Sciences and Engineering},
issue_date = {9 2016},
volume = {4},
Issue = {9},
month = {9},
year = {2016},
issn = {2347-2693},
pages = {36-42},
url = {https://www.ijcseonline.org/full_paper_view.php?paper_id=1052},
publisher = {IJCSE, Indore, INDIA},
}

RIS Style Citation:
TY - JOUR
UR - https://www.ijcseonline.org/full_paper_view.php?paper_id=1052
TI - Strategy for Hindi Text Summarization using Content Based Indexing Approach
T2 - International Journal of Computer Sciences and Engineering
AU - S. Sargule, R.M. Kagalkar
PY - 2016
DA - 2016/09/30
PB - IJCSE, Indore, INDIA
SP - 36-42
IS - 9
VL - 4
SN - 2347-2693
ER -

VIEWS PDF XML
2370 1619 downloads 1430 downloads
  
  
           

Abstract

The Document summarization provides summary of document in a very short time. Existing systems for document summarization have work carried on English text summarization. Such systems do not consider the context of the word to produce summary. Previously implemented document summarization models generally use the similarity among sentences in the original document to extract the most relevant sentences. The documents along with the sentences are generally indexed using standard term indexing computation methods, which do not take into account the context related to document. System takes Hindi document as input. That document undergoes through the algorithm and final output is produced as summary of input Hindi document by considering the context of the word. The Bernoulli Model of Randomness technique is used to check the probability of co-occurrences of two terms in large corpus. The methodology used contains lexical association, sentences indexing, word indexing.

Key-Words / Index Term

Document Summarization, Lexical Association, Context Indexing

References

[1] P. Goyal, L. Behera, and T. M. McGinnity, �A Context-Based Word Indexing Model for Document Summarization�, IEEE Trans. on Knowledge and Data Engineering, vol. 25, no. 8, August 2013.
[2] Q. Cao and S. Fujita, "Cost-effective replication schemes for query load balancing in DHT-based peer-to-peer P. Goyal, L. Behera, and T. M. McGinnity, �A Context-Based Word Indexing Model for Document Summarization�, IEEE Trans. on Knowledge and Data Engineering, vol. 25, no. 8, August 2013.
[3] X. Wan and J. Xiao, �Exploiting Neighborhood Knowledge for Single Document Summarization and Keyphrase Extraction,� ACM Trans. Information Systems, vol. 28, pp. 8:1-8:34, http://doi.acm.org/10.1145/1740592.1740596, June 2010.
[4] L.L. Bando, F. Scholer, and A. Turpin, �Constructing Query- Biased Summaries: A Comparison of Human and System Generated Snippets,� Proc. Third Symp. Information Interaction in Context, pp. 195-204, http://doi.acm.org/10.1145/1840784.1840813, 2010.
[5] X. Wan, �Towards a Unified Approach to Simultaneous Single- Document and Multi-Document Summarizations,� Proc. 23rd Int�l Conf. Computational Linguistics, pp. 1137-1145,http://portal.acm.org/citation.cfm?id=1873781.1873909, 2010.
[6] X. Wan, �An Exploration of Document Impact on Graph-Based Multi-Document Summarization,� Proc. Conf. Empirical Methods in Natural Language Processing, pp. 755-762, http://portal.acm.org/citation.cfm?id=1613715.1613811, 2008.
[7] Q.L. Israel, H. Han, and I.-Y. Song, �Focused Multi-Document Summarization: Human Summarization Activity vs. Automated Systems Techniques,� J. Computing Sciences in Colleges, vol. 25, pp. 10-20, http://portal.acm.org/citation.cfm?id=1747137.1747140, May 2010.
[8] C. Shen and T. Li, �Multi-Document Summarization via the Minimum Dominating Set,� Proc. 23rd Int�l Conf. Computational Linguistics, pp. 984-992, http://portal.acm.org/citation.cfm?id=1873781.1873892, 2010.
[9] X. Wan and J. Yang, �Multi-Document Summarization Using Cluster-Based Link Analysis,� Proc. 31st Ann. Int�l ACM SIGIR Conf. Research and Development in Information Retrieval, pp. 299-306,http://doi.acm.org/10.1145/1390334.1390386, 2008.
[10] D. Wang, T. Li, S. Zhu, and C. Ding, �Multi-Document Summarization via Sentence-Level Semantic Analysis and Symmetric Matrix Factorization,� Proc. 31st Ann. Int�l ACM SIGIR Conf. Research and Development in Information Retrieval, pp. 307-314,http://doi.acm.org/10.1145/1390334.1390387, 2008.
[11] S. Harabagiu and F. Lacatusu, �Using Topic Themes for Multi- Document Summarization,� ACM Trans. Information Systems, vol. 28, pp. 13:1-13:47, http://doi.acm.org/10.1145/1777432.1777436, July 2010.
[12] R. Varadarajan, V. Hristidis, and T. Li, �Beyond Single-Page Web Search Results,� IEEE Trans. Knowledge and Data Eng., vol. 20,no. 3, pp. 411-424, Mar. 2008.
[13] E. Lloret, A. Balahur, M. Palomar, and A. Montoyo, �Towards Building a Competitive Opinion Summarization System: Challenges and Keys,� Proc. Human Language Technologies: The 2009 Ann. Conference of the North Am. Ch. Assoc. for Computational Linguistics, Companion Vol. : Student Research Workshop and Doctoral Consortium, pp. 72-77,http://portal.acm.org/citation.cfm?id=1620932.1620945, 2009.
[14] J.G. Conrad, J.L. Leidner, F. Schilder, and R. Kondadadi, �Query-Based Opinion Summarization for Legal Blog Entries,� Proc. 12th Int�l Conf. Artificial Intelligence and Law, pp. 167-176, http://doi.acm.org/10.1145/1568234.1568253, 2009.
[15] H. Nishikawa, T. Hasegawa, Y. Matsuo, and G. Kikui, �Opinion Summarization with Integer Linear Programming Formulation for Sentence Extraction and Ordering,� Proc. 23rd Int�l Conf. Computational Linguistics: Posters, pp. 910-918, http://portal.acm.org/citation.cfm?id=1944566.1944671, 2010.
[16] C.C. Chen and M.C. Chen, �TSCAN: A Content Anatomy Approach to Temporal Topic Summarization,� IEEE Trans.Knowledge and Data Eng., vol. 24, no. 1, pp. 170-183, Jan. 2012.
[17] B. Andreopoulos, D. Alexopoulou, and M. Schroeder, �Word Sense Disambiguation in Biomedical Ontologies with Term Co-Occurrence Analysis and Document Clustering,� Int�l J. Data Mining and Bioinformatics, vol. 2, pp. 193-215, http://portal.acm.org/citation.cfm?id=1413934.1413935, Sept. 2008.
[18] P. Goyal, L. Behera, and T. McGinnity, �Query Representation Through Lexical Assoc. for Information Retrieval,� IEEE Trans.Knowledge and Data Eng., vol. 24, no. 12, pp. 2260-2273, Dec. 2011.
[19] D.E. Losada and L. Azzopardi, �Assessing Multivariate Bernoulli Models for Information Retrieval,� ACM Trans. Information Systems, vol. 26, pp. 17:1-17:46, http://doi.acm.org/10.1145/1361684.1361690, June 2008.
[20] J. Clarke and M. Lapata, �Discourse Constraints for DocumentCompression,� Computational Linguistics, vol. 36, pp. 411-441, 2010.
[21] Y. Ouyang, W. Li, Q. Lu, and R. Zhang, �A Study on Position Information in Document Summarization,� Proc. 23rd Int�l Conf. Computational Linguistics: Posters, pp. 919-927, http://portal.acm.org/citation.cfm?id=1944566.1944672, 2010.
[22] M. Karimzadehgan and C. Zhai, �Estimation of Statistical Translation Models Based on Mutual Information for Ad Hoc Information Retrieval,� Proc. 33rd Int�l ACM SIGIR Conf. Research and Development in Information Retrieval, pp. 323-330, http://doi.acm.org/10.1145/1835449.1835505, 2010.
[23] Swati Sargule, Ramesh M Kagalkar �Hindi Language Document Summarization using Context Based Indexing Model�, CiiT International Journal of Data Mining Knowledge Engineering, Vol.08 No. 01, Jan Issue 2016.
[24] Ramesh M. Kagalkar, Dr. Nagaraj H.N and Dr. S.V Gumaste, �A Novel Technical Approach for Implementing Static Hand Gesture Recognition�, International Journal of Advanced Research in Computer and Communication Engineering ISSN (Online) 2278-1021 ISSN (Print) 2319-5940 Vol. 4, Issue 7, July 2015.
[25] Amit kumar and Ramesh Kagalkar �Advanced Marathi Sign Language Recognition using Computer Vision� , International Journal of Computer Applications (0975 � 8887) Volume 118 � No. 13, May 2015.
[26] Amit kumar and Ramesh Kagalkar �Sign Language Recognition for Deaf User�, Internal Journal for Research in Applied Science and Engineering Technology, Volume 2 Issue XII, December 2014.
[27] Ramesh. M. Kagalkar and Dr. S. V. Gumaste �Automatic Graph Based Clustering for Image Searching and Retrieval from Database�, CiiT Software Engineering and Technology, Vol 8, No 2, 2016.
[28] Ramesh. M. Kagalkar and Dr. S. V. Gumaste �Review Paper: Detail Study for Sign Language Recognition Techniques�, CiiT Digital Image Processing, Vol 8, No 3, 2016.
[29] Ramesh M Kagalkar, Kajal Chavan, Asmita Jadhav, Ravina Patil and Asmita Rawool,� Self-Educating Tool Kit for Kids� CiiT Software Engineering and Technology, Vol 8, No 1, 2016.
[30] Vasundhara Kadam and Ramesh Kagalkar,� Review on Textual Description of Image Contents�, International Journal of Computer Applications (0975 � 8887) National Conference on Advances in Computing (NCAC 2015)
[31] Mrunmayee Patil and Ramesh Kagalkar �An Automatic Approach for Translating Simple Images into Text Descriptions and Speech for Visually Impaired People�, International Journal of Computer Applications (0975 � 8887) Volume 118 � No. 3, May 2015.
[32] Kaveri Kamble and Ramesh Kagalkar, �Audio Visual Speech Synthesis and Speech Recognition for Hindi Language�, International Journal of Computer Science and Information Technologies (IJCSIT) ISSN (Online): 0975-9646, Vol. 6 Issue 2, April 2015.
[33] Shivaji Chaudhari and Ramesh Kagalkar �A Review of Automatic Speaker recognition and Identifying Speaker Emotion Using Voice Signal� International Journal of Science and Research (IJSR), Volume 3, Issue 11 November 2014.
[34] Ajay R. Kadam and Ramesh Kagalkar, �Audio Scenarios Detection Technique�, International Journal of Computer Applications (IJCA), Volume 120 June 2015, Edition ISBN: 973-93-80887-55-4.
[35] Ramesh.M.Kagalkar and P.N.Girija� Neural Network Based Document Image Analysis for Text, Image Localization Using Wavelet Decomposition and Mathematical Morphology� International Journal on Computer Science and Information Technology (IJCEIT) Volume 16, No 21, ISSN 0974-2034, Jan-Feb 2010.
[36] Ramesh.M.Kagalkar, Mrityunjaya .V. Latte and Basavaraj.M.Kagalkar �An Improvement In Stopping Force Level Set Based Image Segmentation� International Journal on Computer Science and Information Technology(IJCEIT) ISSN 0974-2034, Volume 24, Issue No 01, June � August 2010.
[37] Vrushali K Gaikwad and Ramesh Kagalkar �Security and Verification of Data in Multi-Cloud Storage with Provable Data Possession�, International Journal of Computer Applications (0975 � 8887) Volume 117 � No. 5, May 2015.
[38] Swati Sargule, Ramesh M Kagalkar �Methodology of Context Centered Term Indexing Style Intended For Hindi Language Document Summarization�, CiiT International Journal of Software Engineering and Technology, Vol.08 No. 05, June Issue 2016.