Open Access   Article Go Back

An Effective Clustering Approach for Text Summarization

Rajani S. Sajjan1 , Meera G. Shinde2

Section:Research Paper, Product Type: Journal Paper
Volume-7 , Issue-10 , Page no. 191-197, Oct-2019

CrossRef-DOI:   https://doi.org/10.26438/ijcse/v7i10.191197

Online published on Oct 31, 2019

Copyright © Rajani S. Sajjan, Meera G. Shinde . This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

View this paper at   Google Scholar | DPI Digital Library

How to Cite this Paper

  • IEEE Citation
  • MLA Citation
  • APA Citation
  • BibTex Citation
  • RIS Citation

IEEE Style Citation: Rajani S. Sajjan, Meera G. Shinde, “An Effective Clustering Approach for Text Summarization,” International Journal of Computer Sciences and Engineering, Vol.7, Issue.10, pp.191-197, 2019.

MLA Style Citation: Rajani S. Sajjan, Meera G. Shinde "An Effective Clustering Approach for Text Summarization." International Journal of Computer Sciences and Engineering 7.10 (2019): 191-197.

APA Style Citation: Rajani S. Sajjan, Meera G. Shinde, (2019). An Effective Clustering Approach for Text Summarization. International Journal of Computer Sciences and Engineering, 7(10), 191-197.

BibTex Style Citation:
@article{Sajjan_2019,
author = {Rajani S. Sajjan, Meera G. Shinde},
title = {An Effective Clustering Approach for Text Summarization},
journal = {International Journal of Computer Sciences and Engineering},
issue_date = {10 2019},
volume = {7},
Issue = {10},
month = {10},
year = {2019},
issn = {2347-2693},
pages = {191-197},
url = {https://www.ijcseonline.org/full_paper_view.php?paper_id=4920},
doi = {https://doi.org/10.26438/ijcse/v7i10.191197}
publisher = {IJCSE, Indore, INDIA},
}

RIS Style Citation:
TY - JOUR
DO = {https://doi.org/10.26438/ijcse/v7i10.191197}
UR - https://www.ijcseonline.org/full_paper_view.php?paper_id=4920
TI - An Effective Clustering Approach for Text Summarization
T2 - International Journal of Computer Sciences and Engineering
AU - Rajani S. Sajjan, Meera G. Shinde
PY - 2019
DA - 2019/10/31
PB - IJCSE, Indore, INDIA
SP - 191-197
IS - 10
VL - 7
SN - 2347-2693
ER -

VIEWS PDF XML
258 250 downloads 134 downloads
  
  
           

Abstract

Text summarization automatically creates a shorter version of one or more text documents. It is an effective way of finding relevant information from large set of documents. Text summarization techniques are categorized as Extractive summarization and Abstractive summarization. Extractive summarization methods evaluate text summarization by selecting sentences present in documents according to predefined set of rules. Abstractive summaries attempt to improve the coherence among sentences by eliminating redundancies and clarifying the content of sentences. It should also extract the information is such a way that the content would be in the interest of the user. In this paper we used tokenization for preprocessing of statements then calculate TF/IDF for feature extraction, K-means clustering to generate clusters containing high frequency statements and then NEWSUM algorithm for weighing of statements that are used for relevant content summarization. We also present experimental results on a number of real data sets in order to illustrate the advantages of using proposed approach

Key-Words / Index Term

Text Mining, Text Summarization, Clustering, extractive summary, information extraction

References

[1] HtetMyet Lynn 1 , Chang Choi 2 , Pankoo Kim “An improved method of automatic text summarization for web contents using lexical chain with semantic-related terms”, Springer-Verlag Berlin Heidelberg 2017
[2] Xiaojun Wan 1 , FuliLuo 2 , Xue Sun Songfang Huang3 , Jin-ge Yao “Cross-language document summarization via extraction and ranking of multiple summaries” Springer- Verlag London 2018
[3] Andrew Mackey and Israel Cuevas “AUTOMATIC TEXT SUMMARIZATION WITHIN BIG DATA FRAMEWORKS”, ACM 2018
[4] Yong Zhang, Jinzhi Liao, Jiyuyang Tang “Extractive Document Summarization based on hierarchical GRU”, International Conference on Robots & Intelligent System IEEE 2018
[5] Lili Wan “Extractive Algorithm of English Text Summarization for English Teaching” IEEE 2018
[6] Anurag Shandilya, Kripabandhu Ghosh, Saptarshi Ghosh “Fairness of Extractive Text Summarization”, ACM 2018
[7] P.Krishnaveni, Dr. S. R. Balasundaram “Automatic Text Summarization by Local Scoring and Ranking for Improving Coherence”, Proceedings of the IEEE 2017 International Conference on Computing Methodologies and Communication
[8] Bagalkotkar, A., Kandelwal, A., Pandey, S., &Kamath, S. S. (2013, August). “A Novel Technique for Efficient Text Document Summarization as a Service”, In Advances in Computing and Communications (ICACC), 2013 Third International Conference on (pp. 50-53). IEEE.
[9] Ferreira, Rafael, Luciano de Souza Cabral, Rafael DueireLins, Gabriel Pereira e Silva, Fred Freitas, George DC Cavalcanti, Rinaldo Lima, Steven J. Simske, and Luciano Favaro. "Assessing sentence scoring techniques for extractive text summarization." Expert systems with applications 40, no. 14 (2013): 5755-5764.
[10] Gupta, V. K., &Siddiqui, T. J. (2012, December). “Multi-document summarization using sentence clustering”, In Intelligent Human Computer Interaction (IHCI), 2012 4th International Conference on (pp. 1-5). IEEE.
[11] Min-Yuh Day Department of Information Management Tamkang University New Taipei City, Taiwan myday@mail.tku.edu.tw Chao-Yu Chen Department of Information Management Tamkang University New Taipei City, Taiwan susan.cy.chen@gmail.tw “Artificial Intelligence for Automatic Text Summarization”,2018 IEEE International Conference on Information Reuse and Integration for Data Science
[12] Xiaoping SunandHaiZhuge*, Senior Member, IEEE Laboratory of Cyber-Physical-Social Intelligence, Guangzhou University, China Key Laboratory of Intelligent Information Processing, Institute of Computing Technology, University of Chinese Academy of Sciences, Chinese Academy of Sciences, China System Analytics Research Institute, Aston University, UK “Summarization of Scientific Paper through Reinforcement Ranking on Semantic Link Network” ,IEEE 2018
[13] Ahmad T. Al-Taani (PhD, MSc, BSc) Professor of Computer Science (Artificial Intelligence) Faculty of Information Technology and Computer Sciences Yarmouk University, Jordan. ahmadta@yu.edu.jo “Automatic Text Summarization Approaches” ,IEEE 2017
[14] AlokRanjan Pal Dept. of Computer Science and Engineering College of Engineering and Management, KolaghatKolaghat, India chhaandasik@gmail.com DigantaSaha Dept. of Computer Science and Engineering Jadavpur University Kolkata, India neruda0101@yahoo.com “An Approach to Automatic Text Summarization using WordNet”, IEEE 2014
[15] Prakhar Sethi1, Sameer Sonawane2, Saumitra Khanwalker3, R. B. Keskar4 Department of Computer Science Engineering, Visvesvaraya National Institute of Technology, India 1 prakhar.sethi2@gmail.com, 2 sameer9311@gmail.com, 3 theapogee2011@gmail.com, 4 rbkeskar@cse.vnit.ac.in“Automatic Text Summarization of News Articles”, IEEE 2017
[16] Yue Hu and Xiaojun Wan “PPSGen: Learning-Based Presentation Slides Generation for Academic Papers” , IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, VOL. 27, NO. 4, APRIL 2015
[17] Daan Van Britsom, AntoonBronselaer, and Guy De Tre “Using Data Merging Techniques for Generating Multidocument Summarizations” , IEEE TRANSACTIONS ON FUZZY SYSTEMS, VOL. 23, NO. 3, JUNE 2015
[18] NingZhong, Yuefeng Li, and Sheng-Tang Wu “Effective Pattern Discovery for Text Mining”, IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, VOL. 24, NO. 1, JANUARY 2012
[19] Mohsen Pourvali and Mohammad SanieeAbadeh Department of Electrical & Computer Qazvin Branch Islamic Azad University Qazvin, Iran Department of Electrical and Computer Engineering at TarbiatModares University Tehran, Iran “Automated Text Summarization Base on Lexicales Chain and graph Using of WordNet and Wikipedia Knowledge Base” , IJCSI International Journal of Computer Science Issues, Vol. 9, Issue 1, No 3, January 2012
[20] Daan Van Britsom, AntoonBronselaer, Guy De Tre´ Department of Telecommunications and Information Processing, Ghent University Sint-Pietersnieuwstraat 41, B-9000 Ghent, Belgium “Using data merging techniques for generating multi-document summarizations” ,IEEE TRANSACTIONS ON FUZZY SYSTEMS 2018
[21] Yang Gao,YueXu, Yuefengli, “Pattern-based Topics for Document Modeling in Information Filtering” in IEEE Transaction on Knoweledge and Data Engineering, vol.27,No.6,June 2015.
[22] Y. Bastide, R. Taouil, N. Pasquier, G. Stumme, and L. Lakhal, “Mining frequent patterns with counting inference,” ACM SIGKDD Explorations Newslett., vol. 2, no. 2, pp. 66–75, 2000.
[23] H. Cheng, X. Yan, J. Han, and C.-W. Hsu, “Discriminative frequent pattern analysis for effective classification,” in Proc. IEEE 23rd Int. Conf. Data Eng., 2007, pp. 716–725.
[24] R. J. BayardoJr, “Efficiently mining long patterns from databases,” in Proc. ACM Sigmod Record, 1998, vol. 27, no. 2, pp. 85–93.
[25] J. Han, H. Cheng, D. Xin, and X. Yan, “Frequent pattern mining: Current status and future directions,” Data Min. Knowledge. Discovery., vol. 15, no. 1, pp. 55–86, 2007.
[26] http://kavita-ganesan.com/opiniosis-opinion-dataset/