Open Access   Article Go Back

Linear Support Vector Machine (SVM) with Stochastic Gradient Descent (SGD) training & multinomial Naïve Bayes (NB) in News Classification

Feroz Ahmed1 , Shabina Ghafir2

Section:Research Paper, Product Type: Journal Paper
Volume-7 , Issue-4 , Page no. 360-363, Apr-2019

CrossRef-DOI:   https://doi.org/10.26438/ijcse/v7i4.360363

Online published on Apr 30, 2019

Copyright © Feroz Ahmed, Shabina Ghafir . This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

View this paper at   Google Scholar | DPI Digital Library

How to Cite this Paper

  • IEEE Citation
  • MLA Citation
  • APA Citation
  • BibTex Citation
  • RIS Citation

IEEE Style Citation: Feroz Ahmed, Shabina Ghafir, “Linear Support Vector Machine (SVM) with Stochastic Gradient Descent (SGD) training & multinomial Naïve Bayes (NB) in News Classification,” International Journal of Computer Sciences and Engineering, Vol.7, Issue.4, pp.360-363, 2019.

MLA Style Citation: Feroz Ahmed, Shabina Ghafir "Linear Support Vector Machine (SVM) with Stochastic Gradient Descent (SGD) training & multinomial Naïve Bayes (NB) in News Classification." International Journal of Computer Sciences and Engineering 7.4 (2019): 360-363.

APA Style Citation: Feroz Ahmed, Shabina Ghafir, (2019). Linear Support Vector Machine (SVM) with Stochastic Gradient Descent (SGD) training & multinomial Naïve Bayes (NB) in News Classification. International Journal of Computer Sciences and Engineering, 7(4), 360-363.

BibTex Style Citation:
@article{Ahmed_2019,
author = {Feroz Ahmed, Shabina Ghafir},
title = {Linear Support Vector Machine (SVM) with Stochastic Gradient Descent (SGD) training & multinomial Naïve Bayes (NB) in News Classification},
journal = {International Journal of Computer Sciences and Engineering},
issue_date = {4 2019},
volume = {7},
Issue = {4},
month = {4},
year = {2019},
issn = {2347-2693},
pages = {360-363},
url = {https://www.ijcseonline.org/full_paper_view.php?paper_id=4043},
doi = {https://doi.org/10.26438/ijcse/v7i4.360363}
publisher = {IJCSE, Indore, INDIA},
}

RIS Style Citation:
TY - JOUR
DO = {https://doi.org/10.26438/ijcse/v7i4.360363}
UR - https://www.ijcseonline.org/full_paper_view.php?paper_id=4043
TI - Linear Support Vector Machine (SVM) with Stochastic Gradient Descent (SGD) training & multinomial Naïve Bayes (NB) in News Classification
T2 - International Journal of Computer Sciences and Engineering
AU - Feroz Ahmed, Shabina Ghafir
PY - 2019
DA - 2019/04/30
PB - IJCSE, Indore, INDIA
SP - 360-363
IS - 4
VL - 7
SN - 2347-2693
ER -

VIEWS PDF XML
485 324 downloads 229 downloads
  
  
           

Abstract

The motivation for this work arises from the need of Automatic Document Classification (ADC) which is necessary when the task involves business specific contexts which cannot be fulfilled via querying on any search engine. Since a large number of websites are available over the internet nowadays therefore users generally search information from different websites via search engines now. But search engines require appropriate keywords from users in order to give relevant information from the web and sometimes users have obtained irrelevant results if he or she is not sound enough to provide keywords correctly. Thus, we need a proper document classification for the material of our wish so that the one which is required can be obtained easily instead of wasting time in searching. To understand this, we have discussed the automatic document classification in news domain where we classify news articles into four distinct categories: business, science & technology, entertainment and health using Linear SVM with SGD training and multinomial NB classifier and compare their performance. The classification is based on the title of the news article taken as feature.

Key-Words / Index Term

Automatic Document Classification, Linear SVM, Stochastic Gradient Descent, multinomial NB

References

[1] C.C. Aggarwal and C. Zhai, “Mining Text Data”, MA: Springer, Eds. Boston, pp. 163-222, 2012.
[2] T. Jo, “Classifying News Articles Using Feature Similarity K Nearest Neighbor”, In Proc. of the 6th Int. Conf. on Green and Human Information Technology, ICGHIT 2018, Chiang Mai, Thailand, pp. 73-78, 2018.
[3] S. Xu, Y. Li and Z. Wang, “Bayesian Multinomial Naïve Bayes Classifier to Text Classification”, In Proc. of the 11th Int. Conf. on Multimedia and Ubiquitous Engineering, MUE 2017, Proc. of the 12th Int. Conf. on Future Information Technology, FutureTech 2017, Seoul, South Korea, pp. 347-352 2017.
[4] A. Hassaine, S. Mecheter and A. Jaoua, “Text Categorization Using Hyper Rectangular Keyword Extraction : Application to News Articles Classification”, In Proc. of the 15th Int. Conf. on Relational and Algebraic methods in Computer Science, RAMiCS 2015, Braga, Portugal, pp. 312-325 , 2015.
[5] A. A. Hakim, A. Erwin, K. I. Eng, M. Galinium and W. Muliady, “Automated document classification for news article in Bahasa Indonesia based on term frequency inverse document frequency (TF – IDF) approach”, In Proc. of the 6th Int. Conf. on Information Technology and Electrical Engineering, ICITEE 2014, Yogyakarta, Indonesia, pp. 1-4, 2014.
[6] S. Qiu, M. Jiang, Z. Zhang, Y. Lu and Z. Pei, “Chinese News Text Classification of the Stacked Denoising Auto Encoder Based on Adaptive Learning Rate and Additional Momentum Item”, In Proc. of the 15th Int. Symposium on Neural Networks, ISNN 2018, Minsk, Belarus, pp. 578-584, 2018.
[7] C. Li, G. Zhan and Z. Li, “ News Text Classification Based on Improved Bi-LSTM-CNN,” In Proc. of the 9th Int. Conf. on Information Technology in Medicine and Education, ITME 2018, Hangzhou, China, pp. 890-893, 2018.
[8] M. Fanjin, H. Ling and T.J.W. Xinzheng, “The Research of Semantic Kernel in SVM for Chinese Text Classifiction” , In Proc. of the 2nd Int. Conf. on Intelligent Information Processing, IIP’17 2017, Bangkok, Thailand, 2017.
[9] A.D. Patel and Y.K. Sharma, “Web Page Classification on News Feeds Using Hybrid Technique for Extraction”, In Proc. of the 3rd Int. Conf. on Information and Communication Technology for Intelligent Systems, ICTIS 2018, Ahmedabad, India, pp. 399-305, 2018.
[10] F. Kabir, S. Siddique, M. R. A. Kotwal and M. N. Huda, “Bangla text document categorization using Stochastic Gradient Descent (SGD) classifier”, In the Int. Conf. on Cognitive Computing and Information Processing, CCIP 2015, Noida, India, pp. 1-4, 2015.
[11] S. Brindha, K. Prabha and S. Sukumaran, “A survey on classification techniques for text mining" , In Proc. of the 3rd Int. Conf. on Advanced Computing and Communication Systems, ICACCS 2016, Coimbatore, India, pp. 1-5, 2016.
[12] M. Lichman, “News Aggregator Data Set”, UCI Machine Learning Repository, 2013. (https://archive.ics.uci.edu)