Open Access   Article Go Back

Optimal Feature Selection in Stream Data Classification Using Improved Ensemble Classifier for High Dimension Data

G. Pandey1 , N. Mishra2

Section:Research Paper, Product Type: Journal Paper
Volume-4 , Issue-9 , Page no. 12-18, Sep-2016

Online published on Sep 30, 2016

Copyright © G. Pandey, N. Mishra . This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

View this paper at   Google Scholar | DPI Digital Library

How to Cite this Paper

  • IEEE Citation
  • MLA Citation
  • APA Citation
  • BibTex Citation
  • RIS Citation

IEEE Style Citation: G. Pandey, N. Mishra, “Optimal Feature Selection in Stream Data Classification Using Improved Ensemble Classifier for High Dimension Data,” International Journal of Computer Sciences and Engineering, Vol.4, Issue.9, pp.12-18, 2016.

MLA Style Citation: G. Pandey, N. Mishra "Optimal Feature Selection in Stream Data Classification Using Improved Ensemble Classifier for High Dimension Data." International Journal of Computer Sciences and Engineering 4.9 (2016): 12-18.

APA Style Citation: G. Pandey, N. Mishra, (2016). Optimal Feature Selection in Stream Data Classification Using Improved Ensemble Classifier for High Dimension Data. International Journal of Computer Sciences and Engineering, 4(9), 12-18.

BibTex Style Citation:
@article{Pandey_2016,
author = {G. Pandey, N. Mishra},
title = {Optimal Feature Selection in Stream Data Classification Using Improved Ensemble Classifier for High Dimension Data},
journal = {International Journal of Computer Sciences and Engineering},
issue_date = {9 2016},
volume = {4},
Issue = {9},
month = {9},
year = {2016},
issn = {2347-2693},
pages = {12-18},
url = {https://www.ijcseonline.org/full_paper_view.php?paper_id=1048},
publisher = {IJCSE, Indore, INDIA},
}

RIS Style Citation:
TY - JOUR
UR - https://www.ijcseonline.org/full_paper_view.php?paper_id=1048
TI - Optimal Feature Selection in Stream Data Classification Using Improved Ensemble Classifier for High Dimension Data
T2 - International Journal of Computer Sciences and Engineering
AU - G. Pandey, N. Mishra
PY - 2016
DA - 2016/09/30
PB - IJCSE, Indore, INDIA
SP - 12-18
IS - 9
VL - 4
SN - 2347-2693
ER -

VIEWS PDF XML
1706 1483 downloads 1402 downloads
  
  
           

Abstract

Dynamic feature evaluation and concept evaluation is major challenging task in the field of stream data classification. The continuity of data induced a new feature during classification process, but the classification process is predefined task for assigning data into class. Stream data comes into multiple feature sub-set format into infinite length. The infinite length not decided the how many class are assigned. Genetic algorithm is well known population based method. The performance of genetic algorithm is better than other optimization technique such as POS and ANT colony optimization. The dynamic nature of genetic algorithm maintains the dynamic feature evaluation. The optimization process goes through multiple stages in terms of selection of feature and optimization of feature. The optimized feature reduces the unclassified region of class during classification. The proposed method for stream data classification is MMCM-GA is implemented in MATLAB 7.8.0. And test the validation process used some reputed data set from UCI machine learning prosperity. These data are corpus, forest and finally used glass dataset. Our empirical evaluation of result shows better feature evaluation and minimization of error rate in comprehension of MCM stream data classification

Key-Words / Index Term

Stream Data Classification, POS, Ensemble, Optimal Feature, Genetic Algorithm (GA)

References

[1] Dayong Wang, Pengcheng Wu, Peilin Zhao, Yue Wu, Chunyan Miao, Steven C.H. Hoi �High-dimensional Data Stream Classification via Sparse Online Learning� IEEE International Conference on Data Mining, 2014. Pp 1007-1012.
[2] Mohammad M. Masud,Jing Gao, Latifur Khan, , Jiawei Han and Bhavani Thurai singham �Classification and Novel Class Detection in Concept-Drifting Data Streams under Time Constraints� IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, VOL. 23, NO. 6, JUNE 2011. Pp 859-874.
[3] Ge Song, Yunming Ye �A New Ensemble Method for Multi-label Data Stream Classification in Non-stationary Environment� 2014 International Joint Conference on Neural Networks July 6-11, 2014, Beijing, China, Pp 1776-1783.
[4] Rashmi Dutta Baruah, PlamenAngelov, Diganta Baruah �Dynamically Evolving Fuzzy Classifier for Real-time Classification of Data Streams� IEEE International Conference on Fuzzy Systems (FUZZ-IEEE) , July 6-11, 2014, Beijing, China , Pp 383-389.
[5] Justin Ma, Alex Kulesza, Mark Dredze, Koby Crammer, Lawrence K. Saul, Fernando Pereira �Exploiting Feature Covariance in High-Dimensional Online Learning� 2009, Pp 493-500.
[6] Mohammad M. Masud , Qing Chen , Latifur Khan, Charu Aggarwal ,Jing Gao, Jiawei Han and Bhavani Thurai singham �Addressing Concept-Evolution in Concept-Drifting Data Streams� 2009, Pp 124-130
[7] Mohammad M. Masud, Tahseen M. Al-Khateeb, Latifur Khan,Charu Aggarwal, Jing Gao Jiawei Hanand Bhavani Thuraisingham�Detecting Recurring and Novel Classes in Concept-Drifting Data Streams� 2012. Pp 897-902.
[8] Mohammad M. Masud, Qing Chen, Jing Gao, Latifur Khan, Jiawei Han, and Bhavani Thuraisingham �Classification and Novel Class Detection of Data Streams in a Dynamic Feature Space� J.L. Balcazar et al. (Eds.): ECML PKDD 2010, Pp. 337�352.
[9] Zhihui Lai, Zhong Jin, Jian Yang, W.K Wong �Sparse Local Discriminant Projections for Face Feature Extraction� IEEE, 2010, Pp1051-1060.
[10] Clay Woolam, Mohammad M. Masud, and Latifur Khan �Lacking Labels in the Stream: Classifying Evolving Stream Data with Few Labels� 2009, LNAI 5722. Pp. 552�562.
[11] Li Su, Hong-yanLiu, Zhen-Hui Song, �A New Classification Algorithm for Data Stream� , I.J.Modern Education and Computer Science, 2011. Pp 32-39
[12] Manjeet Kaur, Manoj agnihotri �A Hybrid technique using Genetic algorithm and ANT colony optimization for improving in cloud datacenter�. IJCSE, Volume 04, Issue-08.