Open Access   Article Go Back

Performance Evaluation Using Classifier Algorithm On Endometrial Cancer Data

A. Hency Juliet1 , R. Padmajavalli2

Section:Research Paper, Product Type: Journal Paper
Volume-6 , Issue-10 , Page no. 801-807, Oct-2018

CrossRef-DOI:   https://doi.org/10.26438/ijcse/v6i10.801807

Online published on Oct 31, 2018

Copyright © A. Hency Juliet, R. Padmajavalli . This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

View this paper at   Google Scholar | DPI Digital Library

How to Cite this Paper

  • IEEE Citation
  • MLA Citation
  • APA Citation
  • BibTex Citation
  • RIS Citation

IEEE Style Citation: A. Hency Juliet, R. Padmajavalli, “Performance Evaluation Using Classifier Algorithm On Endometrial Cancer Data,” International Journal of Computer Sciences and Engineering, Vol.6, Issue.10, pp.801-807, 2018.

MLA Style Citation: A. Hency Juliet, R. Padmajavalli "Performance Evaluation Using Classifier Algorithm On Endometrial Cancer Data." International Journal of Computer Sciences and Engineering 6.10 (2018): 801-807.

APA Style Citation: A. Hency Juliet, R. Padmajavalli, (2018). Performance Evaluation Using Classifier Algorithm On Endometrial Cancer Data. International Journal of Computer Sciences and Engineering, 6(10), 801-807.

BibTex Style Citation:
@article{Juliet_2018,
author = {A. Hency Juliet, R. Padmajavalli},
title = {Performance Evaluation Using Classifier Algorithm On Endometrial Cancer Data},
journal = {International Journal of Computer Sciences and Engineering},
issue_date = {10 2018},
volume = {6},
Issue = {10},
month = {10},
year = {2018},
issn = {2347-2693},
pages = {801-807},
url = {https://www.ijcseonline.org/full_paper_view.php?paper_id=3104},
doi = {https://doi.org/10.26438/ijcse/v6i10.801807}
publisher = {IJCSE, Indore, INDIA},
}

RIS Style Citation:
TY - JOUR
DO = {https://doi.org/10.26438/ijcse/v6i10.801807}
UR - https://www.ijcseonline.org/full_paper_view.php?paper_id=3104
TI - Performance Evaluation Using Classifier Algorithm On Endometrial Cancer Data
T2 - International Journal of Computer Sciences and Engineering
AU - A. Hency Juliet, R. Padmajavalli
PY - 2018
DA - 2018/10/31
PB - IJCSE, Indore, INDIA
SP - 801-807
IS - 10
VL - 6
SN - 2347-2693
ER -

VIEWS PDF XML
494 253 downloads 269 downloads
  
  
           

Abstract

Many factors affecting the success of data mining techniques, the pureness of data are one of the factors. The inclusion of irrelevant and noisy data in the pattern analyzing phase, can results poor predicting performance. To discover information from the endometrial carcinoma data set, the pre-processing technique such as cleaning, transforming and modelling are applied. Diverse kinds of pre-processing techniques were functions in the data set in order to work with the full pledged data set. Methodology Used: The data mining tool WEKA is used for feature selection. Using various classifiers, evaluators and search methods six features are selected out of eighteen, attributes. The performance evaluation was done using RStudio. The accuracy of the classifiers model Random Forest and Naïve Bayes are checked for the minimized and full data sets. The hybrid model was formed by combing both the models to improve the performance of the classifier model. Findings: The Hybrid model was adopted for the performance evaluation by combining naïve Bayes and random forest classifier and the accuracy of the new model is 93.55%.

Key-Words / Index Term

WEKA; Endometrial; R; carcinoma; classifiers; Naïve Bayes; Random Forest

References

[1] A detailed guide – endometrial cancer. Available in http://www.cancer.org/cancer/endometrialcancer/what-is-endometrial-cancer.
[2] A detailed guide – endometrial cancer. Available in www. cancer. org/ cancer/ endometrial cancer/cancer-risk-factors.
[3] Chia-Ming Wang, Yin-Fu Huang, Evolutionary-based feature selection approaches with new criteria for data mining: A case study of credit approval data, Elsevier Journal, Expert Systems with Applications 36 (2009) 5900–5908
[4] E. Friberg & N. Orsini & C. S. Mantzoros & A. Wolk, Diabetes Mellitus And Risk Of Endometrial Cancer: A Meta-Analysis, Springer-Verlag 2007.
[5] F.Angiulli and C. Pizzuti, “Outlier Mining in Large High-Dimensional Data Sets”, IEEE Trans. on Knowledge and Data Engineering, vol. 17, no. 2, pp. 203-215, 2005.
[6] Faysal A Saksouk, MD; Chief Editor: Eugene C Lin, MD. Endometrial Carcinoma Imaging.
[7] Wrathematics (27 August 2011). "How Much of R Is Written in R". librestats. Retrieved 2018-08-07.
[8] “7 of the Best Free Graphical User Interfaces for R". linuxlinks.com. Retrieved 9 February 2016.
[9] Huan Liu and Lei Yu, Toward Integrating Feature Selection Algorithms for Classification and Clustering, IEEE Transactions on Computers, Pages 191 - 226
[10] J.Han, M.Kamber, J.Pei. Data Mining concepts and Techniques. 3rd Edition, Simon Fraser University, pages-230-240, 2006.
[11] M. Dash. Feature selection via set cover. In Proceedings of IEEE Knowledge and Data Engineering Exchange Workshop, pages 165–171, 1997.
[12] Mark Hall and Geoffrey Holmes, Benchmarking Attribute Selection Techniques for Discrete Class Data Mining, Mark Hall and Geoffrey Holmes, 2002
[13] Mark Hall, correlation based feature selection for district and numeric machine learning, In proc. Of 17th International conference on Machince Learning, ICML2000.
[14] P. Mitra, C. A. Murthy, and S. K. Pal. Unsupervised feature selection using feature similarity. IEEE Transactions on Pattern Analysis and Machine Intelligence, 24(3):301–312, 2002.
[15] P. Ravisankar, V. Ravi, G. Raghava Rao, I. Bose, Detection of financial statement fraud and feature selection using data mining techniques, Elsevier Journal Decision Support Systems 50 (2011) 491–500
[16] Ramya Rathan, Sridhar R, Balasubraminian S, Association Rule- Spatial Data Mining Approach For Exploration Of Endometrial Cancer Data, International Journal Of Advanced Research In Computer Science And Software Engineering, Volume 3, Issue 10, October 2013.
[17] The Cancer Genome Atlas Research Network. Integrated Genomic Characterization of Endometrial Carcinoma. Nature. May 2, 2013. DOI: 10.1038/nature12113.
[18] Wei-Shinn Ku, A Bayesian Inference-Based Framework for RFID Data Cleansing, IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, VOL. 25, NO. 10, Page - 2177, 2013
[19] K. Pavya, B. Srinivasan, "Enhancing Wrapper Based Algorithms for Selecting Optimal Features from Thyroid Disease Dataset", International Journal of Computer Sciences and Engineering, Vol.6, Issue.3, pp.7-13, 2018.
[20] B. Bakariya, G.S. Thakur, "Effectuation of Web Log Preprocessing and Page Access Frequency using Web Usage Mining", International Journal of Computer Sciences and Engineering, Vol.1, Issue.1, pp.1-5, 2013.
[21] G. Pandey, N. Mishra, "Optimal Feature Selection in Stream Data Classification Using Improved Ensemble Classifier for High Dimension Data", International Journal of Computer Sciences and Engineering, Vol.4, Issue.9, pp.12-18, 2016.