Performance Evaluation Using Classifier Algorithm On Endometrial Cancer Data
A. Hency Juliet1 , R. Padmajavalli2
Section:Research Paper, Product Type: Journal Paper
Volume-6 ,
Issue-10 , Page no. 801-807, Oct-2018
CrossRef-DOI: https://doi.org/10.26438/ijcse/v6i10.801807
Online published on Oct 31, 2018
Copyright © A. Hency Juliet, R. Padmajavalli . This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
View this paper at Google Scholar | DPI Digital Library
How to Cite this Paper
- IEEE Citation
- MLA Citation
- APA Citation
- BibTex Citation
- RIS Citation
IEEE Style Citation: A. Hency Juliet, R. Padmajavalli, “Performance Evaluation Using Classifier Algorithm On Endometrial Cancer Data,” International Journal of Computer Sciences and Engineering, Vol.6, Issue.10, pp.801-807, 2018.
MLA Style Citation: A. Hency Juliet, R. Padmajavalli "Performance Evaluation Using Classifier Algorithm On Endometrial Cancer Data." International Journal of Computer Sciences and Engineering 6.10 (2018): 801-807.
APA Style Citation: A. Hency Juliet, R. Padmajavalli, (2018). Performance Evaluation Using Classifier Algorithm On Endometrial Cancer Data. International Journal of Computer Sciences and Engineering, 6(10), 801-807.
BibTex Style Citation:
@article{Juliet_2018,
author = {A. Hency Juliet, R. Padmajavalli},
title = {Performance Evaluation Using Classifier Algorithm On Endometrial Cancer Data},
journal = {International Journal of Computer Sciences and Engineering},
issue_date = {10 2018},
volume = {6},
Issue = {10},
month = {10},
year = {2018},
issn = {2347-2693},
pages = {801-807},
url = {https://www.ijcseonline.org/full_paper_view.php?paper_id=3104},
doi = {https://doi.org/10.26438/ijcse/v6i10.801807}
publisher = {IJCSE, Indore, INDIA},
}
RIS Style Citation:
TY - JOUR
DO = {https://doi.org/10.26438/ijcse/v6i10.801807}
UR - https://www.ijcseonline.org/full_paper_view.php?paper_id=3104
TI - Performance Evaluation Using Classifier Algorithm On Endometrial Cancer Data
T2 - International Journal of Computer Sciences and Engineering
AU - A. Hency Juliet, R. Padmajavalli
PY - 2018
DA - 2018/10/31
PB - IJCSE, Indore, INDIA
SP - 801-807
IS - 10
VL - 6
SN - 2347-2693
ER -
VIEWS | XML | |
544 | 282 downloads | 293 downloads |
Abstract
Many factors affecting the success of data mining techniques, the pureness of data are one of the factors. The inclusion of irrelevant and noisy data in the pattern analyzing phase, can results poor predicting performance. To discover information from the endometrial carcinoma data set, the pre-processing technique such as cleaning, transforming and modelling are applied. Diverse kinds of pre-processing techniques were functions in the data set in order to work with the full pledged data set. Methodology Used: The data mining tool WEKA is used for feature selection. Using various classifiers, evaluators and search methods six features are selected out of eighteen, attributes. The performance evaluation was done using RStudio. The accuracy of the classifiers model Random Forest and Naïve Bayes are checked for the minimized and full data sets. The hybrid model was formed by combing both the models to improve the performance of the classifier model. Findings: The Hybrid model was adopted for the performance evaluation by combining naïve Bayes and random forest classifier and the accuracy of the new model is 93.55%.
Key-Words / Index Term
WEKA; Endometrial; R; carcinoma; classifiers; Naïve Bayes; Random Forest
References
[1] A detailed guide – endometrial cancer. Available in http://www.cancer.org/cancer/endometrialcancer/what-is-endometrial-cancer.
[2] A detailed guide – endometrial cancer. Available in www. cancer. org/ cancer/ endometrial cancer/cancer-risk-factors.
[3] Chia-Ming Wang, Yin-Fu Huang, Evolutionary-based feature selection approaches with new criteria for data mining: A case study of credit approval data, Elsevier Journal, Expert Systems with Applications 36 (2009) 5900–5908
[4] E. Friberg & N. Orsini & C. S. Mantzoros & A. Wolk, Diabetes Mellitus And Risk Of Endometrial Cancer: A Meta-Analysis, Springer-Verlag 2007.
[5] F.Angiulli and C. Pizzuti, “Outlier Mining in Large High-Dimensional Data Sets”, IEEE Trans. on Knowledge and Data Engineering, vol. 17, no. 2, pp. 203-215, 2005.
[6] Faysal A Saksouk, MD; Chief Editor: Eugene C Lin, MD. Endometrial Carcinoma Imaging.
[7] Wrathematics (27 August 2011). "How Much of R Is Written in R". librestats. Retrieved 2018-08-07.
[8] “7 of the Best Free Graphical User Interfaces for R". linuxlinks.com. Retrieved 9 February 2016.
[9] Huan Liu and Lei Yu, Toward Integrating Feature Selection Algorithms for Classification and Clustering, IEEE Transactions on Computers, Pages 191 - 226
[10] J.Han, M.Kamber, J.Pei. Data Mining concepts and Techniques. 3rd Edition, Simon Fraser University, pages-230-240, 2006.
[11] M. Dash. Feature selection via set cover. In Proceedings of IEEE Knowledge and Data Engineering Exchange Workshop, pages 165–171, 1997.
[12] Mark Hall and Geoffrey Holmes, Benchmarking Attribute Selection Techniques for Discrete Class Data Mining, Mark Hall and Geoffrey Holmes, 2002
[13] Mark Hall, correlation based feature selection for district and numeric machine learning, In proc. Of 17th International conference on Machince Learning, ICML2000.
[14] P. Mitra, C. A. Murthy, and S. K. Pal. Unsupervised feature selection using feature similarity. IEEE Transactions on Pattern Analysis and Machine Intelligence, 24(3):301–312, 2002.
[15] P. Ravisankar, V. Ravi, G. Raghava Rao, I. Bose, Detection of financial statement fraud and feature selection using data mining techniques, Elsevier Journal Decision Support Systems 50 (2011) 491–500
[16] Ramya Rathan, Sridhar R, Balasubraminian S, Association Rule- Spatial Data Mining Approach For Exploration Of Endometrial Cancer Data, International Journal Of Advanced Research In Computer Science And Software Engineering, Volume 3, Issue 10, October 2013.
[17] The Cancer Genome Atlas Research Network. Integrated Genomic Characterization of Endometrial Carcinoma. Nature. May 2, 2013. DOI: 10.1038/nature12113.
[18] Wei-Shinn Ku, A Bayesian Inference-Based Framework for RFID Data Cleansing, IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, VOL. 25, NO. 10, Page - 2177, 2013
[19] K. Pavya, B. Srinivasan, "Enhancing Wrapper Based Algorithms for Selecting Optimal Features from Thyroid Disease Dataset", International Journal of Computer Sciences and Engineering, Vol.6, Issue.3, pp.7-13, 2018.
[20] B. Bakariya, G.S. Thakur, "Effectuation of Web Log Preprocessing and Page Access Frequency using Web Usage Mining", International Journal of Computer Sciences and Engineering, Vol.1, Issue.1, pp.1-5, 2013.
[21] G. Pandey, N. Mishra, "Optimal Feature Selection in Stream Data Classification Using Improved Ensemble Classifier for High Dimension Data", International Journal of Computer Sciences and Engineering, Vol.4, Issue.9, pp.12-18, 2016.