Open Access   Article Go Back

PROFICIENCY ANALYSIS OF VARIOUS DATA MINING TECHNIQUES FOR DIABETES PROGNOSIS

Misba Reyaz1 , Gagan Kumar2

Section:Research Paper, Product Type: Journal Paper
Volume-6 , Issue-6 , Page no. 990-997, Jun-2018

CrossRef-DOI:   https://doi.org/10.26438/ijcse/v6i6.990997

Online published on Jun 30, 2018

Copyright © Misba Reyaz, Gagan Kumar . This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

View this paper at   Google Scholar | DPI Digital Library

How to Cite this Paper

  • IEEE Citation
  • MLA Citation
  • APA Citation
  • BibTex Citation
  • RIS Citation

IEEE Style Citation: Misba Reyaz, Gagan Kumar, “PROFICIENCY ANALYSIS OF VARIOUS DATA MINING TECHNIQUES FOR DIABETES PROGNOSIS,” International Journal of Computer Sciences and Engineering, Vol.6, Issue.6, pp.990-997, 2018.

MLA Style Citation: Misba Reyaz, Gagan Kumar "PROFICIENCY ANALYSIS OF VARIOUS DATA MINING TECHNIQUES FOR DIABETES PROGNOSIS." International Journal of Computer Sciences and Engineering 6.6 (2018): 990-997.

APA Style Citation: Misba Reyaz, Gagan Kumar, (2018). PROFICIENCY ANALYSIS OF VARIOUS DATA MINING TECHNIQUES FOR DIABETES PROGNOSIS. International Journal of Computer Sciences and Engineering, 6(6), 990-997.

BibTex Style Citation:
@article{Reyaz_2018,
author = {Misba Reyaz, Gagan Kumar},
title = {PROFICIENCY ANALYSIS OF VARIOUS DATA MINING TECHNIQUES FOR DIABETES PROGNOSIS},
journal = {International Journal of Computer Sciences and Engineering},
issue_date = {6 2018},
volume = {6},
Issue = {6},
month = {6},
year = {2018},
issn = {2347-2693},
pages = {990-997},
url = {https://www.ijcseonline.org/full_paper_view.php?paper_id=2287},
doi = {https://doi.org/10.26438/ijcse/v6i6.990997}
publisher = {IJCSE, Indore, INDIA},
}

RIS Style Citation:
TY - JOUR
DO = {https://doi.org/10.26438/ijcse/v6i6.990997}
UR - https://www.ijcseonline.org/full_paper_view.php?paper_id=2287
TI - PROFICIENCY ANALYSIS OF VARIOUS DATA MINING TECHNIQUES FOR DIABETES PROGNOSIS
T2 - International Journal of Computer Sciences and Engineering
AU - Misba Reyaz, Gagan Kumar
PY - 2018
DA - 2018/06/30
PB - IJCSE, Indore, INDIA
SP - 990-997
IS - 6
VL - 6
SN - 2347-2693
ER -

VIEWS PDF XML
454 230 downloads 198 downloads
  
  
           

Abstract

In this paper, various data mining techniques are analyzed and their proficiencies have been evolved. Medical professionals need a reliable prediction methodology to diagnose factors influencing diabetes. There are large quantities of information about patients and their medical conditions. Generally, data mining (sometimes called data or knowledge discovery) is the process of analyzing data from different perspectives and summarizing it into useful information. The main aim of this thesis is to show the comparison of different classification algorithms such as Multilayer perceptron neural network (MLPNN), Zero R, K-Star based on computing time, Correctly Classified Instances, Incorrectly Classified Instances, Mean absolute error, Root mean squared error, Relative absolute error, Root relative squared error, precision value , Recall , F-measure, the data evaluated using 25 fold Cross Validation error rate, error rate focuses True Positive, True Negative, False Positive and False Negative and the clustering algorithm such as K-means algorithm based on varied number of clusters and Sum of squared error. Classification is an important data mining technique with broad applications. It classifies data of various kinds. Classification is used in every field of our life. Classification is used to classify each item in a set of data into one of predefined set of classes or groups. Clustering analysis method is one of the main analytical methods in data mining; in which k-means clustering algorithm is most popularly/widely used for many applications. K-means algorithm has higher efficiency and scalability and converges fast when dealing with large data sets. Clustering is an adaptive procedure in which objects are clustered or grouped together, based on the principle of maximizing the intra-class similarity and minimizing the inter-class similarity. Data mining software is one of a number of analytical tools for analyzing data. It allows users to analyze data from many different dimensions or angles, categorize it, and summarize the relationships identified. Waikato Environment for Knowledge Analysis or in short, WEKA is used to obtain the accuracy of algorithms and find out which algorithm is most suitable for user working on data of diabetic patients. Weka is a data mining tool. It contains many machine leaning algorithms. It provides the facility to classify our data through various algorithms.

Key-Words / Index Term

Multilayer perceptron neural network (MLPNN), Zero R, K-Star, K-means, WEKA

References

[1] P.Yasodha, M.Kannan, “Analysis of a Population of Diabetic Patients Databases in Weka Tool”, International Journal of Scientific & Engineering Research, Volume 2, Issue 5, May-2011.
[2] K. Rajesh, V. Sangeetha, “Application of Data Mining Methods and Techniques for Diabetes Diagnosis”, International Journal of Engineering and Innovative Technology (IJEIT), Volume 2, Issue 3, September 2012.
[3] Sukhjinder Singh, Kamaljit Kaur,”A Review on Diagnosis of Diabetes in Data Mining”, International Journal of Science and Research (IJSR), 2013.
[4] Veena Vijayan V, Aswathy Ravikumar, ”Study of Data Mining Algorithms for Prediction and Diagnosis of Diabetes Mellitus”, International Journal of Computer Applications (0975 – 8887) Volume 95– No.17, June 2014.
[5] P.Yasodha, N.R. Ananthanarayanan, ”Comparative Study of Diabetic Patient Data’s Using Classification Algorithm in WEKA Tool”, International Journal of Computer Applications Technology and Research, Volume 3– Issue 9, 554 - 558, 2014 .
[6] Aiswarya Iyer, S. Jeyalatha and Ronak Sumbaly,”DIAGNOSIS OF DIABETES USING CLASSIFICATION MINING TECHNIQUES”, International Journal of Data Mining & Knowledge Management Process (IJDKP), Vol.5, No.1, January 2015.
[7] Sabreena Jan, Vinod Sharma,” A Study of various data mining techniques for diabetic prognosis”, International Journal of Modern Computer Science (IJMCS), Volume 4, Issue 3, June, 2016.
[8] P. Suresh Kumar and V. Umatejaswi* , “Diagnosing Diabetes using Data Mining Techniques”, International Journal of Scientific and Research Publications, Volume 7, Issue 6, June 2017 .
[9] Saman Hina*, Anita Shaikh and Sohail Abul Sattar, “Analyzing Diabetes Datasets using Data Mining”, Journal of Basic & Applied Sciences, 2017, 13.
[10] S.Selvakumar, K.Senthamarai Kannan and S.GothaiNachiyar,”Prediction of Diabetes Diagnosis, Using Classification Based Data Mining Techniques”, International Journal of Statistics and Systems, Volume 12, Number 2 (2017).
[11] Prakash Singh, Aarohi Surya, “PERFORMANCE ANALYSIS OF CLUSTERING ALGORITHMS IN DATA MINING IN WEKA”, International Journal of Advances in Engineering & Technology, Jan., 2015.
[12] M.Mounika, S.D.Suganya, B.Vijayashanthi, S.KrishnaAnand,” Predictive Analysis of Diabetic Treatment Using Classification Algorithm”, (IJCSIT) International Journal of Computer Science and Information Technologies, Vol. 6 (3), 2015, 2502-2505.
[13] Bharat Chaudhari, Manan Parikh,”A Comparative Study of clustering algorithms Using weka tools”, International Journal of Application or Innovation in Engineering & Management (IJAIEM), Volume 1, Issue 2, October 2012.
[14] Pallavi , Sunila Godara,” A Comparative Performance Analysis of Clustering Algorithms”, International Journal of Engineering Research and Applications (IJERA), Vol. 1, Issue 3, pp.441-445.
[15] Y. S. Thakare, S. B. Bagal,” Performance Evaluation of K-means Clustering Algorithm with Various Distance Metrics, International Journal of Computer Applications (0975 – 8887), Volume 110 – No. 11, January 2015.
[16] Nidhi Singh, Divakar Singh,”Performance Evaluation of K-Means and Hierarchal Clustering in Terms of Accuracy and Running Time”, (IJCSIT) International Journal of Computer Science and Information Technologies, Vol. 3 (3), 2012, 4119-4121.
[17] Arka Haldar, G.Prudhvi Raj, S.V.S.S Lakshmi, “Comparison of Different Classification Techniques Using WEKA for Diabetic Diagnosis”, International Journal of Innovative Research in Computer and Communication Engineering, Vol. 6, Issue 1, January 2018.
[18] Santosh Rani, Dr. Sandeep Kautish,” Application of Data Mining Techniques for Prediction of Diabetes - A Review, International Journal of Scientific Research in Computer Science, Engineering and Information Technology © 2018 IJSRCSEIT | Volume 3 | Issue 3 | ISSN : 2456-3307.