Open Access   Article Go Back

Imputation of Missing Data using Fuzzy-Rough Hybridization

Pallab Kumar Dey1

Section:Research Paper, Product Type: Journal Paper
Volume-07 , Issue-01 , Page no. 226-231, Jan-2019

Online published on Jan 20, 2019

Copyright © Pallab Kumar Dey . This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

View this paper at   Google Scholar | DPI Digital Library

How to Cite this Paper

  • IEEE Citation
  • MLA Citation
  • APA Citation
  • BibTex Citation
  • RIS Citation

IEEE Style Citation: Pallab Kumar Dey, “Imputation of Missing Data using Fuzzy-Rough Hybridization,” International Journal of Computer Sciences and Engineering, Vol.07, Issue.01, pp.226-231, 2019.

MLA Style Citation: Pallab Kumar Dey "Imputation of Missing Data using Fuzzy-Rough Hybridization." International Journal of Computer Sciences and Engineering 07.01 (2019): 226-231.

APA Style Citation: Pallab Kumar Dey, (2019). Imputation of Missing Data using Fuzzy-Rough Hybridization. International Journal of Computer Sciences and Engineering, 07(01), 226-231.

BibTex Style Citation:
@article{Dey_2019,
author = {Pallab Kumar Dey},
title = {Imputation of Missing Data using Fuzzy-Rough Hybridization},
journal = {International Journal of Computer Sciences and Engineering},
issue_date = {1 2019},
volume = {07},
Issue = {01},
month = {1},
year = {2019},
issn = {2347-2693},
pages = {226-231},
url = {https://www.ijcseonline.org/full_spl_paper_view.php?paper_id=623},
publisher = {IJCSE, Indore, INDIA},
}

RIS Style Citation:
TY - JOUR
UR - https://www.ijcseonline.org/full_spl_paper_view.php?paper_id=623
TI - Imputation of Missing Data using Fuzzy-Rough Hybridization
T2 - International Journal of Computer Sciences and Engineering
AU - Pallab Kumar Dey
PY - 2019
DA - 2019/01/20
PB - IJCSE, Indore, INDIA
SP - 226-231
IS - 01
VL - 07
SN - 2347-2693
ER -

           

Abstract

Missing data imputation has a significant impact in data mining task. Data mining algorithms cannot be executed effectively due to missing attribute values. Improper handle of missing values affects the data mining and classification accuracy. Imputation based preprocessing approach is very effective technique for handling missing value. In this paper most similar object used to impute missing value. For searching similar object core attributes have to give highest priority after that reduct attributes. In the proposed method to fill missing value concept of core and reduct attributes has been used. Rough set is most suitable to handle discrete data. Fuzzy set can handle continuous data in a better way. Hybridization methodology like fuzzy-rough set are more powerful to deal with imprecision and uncertainty for discrete as well as continuous data. Detail study has been given to impute missing value. Fuzzy rough set based fuzz- rough core reduct based (FRCRB) algorithm has been proposed for missing value imputation.

Key-Words / Index Term

Missing value, Imputation, Rough set, Fuzzy set and fuzzy-rough set, data analysis.

References

[1] P. K. Dey and S. Mukhopadhyay, “Core reduct based preprocessing approach to incomplete data”, International Journal of Intelligent Engineering and Systems, Vol. 10, No. 5, pp. 19-28, 2017.
[2] A. C. Acock, “Working with Missing Values”, Journal of Marriage and Family, Vol.67, No.4, pp. 1012-1028, 2005.
[3] J. R. Quinlan, C4.5: Programs for Machine Learning, Morgan Kaufmann Publishers, 1993.
[4] J. W. Grzymala-Busse and A. Y. Wang, “Modified algorithms LEM1 and LEM2 for rule induction from data with missing attribute values”, In: Proc. of Fifth international Workshop on Rough Sets and Soft Computing (RSSC’97), Third Joint Conference on Information Sciences (JCIS’97), pp.69-72, 1997.
[5] J. W. Grzymala-Busse, “Data with missing attribute values: Generalization of idiscernibility relation and rule induction”, Transactions on Rough Sets, Lecture Notes in Computer Science Journal Subline, Springer-Verlag, vol. 1 , pp. 78–95, 2004.
[6] D. Aha, D. Kibler and M. Albert, “Instance-based learning algorithms”, Machine Learning, Vol.6, pp. 37-66, 1991.
[7] J. Han and M. Kamber, Data Mining: Concepts and Techniques, Morgan Kaufmann, 2001.
[8] P. K. Dey and S. Mukhopadhyay, “Modified Deviation Approach to Deal with Missing Attribute values in Data Mining with Different percentage of Missing Values”, International Journal of Computer Applications, Vol.73, No. 5, pp. 1-6, 2013.
[9] J. V. Hulse and T. M. Khoshgoftaar, “Incomplete-case nearest neighbor imputation in software measurement data”, Information Sciences, Vol. 259, No. 2, pp. 596-610, 2014.
[10] R. R. Andridge and R. J. A. Little, “A review of hot deck imputation for survey non- response”, International Statistical Review, Vol.78, No.1, pp. 40-64, 2010.
[11] A. Dempster, N. Laired and D. Rubin, “Maximum likelihood from incomplete data via the EM algorithm”, Journal of the Royal Statistical Society, Vol.39, No.1, pp. 1-38, 1977.
[12] D. B. Rubin, Multiple Imputation for Nonresponse in Surveys, Wiley, New York, 1987.
[13] J. L. Schafer, “Multiple imputation: a primer”, Statistical Methods in Medical Research, Vol. 8, pp. 3–15, 1999.
[14] W. Zhou, W. Zhang and Y. Fu , “An incomplete data analysis approach using rough set theory”, Intelligent Mechatronics and Automation, IEEE, pp.332-338, 2004.
[15] M. Kryszkiewicz, “Rough set approach to incomplete information systems”, Information Sciences, Elsevier, Vol. 112, pp.39-49, 1998.
[16] J. W. Grzymala-Busse, “Three Approaches to Missing Attribute Values— A Rough Set Perspective”, In: Proc. of the Workshop on Foundations of Data Mining, associated with the fourth IEEE International Conference on Data Mining, Brighton, UK, November 1–4, 2004.
[17] J. W. Grzymala-Busse and S. Siddhaye, “Rough Set Approaches to Rule Induction from Incomplete Data”, In: Proceedings of the IPMU`2004, the 10th International Conference on Information Processing and Management of Uncertainty in Knowledge-Based Systems, Perugia, Italy, July 4–9, vol. 2, 923–930, 2004.
[18] M. Amiri and R. Jensen, “Missing data imputation using fuzzy-rough methods”, Neurocomputing, Elsevier, Vol.205, pp. 152-164, 2016.
[19] D. Dubois, H. Prade, “Rough fuzzy sets and fuzzy rough sets”, International Journal of General Systems, Vol.17, No.1, pp. 191–209, 1990
[20] D.S. Yeung, D.G. Chen, E.C.C. Tsang, J.W.T. Lee and X.Z. Wang, “On the generalization of fuzzy rough sets”, IEEE Trans. Fuzzy Systems, Vol.13, No.3, pp. 343–361, 2005.
[21] W. Wu and W. Zhang, “Constructive and axiomatic approaches of fuzzy approximation operators”, Information Sciences, Vol.159, pp. 233–254, 2004.
[22] R. Jensen, Q. Shen, “Fuzzy-rough attribute reduction with application to web categorization”, Fuzzy Sets and Systems, Elsevier, Vol.141, pp. 469–485, 2004.
[23] Q. Shen, R. Jensen, “On robust fuzzy rough set models”, Pattern Recognition, Vol. 37(7), pp. 1351–1363, 2004.
[24] R. Jensen, Q. Shen, “Semantics-preserving dimensionality reduction: Rough and fuzzy-rough-based approaches”, IEEE Transactions on Knowledge and Data Engineering, Vol.16, pp. 1457-1471, 2004.
[25] R. B. Bhatt, M. Gopal, “On fuzzy-rough sets approach to feature selection”, Pattern Recognition Letter, Vol.26, No.7, pp. 965–975, 2005.
[26] R.B. Bhatt, M. Gopal, “On the compact computational domain of fuzzy rough sets”, Pattern Recognition Letter, Vol.26, No.11, pp. 1632–1640, 2005.
[27] R. Jensen, Q. Shen, “New approaches to fuzzy-rough feature selection”, IEEE Transactions on Fuzzy Systems, Vol.17, No.4, pp. 824-838, 2009.
[28] A. Mieszkowicz-Rolka and L. Rolka, “Variable precision fuzzy rough sets”, Transactions on Rough sets, Springer, Vol. LNCS-3100, pp. 144-160,2004.
[29] S. Zhao, E. C. C. Tsang, and D. Chen, “The model of fuzzy variable precision rough sets”, IEEE Transactions on Fuzzy Systems, Vol.17, No2, pp. 451–467, 2009.
[30] Q. H. Hu, D. R. Yu, Z. X. Xie, and J. F. Liu, “Fuzzy probabilistic approximation spaces and their information measures”, IEEE Transactions on Fuzzy Systems, Vol.14, No.2, pp. 191–201, 2006.
[31] Q. H. Hu, D. R. Yu and Z. X. Xie , “Information-preserving hybrid data reduction based on fuzzy-rough techniques”, Pattern Recognition Letters, Vol.27, No.5, pp. 414–423, 2006.
[32] Q. Hu., Z. Xie and Daren Yu, “Hybrid attribute reduction based on a novel fuzzy-rough model and information granulation”, Pattern Recognition, Vol.40, pp. 3509 – 3521, 2007.
[33] P. Maji and P. Garai, “Fuzzy Rough simultaneous Attribute selection and Feature Extraction Algorithm”, IEEE Transactions on Cybernetics, Vol.43, No.4, pp. 1166–1177, 2013.
[34] E. C. C. Tsang, D. G. Chen, D. S. Yeung and X. Z. Wang, “Fuzzy probabilistic approximation spaces and their information measures”, IEEE Transactions on Fuzzy Systems, Vol.16, No.5, pp. 1130–1141, 2008.
[35] D. Chen, L. Zhang, S. Zhao, Q. Hu and P. Zhu, “A noval algorithm for finding reducts with fuzzy rough sets”, IEEE Transactions on Fuzzy Systems, Vol.20, No.2, pp. 385–389, 2012.
[36] C. C. Nghia and N. L. Giang, “Fuzzy Rough set based Attribute Reduction in Fuzzy Decision Tables”, International Journal of Computer Applicationss, Vol.132, No.4, pp. 32–37, 2015.
[37] E. C. C. Tsang, D. Chen, D. S. Young, X. Wang and J. Lee, “Attribute Reduction Using Fuzzy Rough Sets”, IEEE Transactions on Fuzzy Systems, Vol.16, No.5, pp. 1130-1140, 2008.