Open Access   Article Go Back

Hybrid Classification Algorithm for Improved Big Data Processing

Azibator Banigo1 , Nuka Dumle Nwiabu2 , Vincent Ike Anireh3

  1. Department of Computer Science, Rivers State University, Port Harcourt, Nigeria.
  2. Department of Computer Science, Rivers State University, Port Harcourt, Nigeria.
  3. Department of Computer Science, Rivers State University, Port Harcourt, Nigeria.

Section:Research Paper, Product Type: Journal Paper
Volume-11 , Issue-6 , Page no. 30-36, Jun-2023

CrossRef-DOI:   https://doi.org/10.26438/ijcse/v11i6.3036

Online published on Jun 30, 2023

Copyright © Azibator Banigo, Nuka Dumle Nwiabu, Vincent Ike Anireh . This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

View this paper at   Google Scholar | DPI Digital Library

How to Cite this Paper

  • IEEE Citation
  • MLA Citation
  • APA Citation
  • BibTex Citation
  • RIS Citation

IEEE Style Citation: Azibator Banigo, Nuka Dumle Nwiabu, Vincent Ike Anireh, “Hybrid Classification Algorithm for Improved Big Data Processing,” International Journal of Computer Sciences and Engineering, Vol.11, Issue.6, pp.30-36, 2023.

MLA Style Citation: Azibator Banigo, Nuka Dumle Nwiabu, Vincent Ike Anireh "Hybrid Classification Algorithm for Improved Big Data Processing." International Journal of Computer Sciences and Engineering 11.6 (2023): 30-36.

APA Style Citation: Azibator Banigo, Nuka Dumle Nwiabu, Vincent Ike Anireh, (2023). Hybrid Classification Algorithm for Improved Big Data Processing. International Journal of Computer Sciences and Engineering, 11(6), 30-36.

BibTex Style Citation:
@article{Banigo_2023,
author = {Azibator Banigo, Nuka Dumle Nwiabu, Vincent Ike Anireh},
title = {Hybrid Classification Algorithm for Improved Big Data Processing},
journal = {International Journal of Computer Sciences and Engineering},
issue_date = {6 2023},
volume = {11},
Issue = {6},
month = {6},
year = {2023},
issn = {2347-2693},
pages = {30-36},
url = {https://www.ijcseonline.org/full_paper_view.php?paper_id=5590},
doi = {https://doi.org/10.26438/ijcse/v11i6.3036}
publisher = {IJCSE, Indore, INDIA},
}

RIS Style Citation:
TY - JOUR
DO = {https://doi.org/10.26438/ijcse/v11i6.3036}
UR - https://www.ijcseonline.org/full_paper_view.php?paper_id=5590
TI - Hybrid Classification Algorithm for Improved Big Data Processing
T2 - International Journal of Computer Sciences and Engineering
AU - Azibator Banigo, Nuka Dumle Nwiabu, Vincent Ike Anireh
PY - 2023
DA - 2023/06/30
PB - IJCSE, Indore, INDIA
SP - 30-36
IS - 6
VL - 11
SN - 2347-2693
ER -

VIEWS PDF XML
158 222 downloads 99 downloads
  
  
           

Abstract

This paper puts forward a novel approach in big data processing and it is aimed at cutting computational time and enhancing classification accuracy. The research focuses on the relevance and significance of hybrid algorithms, specifically combining the Ball Tree and Weighted k Nearest Neighbors (k-NN) algorithms. The objective of this study is to address the limitations of traditional algorithms by reducing computational time while improving classification accuracy. The methodology employed in this research is the constructive research method, which allows for the development and evaluation of new algorithms. This methodology was chosen as it facilitates the creation of innovative approaches to tackle the challenges of big data processing. Experimental results demonstrate that the proposed hybrid algorithm yields promising outcomes. When classifying the MNIST dataset, the algorithm achieved an accuracy rate of 97%, misclassifying only 256 out of 10,000 images. The harmonic mean between precision and recall was found to be 0.999716, indicating a high level of performance. Notably, the computational time required for classification was significantly shorter compared to traditional classification techniques. Overall, the hybrid algorithm combining the Ball Tree and Weighted k-NN proved to be an effective solution for big data processing. By reducing computational time and enhancing accuracy, it presents a valuable contribution to the field. This research opens avenues for further exploration and application of hybrid algorithms in various domains where efficient and accurate big data processing is crucial.

Key-Words / Index Term

Big Data, Ball Tree Algorithm, Classification, Weighted K-Nearest Neighbours (WKNN), Hybrid Algorithm, K- Nearest Neighbours, MNIST Dataset.

References

[1]. Kalio, Q.P, Nwiabu, N, “A Framework for Securing Data Warehouse Using Hybrid Approach,” International Journal of Computer Science and Mathematical Theory, Vol.4, No.1, pp.3-4, 2019.
[2]. Prasad, Lakshmi Y.” Big Data Analytics Made Easy”. Vol 1, Punjab : Notion Press, 2016
[3]. Farhad, M., Hamid, N., & Bahman, J., “Energy-Efficient Big Data Analytics”., Advances in Computer., Vol.100, 2015
[4]. Nwiabu, N., Adeyanju, I. “User Centred Design Approach to Situation Awareness,” International Journal of Computer Application, Vol.49. Issue.27, pp 26-30, 2012
[5]. Kramer, Oliver. “Dimensionality Reduction with Unsupervised Nearest Neighbors”. “Intelligent Systems Reference Library” First Edition, Vol 51, Berlin : Springer Berlin, Heidelberg, 2013. ISBN 978-3-642-38652-7..
[6]. Kumar , N., Zhang, L., & Nayar, S. “What Is a Good Nearest Neighbors Algorithm for Finding Similar Patches in Images?” [ed.] David Forsyth, Philip Torr and Andrew Zisserman. Marseille : “In the Proceedings of the European Conference on Computer Vision.” Vol.5303, pp.364-379, 2008, Springer, Berlin, Heidelberg, ISBN: 978-3-540-88688-4.
[7]. Dudani, S.A. “The Distance-Weighted k-Nearest-Neighbor Rule.” IEEE Transactions on Systems, Man, and Cybernetics 4, Vols. SMC-6, pp.325-327. May 21, 1975.
[8]. Kiruthika, R., & Vijayakumar, V. “An Enhanced Fuzzy Based Linkage Clustering Algorithm (EFCA) in High Dimensional Data,” International Journal of Computer Sciences and Engineering, Vol.8, No 2, pp.12-17, February 28, 2020, E-ISSN: 2347-2693.
[9]. Gupta, A., & Pratik, G. “Implementation of K-Means Clustering in Big Data Environment,” International Journal of Computer Sciences and Engineering, Vol.7, Issue.11, pp.38-44, November 10, 2019, E-ISSN: 2347-2693.
[10]. Iqbal, R., Doctor, F., More, B., Mahmud, S., Yousuf, U. “Big Data analytics and Computational Intelligence for Cyber–Physical Systems: Recent trends and state of the art applications,” Future Generation Computer Systems, Vol.105, pp.766-778, 2020. https://doi.org/10.1016/j.future.2017.10.021.
[11]. Rawal, B., & Ruchi, A. “Improving Accuracy of Classification Based on C4.5 Decision Tree Algorithm Using Big Data Analytics.” Advances in Intelligent Systems and Computing (AISC),. Vol.711, No 7, pp.203-210. ISBN 978-981-10-8055-5.
[12]. Jothi, J. M., Arockiam, L “A Framework to enhance the Accuracy of Aspect level Sentiment Analysis in Big Data, ”Conference on Inventive Computing and Informatics,” Vol.17, pp.452-457, 2017, ISBN: 978-1-5386-4031-9.
[13]. Guo-Feng, F., Yan-Hui, G., & Jia-Mei, Z. “Application of the Weighted K-Nearest Neighbor Algortihm for Short-Term Load Forecasting,” Multidisciplinary Digital Publishing Institute, Vol.12, Issue.5, pp. 1-19, 2019.
[14]. Chakravarthy, S. S., Bharanidharan, N., & Rajaguru, H. “Deep Learning-Based Metaheuristic Weighted K-Nearest Neighbor Algorithm for the Severity Classification of Breast Cancer,” Innovation and Research in BioMedical Engineering(IRBM), Vol.44, Issue.3 pp.50-62. June 15, 2023 ISSN 1959-0318.
[15]. Naz, G., Muhammad, A., Aldahmani, S., & Khan, Z.A “Weighted k-Nearest Neighbours Ensemble With Added Accuracy and Diversity,” In the Proceedings of the IEEE International Conference on Big Data 2022 , Osaka., pp.3341-3347, 2022.
[16]. Wan, W., & Lee, H. J. “Deep feature representation and ball-tree for face sketch,” International Journal of System Assurance Engineering and Management, Vol.11, pp.818-823. September 3, 2019.
[17]. Giang, N. T., Huong, H. L., Tai, H. P., & Hiep, X. H. “A parallel algorithm for determining the communication radius of an automatic trap based on balltree structure,” In the Proceedings of the Eighth International Conference on Knowledge and Systems Engineering (KSE) Hanoi Vietnam. pp.139-143, 2016.
[18]. Neeraj, K., Li, Z., & Nayar, S. “What is a good Nearest Neighbors Algorithm for Finding Similar Patches in Images.” In the Proceedings of the 10th European Conference on Computer Vision - ECCV 2008, pp.364-378, 2008. 978-3-540-88689-1.
[19]. Nwiabu, N., Alison, I., & Oyeneyin, B. “Modelling Case-Based Reasoning in Situation-Aware Disaster Management,” Communications of the IIMA, Vol.19, Issue.1, pp.1-19, 2021.