Open Access   Article Go Back

A Critical Study of Big Data Techniques and Predictive Analytics Algorithms

B. Jogeswara Rao1 , M.S. Prasad Babu2

Section:Review Paper, Product Type: Journal Paper
Volume-6 , Issue-12 , Page no. 695-700, Dec-2018

CrossRef-DOI:   https://doi.org/10.26438/ijcse/v6i12.695700

Online published on Dec 31, 2018

Copyright © B. Jogeswara Rao, M.S. Prasad Babu . This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

View this paper at   Google Scholar | DPI Digital Library

How to Cite this Paper

  • IEEE Citation
  • MLA Citation
  • APA Citation
  • BibTex Citation
  • RIS Citation

IEEE Style Citation: B. Jogeswara Rao, M.S. Prasad Babu, “A Critical Study of Big Data Techniques and Predictive Analytics Algorithms,” International Journal of Computer Sciences and Engineering, Vol.6, Issue.12, pp.695-700, 2018.

MLA Style Citation: B. Jogeswara Rao, M.S. Prasad Babu "A Critical Study of Big Data Techniques and Predictive Analytics Algorithms." International Journal of Computer Sciences and Engineering 6.12 (2018): 695-700.

APA Style Citation: B. Jogeswara Rao, M.S. Prasad Babu, (2018). A Critical Study of Big Data Techniques and Predictive Analytics Algorithms. International Journal of Computer Sciences and Engineering, 6(12), 695-700.

BibTex Style Citation:
@article{Rao_2018,
author = {B. Jogeswara Rao, M.S. Prasad Babu},
title = {A Critical Study of Big Data Techniques and Predictive Analytics Algorithms},
journal = {International Journal of Computer Sciences and Engineering},
issue_date = {12 2018},
volume = {6},
Issue = {12},
month = {12},
year = {2018},
issn = {2347-2693},
pages = {695-700},
url = {https://www.ijcseonline.org/full_paper_view.php?paper_id=3399},
doi = {https://doi.org/10.26438/ijcse/v6i12.695700}
publisher = {IJCSE, Indore, INDIA},
}

RIS Style Citation:
TY - JOUR
DO = {https://doi.org/10.26438/ijcse/v6i12.695700}
UR - https://www.ijcseonline.org/full_paper_view.php?paper_id=3399
TI - A Critical Study of Big Data Techniques and Predictive Analytics Algorithms
T2 - International Journal of Computer Sciences and Engineering
AU - B. Jogeswara Rao, M.S. Prasad Babu
PY - 2018
DA - 2018/12/31
PB - IJCSE, Indore, INDIA
SP - 695-700
IS - 12
VL - 6
SN - 2347-2693
ER -

VIEWS PDF XML
285 209 downloads 184 downloads
  
  
           

Abstract

Big data is defined as the collection of a broad set of data. The tremendous increase in the usage of the internet over social media applications and forums such as mailing system, e-collection of research scholar articles, retrieval and online transaction data in the field of health leads to high exponential growth in the storage of data. These vast collections of data may lead to arising problems in big data analytics. Subsequently, the predictions based on unknown future events were performed by using Predictive analytics. This approach is found to utilize numerous techniques such as machine learning, statistics, data mining, modelling, and artificial intelligence in analysing the data for predicting the future. However, in past few decades, there have been significant developments in various techniques, architecture, tools, and platforms for managing the enormous amount of big data and to predict its future events considering predictive analytic algorithms. This paper provides a detailed survey of existing techniques, computing tools used in big data analysis and predictive analytic algorithms with its advantages and limitations. Further, this paper discusses the essential aspects considered to overcome the analytic data problems regarding availability and scalability and its various applications

Key-Words / Index Term

Bigdata, Machine learning algorithms, predictive analytics

References

[1] Lyman P, Varian H. How much information 2003? Tech. Rep, 2004. [Online]. Available: http://www2.sims.berkeley.edu/research/projects/how-much-info-2003/printable_report.pdf.
[2] Xu, R., & Wunsch, D. II.(2009). Clustering. Hoboken.
[3] Kaisler, S., Armour, F., Espinosa, J. A., & Money, W. (2013, January). Big data: Issues and challenges moving forward. In System sciences (HICSS), 2013 46th Hawaii international conference on (pp. 995-1004). IEEE.
[4] Huang, W., Wang, H., Zhang, Y., & Zhang, S. (2017). A novel cluster computing technique based on signal clustering and analytic hierarchy model using hadoop. Cluster Computing, 1-8.
[5] Gao, S., Li, L., Li, W., Janowicz, K., & Zhang, Y. (2017). Constructing gazetteers from volunteered big geo-data based on Hadoop. Computers, Environment and Urban Systems, 61, 172-186.
[6] Ji, C., Xiong, Z., Fang, C., Hui, L. V., & Zhang, K. (2017, July). A GPU Based Parallel Clustering Method for Electric Power Big Data. In Information Science and Control Engineering (ICISCE), 2017 4th International Conference on (pp. 29-33). IEEE.
[7] Malakar, R., & Vydyanathan, N. (2013, February). A CUDA-enabled Hadoop cluster for fast distributed image processing. In Parallel Computing Technologies (PARCOMPTECH), 2013 National Conference on (pp. 1-5). IEEE.
[8] Iqbal, M. H., & Soomro, T. R. (2015). Big data analysis: Apache storm perspective. International journal of computer trends and technology, 19(1), 9-14.
[9] Requeno, J. I., Merseguer, J., & Bernardi, S. (2017, August). Performance Analysis of Apache Storm Applications using Stochastic Petri Nets. In Information Reuse and Integration (IRI), 2017 IEEE International Conference on (pp. 411-418). IEEE.
[10] Shvachko, K., Kuang, H., Radia, S., & Chansler, R. (2010, May). The hadoop distributed file system. In Mass storage systems and technologies (MSST), 2010 IEEE 26th symposium on (pp. 1-10). Ieee.
[11] Zhang, T., Ramakrishnan, R., & Livny, M. (1997). BIRCH: A new data clustering algorithm and its applications. Data Mining and Knowledge Discovery, 1(2), 141-182.
[12] Chakraborty, S., & Nagwani, N. K. (2014). Analysis and study of Incremental DBSCAN clustering algorithm. arXiv preprint arXiv:1406.4754.
[13] Chakraborty, S., & Nagwani, N. K. (2014). Analysis and study of Incremental DBSCAN clustering algorithm. arXiv preprint arXiv:1406.4754.
[14] Cui, X., Zhu, P., Yang, X., Li, K., & Ji, C. (2014). Optimized big data K-means clustering using MapReduce. The Journal of Supercomputing, 70(3), 1249-1259.
[15] Rebentrost, P., Mohseni, M., & Lloyd, S. (2014). Quantum support vector machine for big data classification. Physical review letters, 113(13), 130503.
[16] Bolón-Canedo, V., Sánchez-Maroño, N., & Alonso-Betanzos, A. (2015). Recent advances and emerging challenges of feature selection in the context of big data. Knowledge-Based Systems, 86, 33-45.
[17] Jeong, Y. S., Shin, K. S., & Jeong, M. K. (2015). An evolutionary algorithm with the partial sequential forward floating search mutation for large-scale feature selection problems. Journal of The Operational research society, 66(4), 529-538.
[18] Chen, B., Haas, P., & Scheuermann, P. (2002, July). A new two-phase sampling based algorithm for discovering association rules. In Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining (pp. 462-468). ACM.
[19] Lee, G., Yun, U., & Ryu, K. H. (2014). Sliding window based weighted maximal frequent pattern mining over data streams. Expert Systems with Applications, 41(2), 694-708.
[20] Kumar, B., & Kumar, D. (2017). A Matrix based Maximal Frequent Itemset Mining Algorithm without Subset Creation. International Journal of Computer Applications, 159(6).
[21] M.J. Zaki, “SPADE: An efficient algorithm for mining frequent sequences”, Machine learning, 42(1-2), pp.31-60, 2001.
[22] Ayres, J., Flannick, J., Gehrke, J., & Yiu, T. (2002, July). Sequential pattern mining using a bitmap representation. In Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining (pp. 429-435). ACM.
[23] Kumar, A., Sinha, R., Bhattacherjee, V., Verma, D. S., & Singh, S. (2012, March). Modeling using K-means clustering algorithm. In Recent Advances in Information Technology (RAIT), 2012 1st International Conference on (pp. 554-558). IEEE.
[24] Lavanya, B., & Divya, B. (2017). BIG DATA ANALYSIS USING SVM AND K-NN DATA MINING TECHNIQUES. International Journal of Computer Science and Mobile Computing (IJCSMC), 6(1), 84-91.
[25] Boukenze, B., Mousannif, H., & Haqiq, A. Predictive analytics in healthcare system using data mining techniques. Computer Science & Information Technology, 1.