Apache Hadoop: A Guide for Cluster Configuration & Testing

Ankit Shah, Mamta Padole

Section: Research Paper, Product Type: Journal Paper
Volume 7, Issue 4, Pages 792-796, April 2019

CrossRef DOI: https://doi.org/10.26438/ijcse/v7i4.792796

Online published on Apr 30, 2019

Copyright © Ankit Shah, Mamta Padole. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

How to Cite this Paper

IEEE Style Citation: Ankit Shah, Mamta Padole, “Apache Hadoop: A Guide for Cluster Configuration & Testing,” International Journal of Computer Sciences and Engineering, Vol.7, Issue.4, pp.792-796, 2019.

MLA Style Citation: Ankit Shah, Mamta Padole. "Apache Hadoop: A Guide for Cluster Configuration & Testing." International Journal of Computer Sciences and Engineering 7.4 (2019): 792-796.

APA Style Citation: Ankit Shah, Mamta Padole (2019). Apache Hadoop: A Guide for Cluster Configuration & Testing. International Journal of Computer Sciences and Engineering, 7(4), 792-796.

BibTex Style Citation:
@article{Shah_2019,
author = {Ankit Shah and Mamta Padole},
title = {Apache Hadoop: A Guide for Cluster Configuration \& Testing},
journal = {International Journal of Computer Sciences and Engineering},
issue_date = {4 2019},
volume = {7},
number = {4},
month = {4},
year = {2019},
issn = {2347-2693},
pages = {792--796},
url = {https://www.ijcseonline.org/full_paper_view.php?paper_id=4118},
doi = {10.26438/ijcse/v7i4.792796},
publisher = {IJCSE, Indore, INDIA},
}

RIS Style Citation:
TY  - JOUR
DO  - 10.26438/ijcse/v7i4.792796
UR  - https://www.ijcseonline.org/full_paper_view.php?paper_id=4118
TI  - Apache Hadoop: A Guide for Cluster Configuration & Testing
T2  - International Journal of Computer Sciences and Engineering
AU  - Shah, Ankit
AU  - Padole, Mamta
PY  - 2019
DA  - 2019/04/30
PB  - IJCSE, Indore, INDIA
SP  - 792
EP  - 796
IS  - 4
VL  - 7
SN  - 2347-2693
ER  - 

Abstract

Apache Hadoop is a widely adopted framework for processing, analyzing, and storing Big Data. Hadoop supports processing through MapReduce, analysis through Apache Spark, and storage through the Hadoop Distributed File System (HDFS). It is popular because of its wide applicability and its ability to run on commodity hardware. However, installing Hadoop on a single node or on a distributed cluster remains a headache for new developers and researchers. In this paper, we present a step-by-step process for running Hadoop on a single node and explain how it can be deployed as a distributed cluster. We implemented and tested the Hadoop framework on a single node and on a cluster of ten (10) nodes. We also explain the primary keywords needed to understand Hadoop concepts.
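
As an illustration of the kind of single-node configuration the abstract refers to, the following is a minimal sketch and not the authors' procedure. It generates the two site files that a pseudo-distributed Hadoop setup typically needs, using the standard `fs.defaultFS` and `dfs.replication` properties. The helper names (`build_site_xml`, `single_node_configs`) and the default host and port are assumptions made for this example; they are not taken from the paper.

```python
import xml.etree.ElementTree as ET


def build_site_xml(properties):
    """Render a dict of Hadoop properties as a *-site.xml document string."""
    root = ET.Element("configuration")
    for name, value in properties.items():
        prop = ET.SubElement(root, "property")
        ET.SubElement(prop, "name").text = name
        ET.SubElement(prop, "value").text = str(value)
    return ET.tostring(root, encoding="unicode")


def single_node_configs(host="localhost", port=9000):
    """Return {filename: xml_text} for a single-node (pseudo-distributed) setup.

    host and port are illustrative defaults (assumptions), not values
    reported in the paper.
    """
    return {
        # Address of the NameNode that HDFS clients connect to.
        "core-site.xml": build_site_xml(
            {"fs.defaultFS": f"hdfs://{host}:{port}"}
        ),
        # A single DataNode can hold only one replica of each block.
        "hdfs-site.xml": build_site_xml({"dfs.replication": 1}),
    }


if __name__ == "__main__":
    for filename, xml_text in single_node_configs().items():
        print(f"--- {filename} ---")
        print(xml_text)
```

For a multi-node cluster, the same sketch could presumably be reused by passing the NameNode's hostname as `host` and raising `dfs.replication`; the values that suit a ten-node cluster depend on the deployment and are not specified here.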

Keywords / Index Terms

Apache Hadoop, Hadoop Cluster Configuration, Hadoop Testing, Hadoop Implementation
