A Survey on Map Reduce Algorithm for Big data analysis using Hadoop, Pig and Hive utility tools
Ankita Kadre1 , S.R Yadav2
Section:Survey Paper, Product Type: Journal Paper
Volume-3 ,
Issue-10 , Page no. 52-57, Oct-2015
Online published on Oct 31, 2015
Copyright © Ankita Kadre , S.R Yadav . This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
View this paper at Google Scholar | DPI Digital Library
How to Cite this Paper
- IEEE Citation
- MLA Citation
- APA Citation
- BibTex Citation
- RIS Citation
IEEE Citation
IEEE Style Citation: Ankita Kadre , S.R Yadav, “A Survey on Map Reduce Algorithm for Big data analysis using Hadoop, Pig and Hive utility tools,” International Journal of Computer Sciences and Engineering, Vol.3, Issue.10, pp.52-57, 2015.
MLA Citation
MLA Style Citation: Ankita Kadre , S.R Yadav "A Survey on Map Reduce Algorithm for Big data analysis using Hadoop, Pig and Hive utility tools." International Journal of Computer Sciences and Engineering 3.10 (2015): 52-57.
APA Citation
APA Style Citation: Ankita Kadre , S.R Yadav, (2015). A Survey on Map Reduce Algorithm for Big data analysis using Hadoop, Pig and Hive utility tools. International Journal of Computer Sciences and Engineering, 3(10), 52-57.
BibTex Citation
BibTex Style Citation:
@article{Kadre_2015,
author = {Ankita Kadre , S.R Yadav},
title = {A Survey on Map Reduce Algorithm for Big data analysis using Hadoop, Pig and Hive utility tools},
journal = {International Journal of Computer Sciences and Engineering},
issue_date = {10 2015},
volume = {3},
Issue = {10},
month = {10},
year = {2015},
issn = {2347-2693},
pages = {52-57},
url = {https://www.ijcseonline.org/full_paper_view.php?paper_id=703},
publisher = {IJCSE, Indore, INDIA},
}
RIS Citation
RIS Style Citation:
TY - JOUR
UR - https://www.ijcseonline.org/full_paper_view.php?paper_id=703
TI - A Survey on Map Reduce Algorithm for Big data analysis using Hadoop, Pig and Hive utility tools
T2 - International Journal of Computer Sciences and Engineering
AU - Ankita Kadre , S.R Yadav
PY - 2015
DA - 2015/10/31
PB - IJCSE, Indore, INDIA
SP - 52-57
IS - 10
VL - 3
SN - 2347-2693
ER -
![]() |
![]() |
![]() |
2587 | 2401 downloads | 2445 downloads |




Abstract
Data is growing at a rate which cannot be handled by the traditional methods of computing. To store and process such data new data analysis and storage techniques have emerged over the last few years. Hadoop is one such parallel processing open source framework which provides distributed storage and processing of big data. This paper introduces Big Data, a new platform which enables accessing, manipulating, analyzing, and visualizing data residing on a Hadoop cluster. In this paper a survey is done on big data analysis using Hadoop and other utility tools like Pig and Hive. The majority of large-scale data intensive applications executed by data centers are based on Map-Reduce or its open-source implementation, Hadoop. Such applications are executed on large clusters requiring large amounts of energy, making the energy costs a large fraction of the data center’s overall costs. Therefore to minimizing the energy consumption when executing Map-Reduce jobs is a critical concern for data centers. In this survey Flight data has been analyzed in terms of the mentioned parameters such as time complexity and energy consumption information’s are retrieved using Hadoop.
Key-Words / Index Term
Map reduce, big data, Hadoop, HDFS, Pig & Hive, Flight data
References
.
[1] Toshimori Honjo, Kazuki Oikawa "Hardware acceleration of Hadoop Map-Reduce" in 2013 IEEE International Conference on Big Data.
[2] Ming Meng, Jing Gao, Jun-jie Chen "Blast-Parallel: The Parallelizing Implementation Of Sequence Alignment Algorithms Based On Hadoop Platform" in 2013 6th International Conference on Biomedical Engineering and Informatics (BMEI 2013).
[3] Madhury Mohandas, Dhanya P M "An Approach for Log Analysis Based Failure Monitoring in Hadoop Cluster" in 2013 IEEE.
[4] Ilja Kromonov, Pelle Jakovits, Satish Narayana Srirama "NEWT - A Resilient BSP Framework for Iterative Algorithms on Hadoop YARN" in 2014 IEEE.
[5] Lena Mashayekhy, Mahyar Movahed Nejad, Daniel Grosu, Dajun Lu, Weisong Shi "Energy-aware Scheduling of MapReduce Jobs" in 2014 IEEE International Congress on Big Data.
[6] Oscar D. Lara, Weiqiang Zhuang, and Adarsh Pannu "Big R: Large-scale Analytics on Hadoop using R" in 2014 IEEE International Congress on Big Data
[7] Kiran M., Amresh Kumar "Verification and Validation of Parallel Support Vector Machine Algorithm based on Map-Reduce Program Model on Hadoop Cluster" in 2013 International Conference on Advanced Computing and Communication Systems (ICACCS -2013), Dec. 19 – 21, 2013, Coimbatore, INDIA