Open Access   Article Go Back

Comparative Study on Speculative Execution Strategy to Improve MapReduce Performance

Rahul R. Ghuleand Sachin N. Deshmukh1

Section:Research Paper, Product Type: Journal Paper
Volume-3 , Issue-3 , Page no. 197-200, Mar-2015

Online published on Mar 31, 2015

Copyright © Rahul R. Ghuleand Sachin N. Deshmukh . This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

View this paper at   Google Scholar | DPI Digital Library

How to Cite this Paper

  • IEEE Citation
  • MLA Citation
  • APA Citation
  • BibTex Citation
  • RIS Citation

IEEE Style Citation: Rahul R. Ghuleand Sachin N. Deshmukh, “Comparative Study on Speculative Execution Strategy to Improve MapReduce Performance,” International Journal of Computer Sciences and Engineering, Vol.3, Issue.3, pp.197-200, 2015.

MLA Style Citation: Rahul R. Ghuleand Sachin N. Deshmukh "Comparative Study on Speculative Execution Strategy to Improve MapReduce Performance." International Journal of Computer Sciences and Engineering 3.3 (2015): 197-200.

APA Style Citation: Rahul R. Ghuleand Sachin N. Deshmukh, (2015). Comparative Study on Speculative Execution Strategy to Improve MapReduce Performance. International Journal of Computer Sciences and Engineering, 3(3), 197-200.

BibTex Style Citation:
@article{Deshmukh_2015,
author = {Rahul R. Ghuleand Sachin N. Deshmukh},
title = {Comparative Study on Speculative Execution Strategy to Improve MapReduce Performance},
journal = {International Journal of Computer Sciences and Engineering},
issue_date = {3 2015},
volume = {3},
Issue = {3},
month = {3},
year = {2015},
issn = {2347-2693},
pages = {197-200},
url = {https://www.ijcseonline.org/full_paper_view.php?paper_id=448},
publisher = {IJCSE, Indore, INDIA},
}

RIS Style Citation:
TY - JOUR
UR - https://www.ijcseonline.org/full_paper_view.php?paper_id=448
TI - Comparative Study on Speculative Execution Strategy to Improve MapReduce Performance
T2 - International Journal of Computer Sciences and Engineering
AU - Rahul R. Ghuleand Sachin N. Deshmukh
PY - 2015
DA - 2015/03/31
PB - IJCSE, Indore, INDIA
SP - 197-200
IS - 3
VL - 3
SN - 2347-2693
ER -

VIEWS PDF XML
2546 2327 downloads 2490 downloads
  
  
           

Abstract

MapReduce is widely used and popular programming model for huge amount of data processing. Hadoop is open source implementation of MapReduce framework. Performance of Hadoop depends some of the metrics like job execution time and cluster throughput. In MapReduce, Job is divided into multiple map and reduce tasks. Some tasks can be executed slowly due to internal or external reasons. Because of this slow tasks job execution time is prolonged which leads to degradation of Hadoop performance. To overcome this, current MapReduce framework launch speculative execution in which each slow tasks is backed up other node in order to reduce the job execution time. These slow tasks can be called as straggler tasks. However, current MapReduce speculative execution does not estimate the progress of the tasks properly which leads to identifying incorrect slow tasks. Also, they do not consider data skew among the tasks. This paper studies various speculative execution strategy like HAT (History based auto-tuning), Longest Approximate Time to End (LATE) and Maximum Cost Performance (MCP). These strategies overcome the drawbacks of default speculative execution to improve MapReduce performance.

Key-Words / Index Term

MapReduce, Hadoop, Straggler, speculative execution

References

[1] Qi Chen, Cheng Liu and Zhen Xiao, “Improving MapReduce performance using smart speculative execution strategy”, IEEE Transaction on Computers VOL 63, NO. 4, APRIL 2014.
[2] Apache hadoop, http://hadoop.apache.org/, Accessed on 26 December 2015
[3] J. Dean and S. Ghemawat, “Mapreduce: Simplified Data Processing on Large Clusters,” Comm. ACM, vol. 51, pp. 107-113, Jan. 2008.
[4] Exponential Weighted Moving Average, http://en.wikipedia.org/wiki1, Accessed on 7 January 2015
[5] MapReduce. [Online] Available: http://www.ibm.com, Accessed on 15January 2015
[6] G. Ananthanarayana, S. Kandula, A. Greenberg, I. Stocia, Y. Lu, B.Saha, and E. Harris, “Reining in the Outliers in Mapreduce Clusters Using Mantri” Proc. Inth USENIX Conf. Operating System Design and implementation, (OSDI ‘10), 2010.
[7] M. Zaharia, A. Konwinski, A. D. Joseph, R. Katz, and I. Stoica, “Improving MapReduce Performance in Heterogeneous Environments,” in Proc. of the 8th USENIX conference on Operating systems design and implementation , ser. OSDI, 2008.
[8] Zhe Wang, Zhengdong Zhu, Pengfei Zheng, Qiang Liu, Xiaoshe Dong, “New Scheduler Strategy for Heterogeneous Workload-aware in Hadoop,” 8th Annual ChinaGrid Conference, 2013.
[9] Huanle Xu, Wing Cheong Lau, “Optimization for Speculative Execution of Multiple Jobs in a MapReduce-like Cluster,” 8th Annual ChinaGrid Conference, 2013.
[10] Xuelian Lin, Chunming Hu, Richong Zhang, Chengzhang Wang, “Modeling the Performance of MapReduce under Resource Contentions and Task Failures,” Cloud Computing Technology and Science (CloudCom), IEEE 5th International Conference on (Vol 1 ), December 2013.
[11] Tao Gu, Chuang Zuo, Qun Liao, Yulu Yangand Tao Li, “Improving MapReduce Performance by Data Prefetching in Heterogeneous or Shared Environments”, International Journal of Grid and Distributed ComputingVol.6, No.5, 2013.
[12] G. Ananthanarayanan, S. Kandula, A. Greenberg, I. Stoica, Y. Lu, B. Saha, and E. Harris, “Reining in the Outliers in Map-Reduce Clusters Using Mantri,” Proc. Ninth USENIX Conf. Operating Systems Design and Implementation (OSDI), 2010.
[13] Y. Kwon, M. Balazinska, and B. Howe, “A Study of Skew in Mapreduce Applications,” Proc. Fifth Open Cirrus Summit, 2011.
[14] Open Stack Cloud Operating System, http://www.openstack.org/, Accessed on 13 February 2015.
[15] Amazon Elastic Compute Cloud (EC2), http://aws.amazon.com/ec2/,Accessed on 28 January 2015
[16] Quan Chen, MinyiGuo, Qianni Deng, Long Zheng, Song Guo, Yao Shen, “HAT: History-based auto-tunningMapReduce in heterogenous environments” Springer Science+Business media, LLC, 2011.