Investigating Policies for Performance of Multi-core Processors

Surendra Kumar Shukla, P.K. Chande

Open Access Article Go Back

Investigating Policies for Performance of Multi-core Processors

Surendra Kumar Shukla¹ , P.K. Chande²

Section:Research Paper, Product Type: Journal Paper
Volume-7 , Issue-2 , Page no. 964-980, Feb-2019

CrossRef-DOI: https://doi.org/10.26438/ijcse/v7i2.964980

Online published on Feb 28, 2019

Copyright © Surendra Kumar Shukla, P.K. Chande . This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

View this paper at Google Scholar | DPI Digital Library

XML View

PDF Download

How to Cite this Paper

IEEE Citation
MLA Citation
APA Citation
BibTex Citation
RIS Citation

IEEE Style Citation: Surendra Kumar Shukla, P.K. Chande, “Investigating Policies for Performance of Multi-core Processors,” International Journal of Computer Sciences and Engineering, Vol.7, Issue.2, pp.964-980, 2019.

MLA Style Citation: Surendra Kumar Shukla, P.K. Chande "Investigating Policies for Performance of Multi-core Processors." International Journal of Computer Sciences and Engineering 7.2 (2019): 964-980.

APA Style Citation: Surendra Kumar Shukla, P.K. Chande, (2019). Investigating Policies for Performance of Multi-core Processors. International Journal of Computer Sciences and Engineering, 7(2), 964-980.

BibTex Style Citation:
@article{Shukla_2019,
author = {Surendra Kumar Shukla, P.K. Chande},
title = {Investigating Policies for Performance of Multi-core Processors},
journal = {International Journal of Computer Sciences and Engineering},
issue_date = {2 2019},
volume = {7},
Issue = {2},
month = {2},
year = {2019},
issn = {2347-2693},
pages = {964-980},
url = {https://www.ijcseonline.org/full_paper_view.php?paper_id=3778},
doi = {https://doi.org/10.26438/ijcse/v7i2.964980}
publisher = {IJCSE, Indore, INDIA},
}

RIS Style Citation:
TY - JOUR
DO = {https://doi.org/10.26438/ijcse/v7i2.964980}
UR - https://www.ijcseonline.org/full_paper_view.php?paper_id=3778
TI - Investigating Policies for Performance of Multi-core Processors
T2 - International Journal of Computer Sciences and Engineering
AU - Surendra Kumar Shukla, P.K. Chande
PY - 2019
DA - 2019/02/28
PB - IJCSE, Indore, INDIA
SP - 964-980
IS - 2
VL - 7
SN - 2347-2693
ER -

VIEWS	PDF	XML
712	486 downloads	158 downloads

Bar Line

Abstract

Performance is a critical concern of multi-core systems. There are some issues which affect the performance of multicore systems especially shared resource contention and application to core mapping. To address the performance issues various software and hardware-based policies are proposed in different works of literature. These policies address the particular performance issue through some specific approach in isolation. However, having many performance issues and the corresponding number of policies to solve the issues; it is not clear which policy would be beneficial for a particular situation for application execution. There is a need of investigation & classification of existing policies through various aspects like the approach used to address the performance issues, tools used for profiling the application and metrics used to find the source of performance degradation. The classification of policies could help make static and runtime decisions for addressing different performance issues which arise owing to resource allocation and contention. In this paper, we reviewed various policies employed for performance improvement of multicore systems. Policies like the application to core scheduling, memory allocation, bandwidth allocation, parameter tuning & self-awareness are investigated on various angles and resulted in an in-depth classification which is conferred from the tables. Further, classification could be used to design a holistic policy scheduler which could schedule a policy considering the application workload characteristics in totality. Also, the scheduler could help on performance improvement through scheduling/switching the appropriate policies at run time for application execution while considering the system status.

Key-Words / Index Term

Investigation, Multi-core, Parameter, Policy, Performance

References

[1] D. Geer, "Chip makers turn to multi-core processors", Computer, vol. 38, no. 5, pp. 11-13,2005.
[2] A. Roy, J. Xu, and M. Chowdhury, "Multi-core processors: A new wayforward and challenges", International Conference on Microelectronics, Sharjaha,UAE, pp.454-457, 2008.
[3] G. Blake, R. Dreslinski and T. Mudge, "A survey of multi-core processors", IEEE Signal Processing Magazine, vol. 26, no. 6, pp. 26-37,2009.
[4] S. Hao, Q. Liu, L. Zhang, and J. Wang, "Processes Scheduling on Heterogeneous Multi-core Architecture with Hardware Support", in International Conference on Networking, Architecture, and Storage, China, , pp. 236-241, 2011.
[5] R. Teodorescu and J. Torrellas, "Variation-Aware Application Scheduling and Power Management for Chip Multiprocessors",66 in International Symposium on Computer Architecture, Beijing, China, pp.363-374, 2008.
[6] Y. Cheng, W. Chen, Z. Wang, and Y. Xiang, “Precise contention-aware performance prediction on virtualized multicore system,” Journal of system architecture, vol. 72, pp. 42-50, 2017.
[7] A. Asaduzzaman "Performance modeling of multicore and manycore networked systems," in International Journal of Computer Networks and communications (IJCNC), vol.4, No.2, pp. 53-67, 2012.
[8] S. Prasad, "Program Execution on Reconfigurable Multicore Architectures", Electronic Proceedings in Theoretical Computer Science, vol. 211, pp. 83-91, 2016.
[9] H. Yun, G. Yao, R. Pellizzoni, M. Caccamo and L. Sha, "Memory Bandwidth Management for Efficient Performance Isolation in Multi-Core Platforms", IEEE Transactions on Computers, vol. 65, no. 2, pp. 562-576,2016.
[10] S. Ren, L. Tan, C. Li, Z. Xiao and W. Song, "Leveraging Hardware-Assisted Virtualization for Deterministic Replay on Commodity Multi-Core Processors", IEEE Transactions on Computers, vol. 67, no. 1, pp. 45-58,2018.
[11] M. Pricopi and T. Mitra, "Bahurupi", ACM Transactions on Architecture and Code Optimization, vol. 8, no. 4, pp. 1-21, 2012.
[12] James E. Bennett, Michael J. Flynn, Performance Factors for Superscalar Processors, Stanford University, Stanford, CA,1995
[13] Lachaize, R., Lepers, B. and Quéma, V., “MemProf: a memory profiler for NUMA multicore systems”, In: USENIX ATC`12 Proceedings of the 2012 USENIX conference on Annual Technical Conference. Boston: ACM, pp.5-5, 2012.
[14] R. Knauerhase, P. Brett, B. Hohlt, T. Li, and S. Hahn, "Using OS Observations to Improve Performance in Multicore Systems", IEEE Micro, vol. 28, no. 3, pp. 54-66, 2008.
[15] D. Shelepov and et al., "HASS: A Scheduler for Heterogeneous Multicore Systems", in ACM SIGOPS Operating Systems Review, New York, pp. 66-75, 2009
[16] Becchi M, Crowley P, “Dynamic thread assignment on heterogeneous multiprocessor architectures”, In Proceedings of the 3rd conference on computing frontiers, New York, 2006, pp. 29-40, 2006
[17] Kumar R et al. “Single-ISA heterogeneous multi-core architectures for multithreaded workload performance”, In Proceedings of the 31st annual international symposium on computer architecture, Washington, pp., 64-75, 2004
[18] T.M.Birhanu, Z. Li, H. Sekiya, N. Komuro, Y.-J. Choi, "Efficient thread mapping for heterogeneous multicore iot systems", Mobile Information Systems, vol. 1565, pp. 8, 2017
[19] D. Koufaty, D. Reddy, and S. Hahn, "Bias scheduling in heterogeneous multi-core architectures," in Proc. of the 5th European Conference on Computer Systems, France pp. 125-138, 2010
[20] S. Zhuravlev, S. Blagodarov, and A. Fedorova, “Addressing shared resource contention in multicore processors via scheduling,” in ACM SIGARCH Computer Architecture News, vol. 38, pp. 129–142, 2010.
[21] Z. Majo and T. Gross, "Memory management in NUMA multicore systems: trapped between cache contention and interconnect overhead", in ACM SIGPLAN Notices - ISMM `11, pp.11-20 2011.
[22] Agarwal A, Miller J, Eastep J, Wentziaff D, Kasture H Self-aware computing. Technical report, MIT, 2009.
[23] D. Molka, R. Schöne, D. Hackenberg and W. Nagel, "Detecting memory-boundedness with hardware performance counters", Proceedings of the 8th ACM/SPEC International Conference on Performance Engineering, Italy, pp. 27-38, 2017.
[24] H. Yun, G. Yao, R. Pellizzoni, M. Caccamo, L. Sha, "MemGuard: Memory bandwidth reservation system for efficient performance isolation in multi-core platforms", Proc. Real-Time Embedded Technol. Appl. Symp., USA, pp. 55-64, 2013.
[25] Karcher, T., Pankratius, V.: Auto-Tuning Multicore Applications at Run-Time with a Cooperative Tuner. Technical Report, 2011-4, Karlsruhe Institute of Technology, Germany (2011)
[26] P. Kansakar and A. Munir, “A Design space exploration methodology for parameter optimization in multicore processors,” IEEE Trans. Parallel Distrib. Syst., vol. 29, no. 1, pp. 2–15,2018.
[27] M. Kulkarni, V. Pai, and D. Schuff, “Towards architecture independent metrics for multicore performance analysis,” ACM SIGMETRICS Perform. Eval. Rev., vol. 38, pp. 10-14, 2011.
[28] Y. Wang and K. B. Kent, “A Region-Based Approach to Pipeline Parallelism in Java Programs on Multicores,” Proc. - 2017 25th Euromicro Int. Conf. Parallel, Distrib. Network-Based Process. PDP 2017, Russia, pp. 124–131, 2017.
[29] R. Das et al., "Application-to-core Mapping Policies to Reduce Memory System Interference in Multi-core Systems", HPCA 2013.
[30] Eric Lau , Jason E. Miller , Inseok Choi , Donald Yeung , Saman Amarasinghe,Anant Agarwal, “Multicore performance optimization using partner cores”, Proceedings of the 3rd USENIX conference on Hot topic in parallelism, Berkeley, pp.11-11, 2011,
[31] Da-WeiChang,Ing-ChaoLin,Yu-ShiangChien,Chn-LunLin,A.Su,andChung PingYoung,"CASA:Contention-Aware Scratchpad Memory Allocation for Online Hybrid On-Chip Memory Management", IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, vol. 33, no. 12, pp. 1806-1817, 2014.
[32] S. Gu, Q. Zhuge, J. Yi, J. Hu, E. H.-M. Sha, "Optimizing task and data assignment on multi-core systems with multi-port SPMs", IEEE Trans. Parallel Distrib. Syst., vol. 26, no. 9, pp. 2549-2560, 2015.
[33] N. Ramasubramanian, V. V. Srnivas, and N. Ammasai Gounden, “Performance of Cache Memory Subsystems for Multicore Architectures,” Int. J. Comput. Sci. Eng. Appl., vol. 1, no. 5, pp. 59–71,2011.
[34] V. Kazempour, A. Fedorova, and P. Alagheband, “Performance implications of cache affinity on multicore processors,” Lect. Notes Comput. Sci. (including Subser. Lect. Notes Artif. Intell. Lect. Notes Bioinformatics), vol. 5168, pp. 151–161, 2008.
[35] M. Rawlins, A. Gordon-Ross, "A cache tuning heuristic for multi-core architecture", IEEE transaction on computers, vol. 62, no. 8, pp. 1570-1583, 2013.
[36] M. Rawlins and A. Gordon-Ross, "An application classification guided cache tuning heuristic for multi-core architectures", 17th Asia and South Pacific Design Automation Conference,2012.
[37] K. Huang, K. Wang, D. Zheng, X. Zhang and X. Yan, "Access Adaptive and Thread-Aware Cache Partitioning in Multicore Systems", Electronics, vol. 7, no. 9, pp. 172, 2018.
[38] Y. Song, O. Alavoine, and B. Lin, “Row-buffer hit harvesting in orchestrated last-level cache and DRAM scheduling for heterogeneous multicore systems,” in Proceedings of the 2018 Design, Automation and Test in Europe Conference and Exhibition, Janua, pp. 779–784, 2018
[39] P. Anuradha, H. Rallapalli, and G. Narsimha, “Energy efficient scheduling algorithm for the multicore heterogeneous embedded architectures,” Des. Autom. Embed. Syst., vol. 22, no. 1–2, 2018
[40] Fuad, M., Deb, D. and Baek, J., “Self-Healing by Means of Runtime Execution Profiling”, Proceedings of 14th International Conference on Computer and Information Technology, Dhaka, pp., 202-207, 2011.
[41] A. Ganapathi, K. Datta, A. Fox, and D. A. Patterson, “A case for machine learning to optimize multicore performance,” in Proceedings of the First USENIX conference on Hot topics in parallelism, 2009.
[42] Jain, R., Panda, P. and Subramoney, S. “Cooperative Multi-Agent Reinforcement Learning-Based Co-optimization of Cores, Caches, and On- chip Network”, ACM Transactions on Architecture and Code Optimization, vol. 14, issue-4, pp.1-25, 2017.
[43] N. Huber, F. Brosig, S. Spinner, S. Kounev and M. Bahr, "Model-Based Self- Aware Performance and Resource Management Using the Descartes Modeling Language", IEEE Transactions on Software Engineering, vol. 43, no. 5, pp. 432-452, 2017.
[44] K.Hasan,J.Antonio, and S.Radhakrishnan," A model-driven approach for predicting and analysing the execution efficiency of multi-core processing", International Journal of Computational Science and Engineering, vol. 14, no. 2, pp. 105-125, 2017.
[45] Khondker S. Hasan, John K. Antonio, and Sridhar Radhakrishnan, "A New Multi-core CPU Resource Availability Prediction Model for Concurrent Processes," Lecture Notes in Engineering and Computer Science: Proceedings of The International MultiConference of Engineers and Computer Scientists, Hong Kong, pp. 130-135, 2017.
[46] K. Moazzemi, A. Kanduri, D. Juh´asz, A. Miele, A. M. Rahmani, P. Liljeberg, A. Jantsch, N. Dutt, “Trends in on-chip dynamic resource management”, in21st Euromicro Conference on Digital System Design (DSD), Prague, pp. 62–69, 2018.
[47] Yan-fei Zhu and Xiong-min Tang, "Overview of swarm intelligence”,International Conference on Computer Application and System Modeling, China, pp. 400-403, 2010.
[48] Hariri, B. Khargharia, H. Chen, J. Yang, Y. Zhang, M. Parashar, and H. Liu, "The Autonomic Computing Paradigm", Cluster Computing, vol. 9, no. 1, pp. 5-17, 2006.
[49] Lewis, P.R., Self-aware computing systems: from psychology to engineering. In: Design, Automation and Test in Europe Conference and Exhibition, Switzerland,pp.,1044–1049, 2017.
[50] N. Dutt, A. Jantsch, and S. Sarma, "Toward Smart Embedded Systems", ACM Transactions on Embedded Computing Systems, vol. 15, no. 2, pp. 1-27, 2016.
[51] D. Dasgupta, H. Bedi, and D. Garrett, “A conceptual model of self-monitoring multi-core systems,” Proc. Sixth Annu. Work. Cyber Secur. Inf. Intell. Res. - CSIIRW ’10, pp. 83 1,2010.
[52] O. Mattes and W. Karl, “Evaluating the Self-Optimization Process of the Adaptive Memory Management Architecture Self-aware Memory,” in Proceedings of the 1st Workshop on Resource Awareness and Adaptivity in Multi-Core, Germany pp.16–21, 2014.
[53] G.P. Sunitha, B.P. Vijay Kumar, S.M. Dilip Kumar, "A Nature Inspired Optimal Path Finding Algorithm to Mitigate Congestion in WSNs", International Journal of Scientific Research in Network Security and Communication, Vol.6, Issue.3, pp.50-57, 2018
[54] Chingrace Guite, Kamaljeet Kaur Mangat, "A Study on Energy Efficient VM Allocation in Green Cloud Computing", International Journal of Scientific Research in Computer Science and Engineering, Vol.6, Issue.4, pp.37-40, 2018

Citations	2325
h-index	16
i10-index	47