期刊名称:International Journal of Computer Science Issues
印刷版ISSN:1694-0784
电子版ISSN:1694-0814
出版年度:2013
卷号:10
期号:2
出版社:IJCSI Press
摘要:One of the first steps in embedded-system design flow is to choose the most efficient implementation of the embedded software application. However, this is difficult to do at the earliest design stages because particular details of the final many- core HW platform are usually unknown and many possible mappings of the software tasks/threads have to be evaluated. This paper presents a complete framework for early performance estimation of parallel programs in many-core platforms. The proposed framework is based on a specific native-simulation approach oriented to many-core platforms, which enables fast simulation and profiling. The software parallelism is specified in OpenMP, a commonly used application software interface (API) for shared-memory parallel programming. In order to support Non-Uniform Memory Access (NUMA) architectures (which are dominant in high-performance many-core platforms), the paper proposes some OpenMP extensions. These extensions improve performance analysis and facilitate the automatic translation from OpenMP to OpenCL (a low-level API for heterogeneous computing), which are commonly used for NUMA programming). Results show that the proposed OpenMP extension and specific parallel modeling techniques provide reliable results even for NUMA architectures.
关键词:OpenMP;OpenCL; performance analysis; many;core; NUMA; early estimation.