Abstract
In this paper we advocate the Loop-of-stencil-reduce pattern as a way to simplify the parallel programming of heterogeneous platforms (multicore+GPUs). Loop-of-Stencil-reduce is general enough to subsume map, reduce, map-reduce, stencil, stencil-reduce, and, crucially, their usage in a loop. It transparently targets (by using OpenCL) combinations of CPU cores and GPUs, and it makes it possible to simplify the deployment of a single stencil computation kernel on different GPUs. The paper discusses the implementation of Loop-of-stencil-reduce within the FastFlow parallel framework, considering a simple iterative data-parallel application as running example (Game of Life) and a highly effective parallel filter for visual data restoration to assess performance. Thanks to the high-level design of the Loop-of-stencil-reduce, it was possible to run the filter seamlessly on a multicore machine, on multi-GPUs, and on both.
Original language | English |
---|---|
Title of host publication | Proceedings - 14th IEEE International Conference on Trust, Security and Privacy in Computing and Communications, TrustCom 2015 |
Publisher | Institute of Electrical and Electronics Engineers Inc. |
Pages | 172-177 |
Number of pages | 6 |
Volume | 3 |
ISBN (Print) | 9781467379519 |
DOIs | |
Publication status | Published - 02 Dec 2015 |
Event | 14th IEEE International Conference on Trust, Security and Privacy in Computing and Communications, TrustCom 2015 - Helsinki, Finland Duration: 20 Aug 2015 → 22 Aug 2015 |
Conference
Conference | 14th IEEE International Conference on Trust, Security and Privacy in Computing and Communications, TrustCom 2015 |
---|---|
Country/Territory | Finland |
City | Helsinki |
Period | 20/08/2015 → 22/08/2015 |
Keywords
- fastflow
- GPUs
- heterogeneous platforms
- multi-core
- OpenCL
- parallel patterns
- skeletons
ASJC Scopus subject areas
- Computer Networks and Communications