Abstract
This paper describes the deployment on GPUs of PROP, a program of the 2DRMP suite which models electron collisions with H-like atoms and ions. Because performance on GPUs is better in single precision than in double precision, the numerical stability of the PROP program in single precision has been studied. The numerical quality of PROP results computed in single precision and their impact on the next program of the 2DRMP suite has been analyzed. Successive versions of the PROP program on GPUs have been developed in order to improve its performance. Particular attention has been paid to the optimization of data transfers and of linear algebra operations. Performance obtained on several architectures (including NVIDIA Fermi) are presented.
Original language | English |
---|---|
Title of host publication | 25th IEEE International Symposium on Parallel and Distributed Processing |
Subtitle of host publication | IPDPS 2011 |
Publisher | Institute of Electrical and Electronics Engineers (IEEE) |
Pages | 1359-1366 |
Number of pages | 8 |
ISBN (Electronic) | 978-0-7695-4577-6 |
ISBN (Print) | 978-1-61284-425-1 |
DOIs | |
Publication status | Published - May 2011 |
Event | 12th IEEE International Workshop on Parallel and Distributed Scientific and Engineering Computing (PDSEC-11) - Anchorage, Alaska, United States Duration: 16 May 2011 → 20 May 2011 |
Conference
Conference | 12th IEEE International Workshop on Parallel and Distributed Scientific and Engineering Computing (PDSEC-11) |
---|---|
Country | United States |
City | Anchorage, Alaska |
Period | 16/05/2011 → 20/05/2011 |
Bibliographical note
ISSN: 1530-2075ASJC Scopus subject areas
- Computational Theory and Mathematics
- Software
- Theoretical Computer Science