A Roe's flux-difference splitting scheme has been implemented using the NVIDIA CUDA architecture and has been applied to solve the two-dimensional compressible Euler equations. Different standard test cases have been considered in order to estimate the speed-up of GPU computing with respect to CPU calculation. A detailed description of the kernel configuration has been provided and a theoretical analysis of the GPU execution time as a function of the number of threads managed by the kernels is also reported. The loss of performance has been fully described consequent to the use of zero-copy memory. Significant performance improvements have been obtained by using a more recent GPU and CUDA Toolkit. A test case on multi-GPU architecture has been presented in the domain decomposition approach.
Computing unsteady compressible flows using Roe's flux-difference splitting scheme on GPUs
Gianpiero Colonna;
2013
Abstract
A Roe's flux-difference splitting scheme has been implemented using the NVIDIA CUDA architecture and has been applied to solve the two-dimensional compressible Euler equations. Different standard test cases have been considered in order to estimate the speed-up of GPU computing with respect to CPU calculation. A detailed description of the kernel configuration has been provided and a theoretical analysis of the GPU execution time as a function of the number of threads managed by the kernels is also reported. The loss of performance has been fully described consequent to the use of zero-copy memory. Significant performance improvements have been obtained by using a more recent GPU and CUDA Toolkit. A test case on multi-GPU architecture has been presented in the domain decomposition approach.I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.