We present a highly optimized thread-safe lattice Boltzmann model in which the non-equilibrium part of the distribution function is locally reconstructed via recursivity of Hermite polynomials. Such a procedure allows the explicit incorporation of non-equilibrium moments of the distribution up to the order supported by the lattice. Thus, the proposed approach increases accuracy and stability at low viscosities without compromising performance and amenability to parallelization with respect to standard lattice Boltzmann models. The high-order thread-safe lattice Boltzmann is tested on two types of turbulent flows, namely, the turbulent channel flow at R e τ = 180 and the axisymmetric turbulent jet at Re = 7000; it delivers results in excellent agreement with reference data [direct numerical simulations (DNS), theory, and experiments] and (a) achieves peak performance [ ∼ 5 × 10 12 floating point operations (FLOP) per second and an arithmetic intensity of ∼ 7 FLOP / byte on a single graphic processing unit] by significantly reducing the memory footprint, (b) retains the algorithmic simplicity of standard lattice Boltzmann computing, and (c) allows to perform stable simulations at vanishingly low viscosities. Our findings open attractive prospects for high-performance simulations of realistic turbulent flows on GPU-based architectures. Such expectations are confirmed by excellent agreement among lattice Boltzmann, experimental, and DNS reference data.

High-order thread-safe lattice Boltzmann model for high performance computing turbulent flow simulations

Montessori A.;La Rocca M.;Lauricella M.;Tiribocchi A.;Succi S.
2024

Abstract

We present a highly optimized thread-safe lattice Boltzmann model in which the non-equilibrium part of the distribution function is locally reconstructed via recursivity of Hermite polynomials. Such a procedure allows the explicit incorporation of non-equilibrium moments of the distribution up to the order supported by the lattice. Thus, the proposed approach increases accuracy and stability at low viscosities without compromising performance and amenability to parallelization with respect to standard lattice Boltzmann models. The high-order thread-safe lattice Boltzmann is tested on two types of turbulent flows, namely, the turbulent channel flow at R e τ = 180 and the axisymmetric turbulent jet at Re = 7000; it delivers results in excellent agreement with reference data [direct numerical simulations (DNS), theory, and experiments] and (a) achieves peak performance [ ∼ 5 × 10 12 floating point operations (FLOP) per second and an arithmetic intensity of ∼ 7 FLOP / byte on a single graphic processing unit] by significantly reducing the memory footprint, (b) retains the algorithmic simplicity of standard lattice Boltzmann computing, and (c) allows to perform stable simulations at vanishingly low viscosities. Our findings open attractive prospects for high-performance simulations of realistic turbulent flows on GPU-based architectures. Such expectations are confirmed by excellent agreement among lattice Boltzmann, experimental, and DNS reference data.
2024
Istituto Applicazioni del Calcolo ''Mauro Picone''
High performance computing, lattice Boltzmann simulations, turbulent flows
File in questo prodotto:
File Dimensione Formato  
035171_1_5.0202155.pdf

non disponibili

Licenza: NON PUBBLICO - Accesso privato/ristretto
Dimensione 1.86 MB
Formato Adobe PDF
1.86 MB Adobe PDF   Visualizza/Apri   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/510473
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 3
  • ???jsp.display-item.citation.isi??? 4
social impact