Publication Details

Accelerating Hybrid Local Domain Decomposition for the k-Wave Toolbox on Multi-GPU Systems

KUNÍK, O.; JAROŠ, J. Accelerating Hybrid Local Domain Decomposition for the k-Wave Toolbox on Multi-GPU Systems. Ostrava: 2024. p. 0-0.
Czech title
Zrychlení hybridního lokálního rozložení domény pro k-Wave toolbox na multi-GPU systémech
Type
presentation, poster
Language
English
Authors
Keywords

k-Wave, HPC, Hybrid decomposition, Local decomposition, CUDA, Multi-GPU

Abstract

The k-Wave toolbox is designed for high-fidelity acoustic wave simulations using Fourier collocation for spatial derivatives, but its performance is constrained by communication overhead on multi-CPU and multi-GPU systems. We present a hybrid local domain decomposition approach that partitions the simulation domain into subdomains, each assigned to a GPU with configurable resolution. Using CUDA and cuFFT for Fourier transforms and NVLink for halo exchanges, our method minimizes inter-subdomain communication and accelerates multi-GPU performance. Testing on the Karolina supercomputer shows strong scalability and accuracy, especially with uniform-resolution subdomains, and proves effective even for large-scale simulations beyond single-GPU memory limits.

Published
2024
Pages
1
Conference
8th Users' Conference of IT4Innovations, Ostrava, CZ
Place
Ostrava
BibTeX
@misc{BUT193367,
  author="Oliver {Kuník} and Jiří {Jaroš}",
  title="Accelerating Hybrid Local Domain Decomposition for the k-Wave Toolbox on Multi-GPU Systems",
  year="2024",
  pages="1",
  address="Ostrava",
  url="https://www.fit.vut.cz/research/publication/13293/",
  note="presentation, poster"
}
Files
Back to top