Publication details
- Free-Surface Lattice Boltzmann Simulation on Many-Core Architectures (Martin Schreiber, Philipp Neumann, Stefan Zimmer, Hans-Joachim Bungartz), In 2011 International Conference on Computational Science, Procedia Computer Science (4), pp. 984–993, Elsevier, ICCS 2011, Singapore, 2011
Publication details – DOI
Abstract
Current advances in many-core technologies demand simulation algorithms suited for the corresponding architectures while with regard to the respective increase of computational power, real-time and interactive simulations become possible and desirable. We present an OpenCL implementation of a Lattice-Boltzmann-based free-surface solver for GPU architectures. The massively parallel execution especially requires special techniques to keep the interface region consistent, which is here addressed by a novel multipass method. We further compare different memory layouts according to their performance for both a basic driven cavity implementation and the free-surface method, pointing out the capabilities of our implementation in real-time and interactive scenarios, and shortly present visualizations of the flow, obtained in real-time.
BibTeX
@inproceedings{FLBSOMASNZ11, author = {Martin Schreiber and Philipp Neumann and Stefan Zimmer and Hans-Joachim Bungartz}, title = {{Free-Surface Lattice Boltzmann Simulation on Many-Core Architectures}}, year = {2011}, booktitle = {{2011 International Conference on Computational Science}}, publisher = {Elsevier}, series = {Procedia Computer Science}, number = {4}, pages = {984--993}, conference = {ICCS 2011}, location = {Singapore}, doi = {http://dx.doi.org/10.1016/j.procs.2011.04.104}, abstract = {Current advances in many-core technologies demand simulation algorithms suited for the corresponding architectures while with regard to the respective increase of computational power, real-time and interactive simulations become possible and desirable. We present an OpenCL implementation of a Lattice-Boltzmann-based free-surface solver for GPU architectures. The massively parallel execution especially requires special techniques to keep the interface region consistent, which is here addressed by a novel multipass method. We further compare different memory layouts according to their performance for both a basic driven cavity implementation and the free-surface method, pointing out the capabilities of our implementation in real-time and interactive scenarios, and shortly present visualizations of the flow, obtained in real-time.}, }