Register Efficient Memory Allocator for GPUs
Abstract
We compare four existing dynamic memory allocators optimized for GPUs and show their strengths and weaknesses. In the measurements we use three generic evaluation tests proposed in the literature and add one with a real workload where dynamic memory allocation is used for building the kd-tree data structure. Following the performance analysis we propose a new dynamic memory allocator and its variants that address the limitations of the existing dynamic memory allocators. The new dynamic memory allocator uses few resources and is targeted towards large and variably sized memory allocations on massively parallel hardware architectures.
BibTeX
@inproceedings {10.2312:hpg.20141090,
booktitle = {Eurographics/ ACM SIGGRAPH Symposium on High Performance Graphics},
editor = {Ingo Wald and Jonathan Ragan-Kelley},
title = {{Register Efficient Memory Allocator for GPUs}},
author = {Vinkler, Marek and Vlastimil, Havran},
year = {2014},
publisher = {The Eurographics Association},
ISSN = {2079-8679},
ISBN = {978-3-905674-60-6},
DOI = {10.2312/hpg.20141090}
}
booktitle = {Eurographics/ ACM SIGGRAPH Symposium on High Performance Graphics},
editor = {Ingo Wald and Jonathan Ragan-Kelley},
title = {{Register Efficient Memory Allocator for GPUs}},
author = {Vinkler, Marek and Vlastimil, Havran},
year = {2014},
publisher = {The Eurographics Association},
ISSN = {2079-8679},
ISBN = {978-3-905674-60-6},
DOI = {10.2312/hpg.20141090}
}