dc.contributor.author | Weber, Nicolas | en_US |
dc.contributor.author | Goesele, Michael | en_US |
dc.contributor.editor | Margarita Amor and Markus Hadwiger | en_US |
dc.date.accessioned | 2014-12-16T07:31:19Z | |
dc.date.available | 2014-12-16T07:31:19Z | |
dc.date.issued | 2014 | en_US |
dc.identifier.isbn | 978-3-905674-59-0 | en_US |
dc.identifier.issn | 1727-348X | en_US |
dc.identifier.uri | http://dx.doi.org/10.2312/pgv.20141085 | en_US |
dc.identifier.uri | http://hdl.handle.net/10.2312/pgv.20141085.057-064 | |
dc.description.abstract | The continuing evolution of Graphics Processing Units (GPU) has shown rapid performance increases over the years. But with each new hardware generation, the constraints for programming them efficiently have changed. Programs have to be tuned towards one specific hardware to unleash the full potential. This is time consuming and costly as vendors tend to release a new generation every 18 months. It is therefore important to auto-tune GPU code to achieve GPU-specific improvements. Using either static or empirical profiling to adjust parameters or to change the kernel implementation. We introduce a new approach to automatically improve memory access on GPUs. Our system generates an application specific library which abstracts the memory access for complex arrays on the host and GPU side. This allows to optimize the code by exchanging the memory layout without recompiling the application, as all necessary layouts are pre-compiled into the library. Our implementation is able to speedup real-world applications up to an order of magnitude and even outperforms hand-tuned implementations. | en_US |
dc.publisher | The Eurographics Association | en_US |
dc.subject | D.3.3 [Programming Technique] | en_US |
dc.subject | Language Constructs and Features | en_US |
dc.subject | Data types and structures | en_US |
dc.subject | I.3.1 [Computer Graphics] | en_US |
dc.subject | Hardware Architecture | en_US |
dc.subject | Graphics processors | en_US |
dc.subject | I.3.6 [Computer Graphics] | en_US |
dc.subject | Methodology and Techniques | en_US |
dc.subject | Graphics data structures and data types | en_US |
dc.title | Auto-Tuning Complex Array Layouts for GPUs | en_US |
dc.description.seriesinformation | Eurographics Symposium on Parallel Graphics and Visualization | en_US |