dc.contributor.author | Tzeng, Stanley | en_US |
dc.contributor.author | Patney, Anjul | en_US |
dc.contributor.author | Owens, John D. | en_US |
dc.contributor.editor | Michael Doggett and Samuli Laine and Warren Hunt | en_US |
dc.date.accessioned | 2013-10-28T10:21:22Z | |
dc.date.available | 2013-10-28T10:21:22Z | |
dc.date.issued | 2010 | en_US |
dc.identifier.isbn | 978-3-905674-26-2 | en_US |
dc.identifier.issn | 2079-8687 | en_US |
dc.identifier.uri | http://dx.doi.org/10.2312/EGGH/HPG10/029-037 | en_US |
dc.description.abstract | We explore software mechanisms for managing irregular tasks on graphics processing units (GPUs). We demonstrate that dynamic scheduling and efficient memory management are critical problems in achieving high efficiency on irregular workloads. We experiment with several task-management techniques, ranging from the use of a single monolithic task queue to distributed queuing with task stealing and donation. On irregular workloads, we show that both centralized and distributed queues have more than 100 times as much idle times as our task-stealing and -donation queues. Our preferred choice is task-donation because of comparable performance to task-stealing while using less memory overhead. To help in this analysis, we use an artificial task-management system that monitors performance and memory usage to quantify the impact of these different techniques. We validate our results by implementing a Reyes renderer with its irregular split-and-dice workload that is able to achieve real-time framerates on a single GPU. | en_US |
dc.publisher | The Eurographics Association | en_US |
dc.title | Task Management for Irregular-Parallel Workloads on the GPU | en_US |
dc.description.seriesinformation | High Performance Graphics | en_US |