
dc.contributor.author: Cederman, Daniel [en_US]
dc.contributor.author: Tsigas, Philippas [en_US]
dc.contributor.editor: David Luebke and John Owens [en_US]
dc.date.accessioned: 2013-10-28T10:19:26Z
dc.date.available: 2013-10-28T10:19:26Z
dc.date.issued: 2008 [en_US]
dc.identifier.isbn: 978-3-905674-09-5 [en_US]
dc.identifier.issn: 1727-3471 [en_US]
dc.identifier.uri: http://dx.doi.org/10.2312/EGGH/EGGH08/057-064 [en_US]
dc.description.abstract: To get maximum performance on many-core graphics processors, it is important to balance the workload evenly so that all processing units contribute equally to the task at hand. This can be hard to achieve when the cost of a task is not known beforehand and when new sub-tasks are created dynamically during execution. With the recent advent of scatter operations and atomic hardware primitives, it is now possible to bring some of the more elaborate dynamic load balancing schemes from the conventional SMP systems domain to the graphics processor domain. We have compared four different dynamic load balancing methods to see which one is best suited to the highly parallel world of graphics processors. Three of these methods were lock-free and one was lock-based. We evaluated them on the task of creating an octree partitioning of a set of particles. The experiments showed that synchronization can be very expensive and that new methods that take more advantage of the graphics processors' features and capabilities might be required. They also showed that lock-free methods achieve better performance than blocking methods and that they can be made to scale with increased numbers of processing units. [en_US]
dc.publisher: The Eurographics Association [en_US]
dc.subject: Categories and Subject Descriptors (according to ACM CCS): C.1.2 [Processor Architectures]: Multiple Data Stream Architectures (Multiprocessors) [en_US]
dc.title: On Dynamic Load Balancing on Graphics Processors [en_US]
dc.description.seriesinformation: Graphics Hardware [en_US]

