Show simple item record

dc.contributor.authorKopta, D.en_US
dc.contributor.authorShkurko, K.en_US
dc.contributor.authorSpjut, J.en_US
dc.contributor.authorBrunvand, E.en_US
dc.contributor.authorDavis, A.en_US
dc.contributor.editorDeussen, Oliver and Zhang, Hao (Richard)en_US
dc.date.accessioned2015-03-02T19:44:46Z
dc.date.available2015-03-02T19:44:46Z
dc.date.issued2015en_US
dc.identifier.urihttp://dx.doi.org/10.1111/cgf.12458en_US
dc.description.abstractWe propose two hardware mechanisms to decrease energy consumption on massively parallel graphics processors for ray tracing. First, we use a streaming data model and configure part of the L2 cache into a ray stream memory to enable efficient data processing through ray reordering. This increases L1 hit rates and reduces off‐chip memory energy substantially through better management of off‐chip memory access patterns. To evaluate this model, we augment our architectural simulator with a detailed memory system simulation that includes accurate control, timing and power models for memory controllers and off‐chip dynamic random‐access memory . These details change the results significantly over previous simulations that used a simpler model of off‐chip memory, indicating that this type of memory system simulation is important for realistic simulations that involve external memory. Secondly, we employ reconfigurable special‐purpose pipelines that are constructed dynamically under program control. These pipelines use shared execution units that can be configured to support the common compute kernels that are the foundation of the ray tracing algorithm. This reduces the overhead incurred by on‐chip memory and register accesses. These two synergistic features yield a ray tracing architecture that reduces energy by optimizing both on‐chip and off‐chip memory activity when compared to a more traditional approach.We propose two hardware mechanisms to decrease energy consumption on massively parallel graphics processors for ray tracing. First, we use a streaming data model and configure part of the L2 cache into a ray stream memory to enable efficient data processing through ray reordering. This increases L1 hit rates and reduces off‐chip memory energy substantially through better management of off‐chip memory access patterns. Secondly, we employ reconfigurable special‐purpose compute pipelines that are constructed dynamically under program control. These two synergistic features yield a ray tracing architecture that reduces energy by optimizing both on‐chip and off‐chip memory activity when compared to a more traditional approach. To evaluate this model, we augment our architectural simulator with a detailed off‐chip memory system simulation that includes accurate control, timing and power models for memory controllers and DRAM.en_US
dc.publisherCopyright © 2015 The Eurographics Association and John Wiley & Sons Ltd.en_US
dc.titleMemory Considerations for Low Energy Ray Tracingen_US
dc.description.seriesinformationComputer Graphics Forumen_US
dc.description.sectionheadersArticlesen_US
dc.description.volume34en_US
dc.description.number1en_US


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record