dc.contributor.author | Grosset, A. V. Pascal | en_US |
dc.contributor.author | Prasad, Manasa | en_US |
dc.contributor.author | Christensen, Cameron | en_US |
dc.contributor.author | Knoll, Aaron | en_US |
dc.contributor.author | Hansen, Charles | en_US |
dc.contributor.editor | C. Dachsbacher and P. Navrátil | en_US |
dc.date.accessioned | 2015-05-24T19:41:17Z | |
dc.date.available | 2015-05-24T19:41:17Z | |
dc.date.issued | 2015 | en_US |
dc.identifier.uri | http://dx.doi.org/10.2312/pgv.20151157 | en_US |
dc.description.abstract | Modern supercomputers have very powerful multi-core CPUs. The programming model on these supercomputers is switching from pure MPI to MPI for inter-node communication, and shared memory and threads for intra-node communication. Consequently, the bottleneck in most systems is no longer computation but communication between nodes. In this paper, we present a new compositing algorithm for hybrid MPI parallelism that focuses on communication avoidance and overlapping communication with computation at the expense of evenly balancing the workload. The algorithm has three stages: a direct send stage where nodes are arranged in groups and exchange regions of an image, followed by a tree compositing stage and a gather stage. We compare our algorithm with radix-k and binary-swap from the IceT library in a hybrid OpenMP/MPI setting, show strong scaling results and explain how we generally achieve better performance than these two algorithms. | en_US |
dc.publisher | The Eurographics Association | en_US |
dc.subject | I.3.1 [Computer Graphics] | en_US |
dc.subject | Hardware Architecture | en_US |
dc.subject | Parallel processing | en_US |
dc.subject | I.3.2 [Computer Graphics] | en_US |
dc.subject | Graphics Systems | en_US |
dc.subject | Distributed/network graphics | en_US |
dc.title | TOD-Tree: Task-Overlapped Direct send Tree Image Compositing for Hybrid MPI Parallelism | en_US |
dc.description.seriesinformation | Eurographics Symposium on Parallel Graphics and Visualization | en_US |
dc.description.sectionheaders | Improved Algorithms | en_US |
dc.identifier.doi | 10.2312/pgv.20151157 | en_US |
dc.identifier.pages | 67-76 | en_US |