Designing Coalescing Network-on-Chip for Efficient Memory Accesses of GPGPUs
Document Type
Conference Proceeding
Publication Date
2014
Publication Title
Network and Parallel Computing
Abstract
The massive multithreading architecture of General Purpose Graphic Processors Units (GPGPU) makes them ideal for data parallel computing. However, designing efficient GPGPU chips poses many challenges. One major hurdle is the interface to the external DRAM, particularly the buffers in the memory controllers (MCs), which is stressed heavily by the many concurrent memory accesses from the GPGPU. Previous approaches considered scheduling the memory requests in the memory buffers to reduce switching of memory rows. The problem is that the window of requests that can be considered for scheduling is too narrow and the memory controller is very complex, affecting the critical path. In view of the massive multithreading architecture of GPGPUs that can hide memory access latencies, we exploit in this paper the novel idea of rearranging the memory requests in the network-on-chip (NoC), called packet coalescing. To study the feasibility of this idea, we have designed an expanded NoC router that supports packet coalescing and evaluated its performance extensively. Evaluation results show that this NoC-assisted design strategy can improve the row buffer hit rate in the memory controllers. A comprehensive investigation of factors affecting the performance of coalescing is also conducted and reported.
Repository Citation
Chen, Chien-Ting; Huang, Yoshi Shih-Chieh; Chang, Yuan-Ying; Tu, Chiao-Yun; King, Chung-Ta; Wang, Tai-Yuan; Sang, Janche; and Li, Ming-Hua, "Designing Coalescing Network-on-Chip for Efficient Memory Accesses of GPGPUs" (2014). Electrical and Computer Engineering Faculty Publications. 356.
https://engagedscholarship.csuohio.edu/enece_facpub/356
Original Citation
C. Chen, Y. S. Huang, Y. Chang, C. Tu, C. King, T. Wang, J. Sang and M. Li, "Designing coalescing network-on-chip for efficient memory accesses of GPGPUs," in Network and Parallel Computing: 11th IFIP WG 10.3 International Conference, NPC 2014, Ilan, Taiwan, September 18-20, 2014. Proceedings, C. Hsu, X. Shi and V. Salapura, Eds. Berlin, Heidelberg: Springer Berlin Heidelberg, 2014, pp. 169-180.
DOI
10.1007/978-3-662-44917-2_15
Comments
11th IFIP WG 10.3 International Conference, NPC 2014, Ilan, Taiwan, September 18-20, 2014. Proceedings