Loading…
Thumbnail Image

On latency in GPU throughput microarchitectures

Andersch, Michael; Lucas, Jan; Álvarez-Mesa, Mauricio; Juurlink, Ben

Modern GPUs provide massive processing power (arithmetic throughput) as well as memory throughput. Presently, while it appears to be well understood how performance can be improved by increasing throughput, it is less clear what the effects of micro-architectural latencies are on the performance of throughput-oriented GPU architectures. In fact, little is publicly known about the values, behavior, and performance impact of microarchitecture latency components in modern GPUs. This work attempts to fill that gap by analyzing both the idle (static) as well as loaded (dynamic) latency behavior of GPU microarchitectural components. Our results show that GPUs are not as effective in latency hiding as commonly thought and based on that, we argue that latency should also be a GPU design consideration besides throughput.
Published in: 2015 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS 2015), 10.1109/ISPASS.2015.7095801, IEEE