E²MC: Entropy Encoding Based Memory Compression for GPUs

dc.contributor.authorLal, Sohan
dc.contributor.authorLucas, Jan
dc.contributor.authorJuurlink, Ben
dc.date.accessioned2018-08-17T06:45:27Z
dc.date.available2018-08-17T06:45:27Z
dc.date.issued2017
dc.description.abstractModern Graphics Processing Units (GPUs) provide much higher off-chip memory bandwidth than CPUs, but many GPU applications are still limited by memory bandwidth. Unfortunately, off-chip memory bandwidth is growing slower than the number of cores and has become a performance bottleneck. Thus, optimizations of effective memory bandwidth play a significant role for scaling the performance of GPUs. Memory compression is a promising approach for improving memory bandwidth which can translate into higher performance and energy efficiency. However, compression is not free and its challenges need to be addressed, otherwise the benefits of compression may be offset by its overhead. We propose an entropy encoding based memory compression (E2MC) technique for GPUs, which is based on the well-known Huffman encoding. We study the feasibility of entropy encoding for GPUs and show that it achieves higher compression ratios than state-of-the-art GPU compression techniques. Furthermore, we address the key challenges of probability estimation, choosing an appropriate symbol length for encoding, and decompression with low latency. The average compression ratio of E2MC is 53% higher than the state of the art. This translates into an average speedup of 20% compared to no compression and 8% higher compared to the state of the art. Energy consumption and energy-delay product are reduced by 13% and 27%, respectively. Moreover, the compression ratio achieved by E2MC is close to the optimal compression ratio given by Shannon's source coding theorem.en
dc.description.sponsorshipEC/H2020/688759/EU/Low-Power Parallel Computing on GPUs 2/LPGPU2en
dc.identifier.urihttps://depositonce.tu-berlin.de/handle/11303/8107
dc.identifier.urihttp://dx.doi.org/10.14279/depositonce-7268
dc.language.isoenen
dc.relation.issupplementtohttp://dx.doi.org/10.14279/depositonce-7075
dc.rights.urihttp://rightsstatements.org/vocab/InC/1.0/en
dc.subject.ddc004 Datenverarbeitung; Informatikde
dc.subject.othermemory compressionen
dc.subject.otherGPUsen
dc.subject.otherHuffman compressionen
dc.subject.othermemory bandwidthen
dc.subject.otherenergy efficiencyen
dc.titleE²MC: Entropy Encoding Based Memory Compression for GPUsen
dc.typeTabular Dataen
tub.accessrights.dnbunknown*
tub.affiliationFak. 4 Elektrotechnik und Informatik::Inst. Technische Informatik und Mikroelektronik::FG Architektur eingebetteter Systemede
tub.affiliation.facultyFak. 4 Elektrotechnik und Informatikde
tub.affiliation.groupFG Architektur eingebetteter Systemede
tub.affiliation.instituteInst. Technische Informatik und Mikroelektronikde

Files

Original bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
E2MC_Data.csv
Size:
6.66 KB
Format:
Comma-separated values
Description:
Raw Research Data from E²MC: Entropy Encoding Based Memory Compression for GPUs
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
2.71 KB
Format:
Item-specific license agreed upon to submission
Description:

Collections