Local memory-aware kernel perforation

Maier, Daniel; Cosenza, Biagio; Juurlink, Ben

Local memory-aware kernel perforation

dc.contributor.author	Maier, Daniel
dc.contributor.author	Cosenza, Biagio
dc.contributor.author	Juurlink, Ben
dc.date.accessioned	2018-06-04T15:23:14Z
dc.date.available	2018-06-04T15:23:14Z
dc.date.issued	2018
dc.description.abstract	Many applications provide inherent resilience to some amount of error and can potentially trade accuracy for performance by using approximate computing. Applications running on GPUs often use local memory to minimize the number of global memory accesses and to speed up execution. Local memory can also be very useful to improve the way approximate computation is performed, e.g., by improving the quality of approximation with data reconstruction techniques. This paper introduces local memory-aware perforation techniques specifically designed for the acceleration and approximation of GPU kernels. We propose a local memory-aware kernel perforation technique that first skips the loading of parts of the input data from global memory, and later uses reconstruction techniques on local memory to reach higher accuracy while having performance similar to state-of-the-art techniques. Experiments show that our approach is able to accelerate the execution of a variety of applications from 1.6× to 3× while introducing an average error of 6%, which is much smaller than that of other approaches. Results further show how much the error depends on the input data and application scenario, the impact of local memory tuning and different parameter configurations.	en
dc.identifier.isbn	978-1-4503-5617-6
dc.identifier.uri	https://depositonce.tu-berlin.de/handle/11303/7911
dc.identifier.uri	http://dx.doi.org/10.14279/depositonce-7072
dc.language.iso	en	en
dc.rights.uri	http://rightsstatements.org/vocab/InC/1.0/	en
dc.subject.ddc	004 Datenverarbeitung; Informatik	de
dc.subject.other	approximate computing	en
dc.subject.other	GU	en
dc.subject.other	kernel perforation	en
dc.title	Local memory-aware kernel perforation	en
dc.type	Conference Object	en
dc.type.version	acceptedVersion	en
dcterms.bibliographicCitation.doi	10.1145/3168814	en
dcterms.bibliographicCitation.originalpublishername	Association for Computing Machinery (ACM)	en
dcterms.bibliographicCitation.originalpublisherplace	New York, NY, USA	en
dcterms.bibliographicCitation.pageend	287	en
dcterms.bibliographicCitation.pagestart	278	en
dcterms.bibliographicCitation.proceedingstitle	Proceedings of 2018 IEEE/ACM International Symposium on Code Generation and Optimization (CGO’18)	en
tub.accessrights.dnb	free	en
tub.affiliation	Fak. 4 Elektrotechnik und Informatik::Inst. Technische Informatik und Mikroelektronik::FG Architektur eingebetteter Systeme	de
tub.affiliation.faculty	Fak. 4 Elektrotechnik und Informatik	de
tub.affiliation.group	FG Architektur eingebetteter Systeme	de
tub.affiliation.institute	Inst. Technische Informatik und Mikroelektronik	de
tub.publisher.universityorinstitution	Technische Universität Berlin	en

Files

Original bundle

Now showing 1 - 1 of 1

Name:: MaierCGO18.pdf
Size:: 688.83 KB
Format:: Adobe Portable Document Format
Description:

Download

License bundle

Now showing 1 - 1 of 1

Name:: license.txt
Size:: 4.9 KB
Format:: Item-specific license agreed upon to submission
Description:

Download

Collections

Publications