An automatic input-sensitive approach for heterogeneous task partitioning

dc.contributor.authorKofler, Kofler
dc.contributor.authorGrasso, Ivan
dc.contributor.authorCosenza, Biagio
dc.contributor.authorFahringer, Thomas
dc.date.accessioned2017-10-26T10:38:33Z
dc.date.available2017-10-26T10:38:33Z
dc.date.issued2013
dc.description.abstractUnleashing the full potential of heterogeneous systems, consisting of multi-core CPUs and GPUs, is a challenging task due to the difference in processing capabilities, memory availability, and communication latencies of different computational resources. In this paper we propose a novel approach that automatically optimizes task partitioning for different (input) problem sizes and different heterogeneous multi-core architectures. We use the Insieme source-to-source compiler to translate a single-device OpenCL program into a multi-device OpenCL program. The Insieme Runtime System then performs dynamic task partitioning based on an offline-generated prediction model. In order to derive the prediction model, we use a machine learning approach based on Artificial Neural Networks (ANN) that incorporates static program features as well as dynamic, input sensitive features. Principal component analysis have been used to further improve the task partitioning. Our approach has been evaluated over a suite of 23 programs and respectively achieves a performance improvement of 22% and 25% compared to an execution of the benchmarks on a single CPU and a single GPU which is equal to 87.5% of the optimal performance.en
dc.identifier.isbn978-1-4503-2130-3
dc.identifier.urihttps://depositonce.tu-berlin.de/handle/11303/7020
dc.identifier.urihttp://dx.doi.org/10.14279/depositonce-6341
dc.language.isoen
dc.rights.urihttp://rightsstatements.org/vocab/InC/1.0/
dc.subject.ddc004 Datenverarbeitung; Informatik
dc.subject.othercode analysisen
dc.subject.othercompilersen
dc.subject.othergpuen
dc.subject.otherheterogeneous computingen
dc.subject.othermachine learningen
dc.subject.otherruntime systemen
dc.subject.othertask partitioningen
dc.titleAn automatic input-sensitive approach for heterogeneous task partitioningen
dc.typeConference Objecten
dc.type.versionacceptedVersionen
dcterms.bibliographicCitation.doi10.1145/2464996.2465007
dcterms.bibliographicCitation.originalpublishernameAssociation for Computing Machinery (ACM)en
dcterms.bibliographicCitation.originalpublisherplaceNew York, NYen
dcterms.bibliographicCitation.pageend160
dcterms.bibliographicCitation.pagestart149
dcterms.bibliographicCitation.proceedingstitleProceedings of the 27th International ACM Conference on International Conference on Supercomputingen
tub.accessrights.dnbdomain
tub.affiliationFak. 4 Elektrotechnik und Informatik::Inst. Technische Informatik und Mikroelektronik::FG Architektur eingebetteter Systemede
tub.affiliation.facultyFak. 4 Elektrotechnik und Informatikde
tub.affiliation.groupFG Architektur eingebetteter Systemede
tub.affiliation.instituteInst. Technische Informatik und Mikroelektronikde
tub.publisher.universityorinstitutionTechnische Universität Berlinen

Files

Original bundle
Now showing 1 - 1 of 1
Loading…
Thumbnail Image
Name:
An_automatic.pdf
Size:
1.32 MB
Format:
Adobe Portable Document Format

Collections