Spatio-temporal SIMT and scalarization for improving GPU efficiency
dc.contributor.author | Lucas, Jan | |
dc.contributor.author | Andersch, Michael | |
dc.contributor.author | Álvarez-Mesa, Mauricio | |
dc.contributor.author | Juurlink, Ben | |
dc.date.accessioned | 2017-10-24T10:05:17Z | |
dc.date.available | 2017-10-24T10:05:17Z | |
dc.date.issued | 2015 | |
dc.description.abstract | Temporal SIMT (TSIMT) has been suggested as an alternative to conventional (spatial) SIMT for improving GPU performance on branch-intensive code. Although TSIMT has been briefly mentioned before, it was not evaluated. We present a complete design and evaluation of TSIMT GPUs, along with the inclusion of scalarization and a combination of temporal and spatial SIMT, named Spatiotemporal SIMT (STSIMT). Simulations show that TSIMT alone results in a performance reduction, but a combination of scalarization and STSIMT yields a mean performance enhancement of 19.6% and improves the energy-delay product by 26.2% compared to SIMT. | en |
dc.description.sponsorship | EC/FP7/288653/EU/Low-Power Parallel Computing on GPUs/LPGPU | en |
dc.identifier.issn | 1544-3566 | |
dc.identifier.uri | https://depositonce.tu-berlin.de/handle/11303/6923 | |
dc.identifier.uri | http://dx.doi.org/10.14279/depositonce-6262 | |
dc.language.iso | en | |
dc.rights.uri | http://rightsstatements.org/vocab/InC/1.0/ | |
dc.subject.ddc | 004 Data processing; Computer science | |
dc.subject.other | GPUs | en |
dc.subject.other | branch divergence | en |
dc.subject.other | scalarization | en |
dc.subject.other | temporal SIMT | en |
dc.title | Spatio-temporal SIMT and scalarization for improving GPU efficiency | en |
dc.type | Article | en |
dc.type.version | acceptedVersion | en |
dcterms.bibliographicCitation.articlenumber | 32 | |
dcterms.bibliographicCitation.doi | 10.1145/2811402 | |
dcterms.bibliographicCitation.issue | 3 | |
dcterms.bibliographicCitation.journaltitle | ACM Transactions on Architecture and Code Optimization (TACO) | en |
dcterms.bibliographicCitation.originalpublishername | Association for Computing Machinery (ACM) | en |
dcterms.bibliographicCitation.originalpublisherplace | New York, NY | en |
dcterms.bibliographicCitation.volume | 12 | |
tub.accessrights.dnb | domain | |
tub.affiliation | Fak. 4 Elektrotechnik und Informatik::Inst. Technische Informatik und Mikroelektronik::FG Architektur eingebetteter Systeme | de |
tub.affiliation.faculty | Fak. 4 Elektrotechnik und Informatik | de |
tub.affiliation.group | FG Architektur eingebetteter Systeme | de |
tub.affiliation.institute | Inst. Technische Informatik und Mikroelektronik | de |
tub.publisher.universityorinstitution | Technische Universität Berlin | en |