A QHD-capable parallel H.264 decoder

dc.contributor.author: Chi, Chi Ching
dc.contributor.author: Juurlink, Ben
dc.date.accessioned: 2017-10-26T10:38:31Z
dc.date.available: 2017-10-26T10:38:31Z
dc.date.issued: 2011
dc.description.abstract: Video coding follows the trend of demanding higher performance every new generation, and therefore could utilize many-cores. A complete parallelization of H.264, which is the most advanced video coding standard, was found to be difficult due to the complexity of the standard. In this paper a parallel implementation of a complete H.264 decoder is presented. Our parallelization strategy exploits function-level as well as data-level parallelism. Function-level parallelism is used to pipeline the H.264 decoding stages. Data-level parallelism is exploited within the two most time consuming stages, the entropy decoding stage and the macroblock decoding stage. The parallelization strategy has been implemented and optimized on three platforms with very different memory architectures, namely an 8-core SMP, a 64-core cc-NUMA, and an 18-core Cell platform. Evaluations have been performed using 4kx2k QHD sequences. On the SMP platform a maximum speedup of 4.5x is achieved. The SMP-implementation is reasonably performance portable as it achieves a speedup of 26.6x on the cc-NUMA system. However, to obtain the highest performance (speedup of 33.4x and throughput of 200 QHD frames per second), several cc-NUMA specific optimizations are necessary such as optimizing the page placement and statically assigning threads to cores. Finally, on the Cell platform a near ideal speedup of 16.5x is achieved by completely hiding the communication latency. [en]
dc.description.sponsorship: EC/FP7/248647/EU/ENabling technologies for a programmable many-CORE/ENCORE [en]
dc.identifier.isbn: 978-1-4503-0102-2
dc.identifier.uri: https://depositonce.tu-berlin.de/handle/11303/7019
dc.identifier.uri: http://dx.doi.org/10.14279/depositonce-6340
dc.language.iso: en
dc.rights.uri: http://rightsstatements.org/vocab/InC/1.0/
dc.subject.ddc: 004 Datenverarbeitung; Informatik
dc.subject.other: 4k x 2k [en]
dc.subject.other: cell [en]
dc.subject.other: decoding [en]
dc.subject.other: h.264 [en]
dc.subject.other: numa [en]
dc.subject.other: parallel [en]
dc.subject.other: smp [en]
dc.title: A QHD-capable parallel H.264 decoder [en]
dc.type: Conference Object [en]
dc.type.version: acceptedVersion [en]
dcterms.bibliographicCitation.doi: 10.1145/1995896.1995945
dcterms.bibliographicCitation.originalpublishername: Association for Computing Machinery (ACM) [en]
dcterms.bibliographicCitation.originalpublisherplace: New York, NY [en]
dcterms.bibliographicCitation.pageend: 326
dcterms.bibliographicCitation.pagestart: 317
dcterms.bibliographicCitation.proceedingstitle: Proceedings of the International Conference on Supercomputing [en]
tub.accessrights.dnb: domain
tub.affiliation: Fak. 4 Elektrotechnik und Informatik::Inst. Technische Informatik und Mikroelektronik::FG Architektur eingebetteter Systeme [de]
tub.affiliation.faculty: Fak. 4 Elektrotechnik und Informatik [de]
tub.affiliation.group: FG Architektur eingebetteter Systeme [de]
tub.affiliation.institute: Inst. Technische Informatik und Mikroelektronik [de]
tub.publisher.universityorinstitution: Technische Universität Berlin [en]
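
The speedups quoted in the abstract can be put on a common scale by dividing each by the number of cores of its platform. The sketch below does that arithmetic in Python. It is not taken from the paper: the function names are hypothetical, and it assumes each speedup is measured against a single-core run on the same platform and that every listed core takes part in decoding. If some of the 18 Cell cores are not decoding workers, the Cell figure computed here would be a lower bound. The baseline frame rate is likewise only an estimate, assuming the 200 frames per second and the 33.4x speedup describe the same configuration.

```python
# Figures quoted in the abstract: (platform, core count, reported speedup).
# Assumption: each speedup is relative to one core of the same platform.
REPORTED = [
    ("8-core SMP", 8, 4.5),
    ("64-core cc-NUMA, SMP implementation", 64, 26.6),
    ("64-core cc-NUMA, NUMA-specific optimizations", 64, 33.4),
    ("18-core Cell", 18, 16.5),
]


def parallel_efficiency(speedup, cores):
    """Return speedup divided by core count (1.0 means ideal linear scaling)."""
    return speedup / cores


def implied_baseline_fps(throughput_fps, speedup):
    """Estimate the single-core frame rate implied by a throughput and a speedup."""
    return throughput_fps / speedup


if __name__ == "__main__":
    for platform, cores, speedup in REPORTED:
        eff = parallel_efficiency(speedup, cores)
        print(f"{platform}: {speedup}x on {cores} cores, efficiency {eff:.2f}")
    # Assumption: 200 fps and 33.4x belong to the same cc-NUMA configuration.
    print(f"Implied single-core rate: {implied_baseline_fps(200, 33.4):.2f} fps")
```

Read this way, the SMP code carried over to the cc-NUMA system uses its cores less efficiently than the tuned version, which is consistent with the abstract's remark that page placement and static thread assignment are needed for the highest performance.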

Files

Original bundle
Name: AQHD_Capable.pdf
Size: 669.62 KB
Format: Adobe Portable Document Format