Please use this identifier to cite or link to this item: http://dx.doi.org/10.14279/depositonce-6783
Main Title: An Optimized Parallel IDCT on Graphics Processing Units
Author(s): Wang, Biao
Álvarez-Mesa, Mauricio
Chi, Chi Ching
Juurlink, Ben
Type: Book Part
Language Code: en
Is Part Of: 10.1007/978-3-642-36949-0
Abstract: In this paper, we present an implementation of the H.264/AVC Inverse Discrete Cosine Transform (IDCT) optimized for Graphics Processing Units (GPUs) using OpenCL. By exploiting the fact that most of the IDCT input data for real videos are zero-valued coefficients, we create a new compacted data representation that allows for several optimizations. Experimental evaluations conducted on different GPUs show average speedups from 1.7× to 7.4× compared to an optimized single-threaded SIMD CPU version.
URI: https://depositonce.tu-berlin.de//handle/11303/7569
http://dx.doi.org/10.14279/depositonce-6783
Issue Date: 2013
Date Available: 12-Apr-2018
DDC Class: 004 Data processing; Computer science
Subject(s): IDCT
GPU
H.264
OpenCL
parallel programming
License: http://rightsstatements.org/vocab/InC/1.0/
Book Title: Euro-Par 2012: Parallel Processing Workshops
Publisher: Springer
Publisher Place: Berlin; Heidelberg
Publisher DOI: 10.1007/978-3-642-36949-0_18
Page Start: 155
Page End: 164
Series: Lecture Notes in Computer Science
Series Number: 7640
EISSN: 1611-3349
ISBN: 978-3-642-36949-0
978-3-642-36948-3
ISSN: 0302-9743
Appears in Collections: FG Architektur eingebetteter Systeme » Publications

Files in This Item:
File: 10.1007.978-3-642-36949-0_18.pdf
Size: 878.31 kB
Format: Adobe PDF


Items in DepositOnce are protected by copyright, with all rights reserved, unless otherwise indicated.