Please use this identifier to cite or link to this item: http://dx.doi.org/10.14279/depositonce-7085
|Main Title:||GPU Parallelization of HEVC In-Loop Filters|
de Souza, Diego F.
Chi, Chi Ching
|Abstract:||In the High Efficiency Video Coding (HEVC) standard, multiple decoding modules have been designed to take advantage of parallel processing. In particular, the HEVC in-loop filters (i.e., the deblocking filter and sample adaptive offset) were conceived to be exploited by parallel architectures. However, the type of the offered parallelism mostly suits the capabilities of multi-core CPUs, thus making a real challenge to efficiently exploit massively parallel architectures such as Graphic Processing Units (GPUs), mainly due to the existing data dependencies between the HEVC decoding procedures. In accordance, this paper presents a novel strategy to increase the amount of parallelism and the resulting performance of the HEVC in-loop filters on GPU devices. For this purpose, the proposed algorithm performs the HEVC filtering at frame-level and employs intrinsic GPU vector instructions. When compared to the state-of-the-art HEVC in-loop filter implementations, the proposed approach also reduces the amount of required memory transfers, thus further boosting the performance. Experimental results show that the proposed GPU in-loop filters deliver a significant improvement in decoding performance. For example, average frame rates of 76 frames per second (FPS) and 125 FPS for Ultra HD 4K are achieved on an embedded NVIDIA GPU for All Intra and Random Access configurations, respectively.|
|DDC Class:||004 Datenverarbeitung; Informatik|
High Efficiency Video Coding
Graphics Processor Unit
|Journal Title:||International journal of parallel programming|
|Publisher Place:||Dordrecht [u.a.]|
|Appears in Collections:||FG Architektur eingebetteter Systeme » Publications|
Items in DepositOnce are protected by copyright, with all rights reserved, unless otherwise indicated.