Inst. Technische Informatik und Mikroelektronik

158 Items

Recent Submissions
GPU power modeling and architectural enhancements for GPU energy efficiency

Lucas, Jan (2019)

Graphics Processing Units (GPUs) can now be found in nearly every PC and smartphone. Initially designed for 3D graphics, they evolved into general purpose accelerators, able to outperform CPUs on many tasks. The architecture of GPUs is optimized for massively parallel applications. This reduces the required control logic but also results in lower performance in applications with irregular contr...

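Neuer Optimierungsansatz für die Entwicklung hochkomplexer monolithisch integrierter Indiumphosphid Mach-Zehnder Terabit Transmitterchips [New optimization approach for the development of highly complex, monolithically integrated indium phosphide Mach-Zehnder terabit transmitter chips]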

Rausch, Marko (2019)

To keep pace with data traffic that will continue to grow strongly in the future, high-performance optical data and communication networks of steadily increasing capacity are required. This can only be achieved through continuous further development of all required network components (in particular transmitters, routers and receivers) with regard to bandwidth, signal quality, power consumption and complexity...

SLC: Memory Access Granularity Aware Selective Lossy Compression for GPUs

Lal, Sohan ; Lucas, Jan ; Juurlink, Ben (2019)

Memory compression is a promising approach for reducing memory bandwidth requirements and increasing performance. However, memory compression techniques often result in a low effective compression ratio due to the large memory access granularity (MAG) exhibited by GPUs. Our analysis of the distribution of compressed blocks shows that a significant percentage of blocks are compressed to a size that ...
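
To illustrate why a large MAG lowers the effective compression ratio mentioned in the abstract above, here is a minimal Python sketch. It assumes that a compressed block is always fetched in whole multiples of the MAG. The function name and the example sizes (128-byte block, 32-byte MAG, 40-byte compressed block) are hypothetical values chosen for illustration, not figures taken from the paper.

```python
def effective_compression_ratio(block_bytes: int, compressed_bytes: int, mag_bytes: int) -> float:
    """Ratio actually seen by the memory system (illustrative assumption).

    The compressed size is rounded up to the next multiple of the memory
    access granularity (MAG), because partial accesses are not possible.
    """
    # Integer ceiling division: number of MAG-sized accesses needed.
    accesses = -(-compressed_bytes // mag_bytes)
    fetched_bytes = accesses * mag_bytes
    return block_bytes / fetched_bytes


# Hypothetical example: a 128-byte block compressed to 40 bytes, MAG of 32 bytes.
raw_ratio = 128 / 40                                        # 3.2 before rounding
effective_ratio = effective_compression_ratio(128, 40, 32)  # 64 bytes are fetched
print(raw_ratio, effective_ratio)
```

In this hypothetical case the block compresses by a factor of 3.2, but two 32-byte accesses are still needed, so the effective ratio is only 2.0.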

Chip Multiprocessor Traffic Models Providing Consistent Multicast and Spatial Distributions

Tutsch, Dietmar ; Lüdtke, Daniel (2008)

Chip multiprocessors (CMPs) have become the center of attention in recent years. They consist of multiple processor cores on a single chip. These cores are connected on-chip by a bus or, if many cores are involved, by an appropriate network. To investigate how a multicore processor behaves depending on the chosen network-on-chip topology, a corresponding model must be established for performanc...

Activity and Eye Movement Analysis as Basis of Vehicle Cabin Design

Rötting, Matthias ; Rösler, Dirk ; Lohse, Katrin ; Göbel, Matthias (2000)

An inventory of different methods has been developed over the last couple of years to evaluate the ergonomic quality of different drivers' workplaces. The cockpits of short-haul buses, long-haul buses, streetcars and harvesting machinery were evaluated. Based on this analysis, criteria for the redesign could be developed. For example, the design of the standard German short-haul buses is based on the results of ...

Modelling and control of a small-scale unmanned helicopter

Du, J. F. ; Kondak, K. ; Zhang, Y. O. ; Lu, T. S. (2008)

The dynamical model of a toy helicopter considered as two rigid bodies is deduced using Kane's equation. Another model is presented that considers the helicopter as a single rigid body. It is shown that the response of the rotational dynamics modelled as two rigid bodies is cosine while that modelled as one rigid body is linear. In addition, a flight controller is presented that is based on dyn...

Precise 3-D localization and simulation of topographic objects in synthetic aperture radar images

Saeed, Adnan (2018)

The geometric localization accuracy of TerraSAR-X, which results from very precise orbit determination, is a prominent feature of the current generation of SAR satellites. If the most prominent error sources are accounted for, the 2-D position of targets can be estimated with high accuracy. Such a method is referred to as SAR geodesy. Having two such acquisitions from different orbits, a 3-D posit...

Leveraging novel information for coarse-grained prediction of protein motion

Putz, Ines (2018)

Proteins are involved in almost all functions in our cells due to their ability to combine conformational motion with chemical specificity. Hence, information about the motions of a protein provides insights into its function. Proteins move on a rugged energy landscape with many local minima, which is imposed on their high-dimensional conformational space. Exhaustive sampling of this space exce...

Model Predictive Control for a Small Scale Unmanned Helicopter

Du, Jianfu ; Kondak, Konstantin ; Bernard, Markus ; Zhang, Yaou ; Lü, Tiansheng ; Hommel, Günter (2008)

Kinematical and dynamical equations of a small-scale unmanned helicopter are presented in this paper. Based on these equations, a model predictive control (MPC) method is proposed for controlling the helicopter. This novel method allows the direct accounting for the existing time delays which are used to model the dynamics of actuators and aerodynamics of the main rotor. Also, the limits of the act...

Efficient HEVC Decoder for Heterogeneous CPU with GPU Systems

Wang, Biao ; Alvarez-Mesa, Mauricio ; Chi, Chi Ching ; Juurlink, Ben ; Souza, Diego F. de ; Ilic, Aleksandar ; Roma, Nuno ; Sousa, Leonel (2016)

The High Efficiency Video Coding (HEVC) standard provides higher compression efficiency than other video coding standards but at the cost of increased computational load, which makes it hard to achieve real-time encoding/decoding of high-resolution, high-quality video sequences. In this paper, we investigate how Graphics Processing Units (GPUs) can be employed to accelerate HEVC decoding. GPUs ...

EPSILON-CP: using deep learning to combine information from multiple sources for protein contact prediction

Stahl, Kolja ; Schneider, Michael ; Brock, Oliver (2017-06-17)

Background: Accurately predicted contacts make it possible to compute the 3D structure of a protein. Since the solution space of native residue-residue contact pairs is very large, it is necessary to leverage information to identify relevant regions of the solution space, i.e. correct contacts. Every additional source of information can contribute to narrowing down candidate regions. Therefore, recent met...

Enabling GPU software developers to optimize their applications – The LPGPU2 approach

Juurlink, Ben ; Lucas, Jan ; Mammeri, Nadjib ; Keramidas, Georgios ; Pontzolkova, Katerina ; Aransay, Ignacio ; Kokkala, Chrysa ; Bliss, Martyn ; Richards, Andrew (2017)

Low-power GPUs have become ubiquitous; they can be found in domains ranging from wearable and mobile computing to automotive systems. With this ubiquity has come a wider range of applications exploiting low-power GPUs, placing ever-increasing demands on the expected performance and power efficiency of the devices. The LPGPU2 project is an EU-funded, 30-month Innovation Action project targetin...

Highly parallel HEVC decoding for heterogeneous systems with CPU and GPU

Wang, Biao ; de Souza, Diego F. ; Álvarez-Mesa, Mauricio ; Chi, Chi Ching ; Juurlink, Ben ; Ilic, Aleksandar ; Roma, Nuno ; Sousa, Leonel (2017)

The High Efficiency Video Coding (HEVC) standard provides a higher compression efficiency than other video coding standards but at the cost of an increased computational load, which makes it hard to achieve real-time encoding/decoding for ultra-high-resolution and high-quality video sequences. Graphics Processing Units (GPUs) are known to provide massive processing capability for highly parallel and re...

Highly Parallel HEVC Decoding for Heterogeneous Systems with CPU and GPU - Research Data

Wang, Biao ; Felix de Souza, Diego ; Alvarez-Mesa, Mauricio ; Chi, Chi Ching ; Juurlink, Ben ; Ilic, Aleksandar ; Roma, Nuno ; Sousa, Leonel (2017)

The High Efficiency Video Coding (HEVC) standard provides a higher compression efficiency than other video coding standards but at the cost of an increased computational load, which makes it hard to achieve real-time encoding/decoding for ultra-high-resolution and high-quality video sequences. Graphics Processing Units (GPUs) are known to provide massive processing capability for highly parallel a...

Optimal DC/AC Data Bus Inversion Coding

Lucas, Jan ; Lal, Sohan ; Juurlink, Ben (2018-10-29)

GDDR5 and DDR4 memories use data bus inversion (DBI) coding to reduce termination power and decrease the number of output transitions. Two main strategies exist for encoding data using DBI: DBI DC minimizes the number of outputs transmitting a zero, while DBI AC minimizes the number of signal transitions. We show that neither of these strategies is optimal and reduction of interface power of up...
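
The two baseline strategies named in the abstract above can be sketched in Python as follows. This is a simplified illustration under stated assumptions, not the coding proposed by the authors: it assumes one DBI flag per 8-bit lane, inverts when more than four bits would be zero (DC) or would toggle (AC), uses a flag value of 1 to mean "inverted", and ignores the level and transitions of the DBI line itself as well as the actual signal polarity of real devices. The function names are hypothetical.

```python
from typing import Tuple


def popcount8(value: int) -> int:
    """Number of one bits in the low 8 bits of value."""
    return bin(value & 0xFF).count("1")


def dbi_dc(byte: int) -> Tuple[int, int]:
    """DBI DC sketch: keep the number of transmitted zeros at four or fewer."""
    zeros = 8 - popcount8(byte)
    if zeros > 4:
        return (~byte) & 0xFF, 1  # send inverted byte, flag = 1 (assumed polarity)
    return byte & 0xFF, 0


def dbi_ac(byte: int, previous_wire_byte: int) -> Tuple[int, int]:
    """DBI AC sketch: keep the number of toggling data lines at four or fewer."""
    transitions = popcount8(byte ^ previous_wire_byte)
    if transitions > 4:
        return (~byte) & 0xFF, 1
    return byte & 0xFF, 0


print(dbi_dc(0x01))        # seven zeros, so the byte is inverted
print(dbi_ac(0xFF, 0x00))  # eight toggles, so the byte is inverted
```

Each rule optimizes only one quantity: `dbi_dc` never looks at the previous bus state, and `dbi_ac` never looks at how many zeros end up on the wire.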

VComputeBench: A Vulkan Benchmark Suite for GPGPU on Mobile and Embedded GPUs

Mammeri, Nadjib (2018-09-30)

GPUs have become immensely important computational units on embedded and mobile devices. However, GPGPU developers are often not able to exploit the compute power offered by GPUs on these devices, mainly due to the lack of support for traditional programming models such as CUDA and OpenCL. The recent introduction of the Vulkan API provides a new programming model that could be explored for GPGPU ...

A regularized fusion based 3D reconstruction framework

Rajput, Muhammad Asif Ali (2018)

Recent developments in depth sensing technologies have enabled mobile robots to perceive their surroundings with high accuracy. Robotic applications equipped with depth perception technology enable autonomous navigation in self-driving cars, assist in critical surgical procedures, or reconstruct the 3D model of a potentially hazardous environment. There exists a variety of 3D sensors r...

VComputeBench: A Vulkan Benchmark Suite for GPGPU on Mobile and Embedded GPUs

Mammeri, Nadjib ; Juurlink, Ben (2018)

GPUs have become immensely important computational units on embedded and mobile devices. However, GPGPU developers are often not able to exploit the compute power offered by GPUs on these devices, mainly due to the lack of support for traditional programming models such as CUDA and OpenCL. The recent introduction of the Vulkan API provides a new programming model that could be explored for GPGPU ...

Application-Specific Cache and Prefetching for HEVC CABAC Decoding

Habermann, Philipp ; Chi, Chi Ching ; Álvarez-Mesa, Mauricio ; Juurlink, Ben (2017)

Context-based Adaptive Binary Arithmetic Coding (CABAC) is the entropy coding module in the HEVC/H.265 video coding standard. As in its predecessor, H.264/AVC, CABAC is a well-known throughput bottleneck due to its strong data dependencies. Besides other optimizations, the replacement of the context model memory by a smaller cache has been proposed for hardware decoders, resulting in an improve...

E²MC: Entropy Encoding Based Memory Compression for GPUs

Lal, Sohan ; Lucas, Jan ; Juurlink, Ben (2017)

Modern Graphics Processing Units (GPUs) provide much higher off-chip memory bandwidth than CPUs, but many GPU applications are still limited by memory bandwidth. Unfortunately, off-chip memory bandwidth is growing more slowly than the number of cores and has become a performance bottleneck. Thus, optimizations of effective memory bandwidth play a significant role in scaling the performance of GPUs....