Inst. Softwaretechnik und Theoretische Informatik

414 Items

Recent Submissions
Towards CRISP-ML(Q): A Machine Learning Process Model with Quality Assurance Methodology

Studer, Stefan ; Bui, Thanh Binh ; Drescher, Christian ; Hanuschkin, Alexander ; Winkler, Ludwig ; Peters, Steven ; Müller, Klaus-Robert (2021-04-22)

Machine learning is an established and frequently used technique in industry and academia, but a standard process model to improve success and efficiency of machine learning applications is still missing. Project organizations and machine learning practitioners face manifold challenges and risks when developing machine learning applications and have a need for guidance to meet business expectat...

Information flows to support software developers in using security APIs

Gorski, Peter Leo (2021)

At the end of 2019, about 4.1 billion people on earth were using the internet. Because people entrust their most intimate and private data to their devices, European legislation has declared the protection of natural persons in relation to the processing of personal data a fundamental right. In 2018, 23 million people worldwide, having the responsibility of implementing data security and ...

NISQA Speech Quality Corpus

Mittag, Gabriel (2021-04)

The NISQA Speech Quality Corpus contains two training, two validation and four test datasets that have been used for training and evaluating the deep learning based speech quality model NISQA. Overall, it contains more than 14,000 speech files with annotated speech quality and speech quality dimension ratings.

Identification of nodes and networks

Liu, Yang (2021)

Complex systems in a broad range of scientific domains have been shown to be well-characterized by networks in an increasing number of studies. Problems such as cascading failures, spreading dynamics and the extraction of leading factors from raw data through the construction of networks can all be studied within the paradigm of network science. Such problems concerning networks are usually dir...

DNC: Dataset for Noise Classification

Zequeira Jiménez, Rafael (2021)

The DNC dataset contains 4377 environmental background noise recordings labeled according to the type of noise. The recordings are approximately equally balanced between three main categories, i.e., "mechanic", "melodic", and "quiet". These noise categories were selected as we found in a previous study that these are the background noises that can distract users in crowdsourcing when perf...

Efficient computation of optimal temporal walks under waiting-time constraints

Bentert, Matthias ; Himmel, Anne-Sophie ; Nichterlein, André ; Niedermeier, Rolf (2020-10-06)

Node connectivity plays a central role in temporal network analysis. We provide a broad study of various concepts of walks in temporal graphs, that is, graphs with fixed vertex sets but arc sets changing over time. Taking into account the temporal aspect leads to a rich set of optimization criteria for “shortest” walks. Extending and broadening state-of-the-art work of Wu et al. [IEEE TKDE 2016...

NDNetGaming - development of a no-reference deep CNN for gaming video quality prediction

Utke, Markus ; Zadtootaghaj, Saman ; Schmidt, Steven ; Bosse, Sebastian ; Möller, Sebastian (2020-07-24)

Gaming video streaming services are growing rapidly due to new services such as passive video streaming of gaming content, e.g. Twitch.tv, as well as cloud gaming, e.g. Nvidia GeForce NOW and Google Stadia. In contrast to traditional video content, gaming content has special characteristics such as extremely high and special motion patterns, synthetic content and repetitive content, which poses...

Parameterized dynamic cluster editing

Luo, Junjie ; Molter, Hendrik ; Nichterlein, André ; Niedermeier, Rolf (2020-07-25)

We introduce a dynamic version of the NP-hard graph modification problem Cluster Editing. The essential point here is to take into account dynamically evolving input graphs: having a cluster graph (that is, a disjoint union of cliques) constituting a solution for a first input graph, can we cost-efficiently transform it into a “similar” cluster graph that is a solution for a second (“subseque...

The power of linear-time data reduction for maximum matching

Mertzios, George B. ; Nichterlein, André ; Niedermeier, Rolf (2020-07-06)

Finding maximum-cardinality matchings in undirected graphs is arguably one of the most central graph primitives. For m-edge and n-vertex graphs, it is well-known to be solvable in O(m\sqrt{n}) time; however, for several applications this running time is still too slow. We investigate how linear-time (and almost linear-time) data reduction (used as preprocessing) can alleviate the situation. ...

Images from on-chip memories captured using the laser-assisted side-channel techniques LLSI and TLS

Krachenfels, Thilo ; Kiyan, Tuba ; Tajik, Shahin ; Seifert, Jean-Pierre (2021-02-22)

This repository contains images of on-chip memories (in 16-bit TIFF and 8-bit PNG format) that were captured using the PHEMOS-1000 failure analysis microscope with the LLSI (Laser Logic State Imaging) and TLS (Thermal Laser Stimulation) techniques. The content of the imaged memories is provided in text and binary files. This data package is connected to the publication "Automatic Extraction of ...

DNLE: Dataset for Noise Level Estimation

Zequeira Jiménez, Rafael (2021)

The DNLE dataset contains 1668 environmental background noise recordings labeled according to type and level of noise. The recordings are approximately equally balanced between three main categories, i.e., "mechanic", "melodic", and "quiet". These noise categories were selected as we found in a previous study that these are the background noises that can distract users in crowdsourcing when per...

Data Science für alle: Grundlagen der Datenprogrammierung

Abedjan, Ziawasch ; Anuth, Hagen ; Esmailoghli, Mahdi ; Mahdavi, Mohammad ; Neutatz, Felix ; Chen, Binger (2020-02-27)

The demand for data scientists across a wide range of areas in industry, society, and research confronts universities with the question of what form data science education should take. Besides the traditional approach of offering data science as a degree subject, there are also calls to embed data science courses in non-computer-science and non-mathematics...

A fast shortest path algorithm on terrain-like graphs

Froese, Vincent ; Renken, Malte (2020-08-04)

Terrain visibility graphs are a well-known graph class in computational geometry. They are closely related to polygon visibility graphs, but a precise graph-theoretical characterization is still unknown. Over the last decade, terrain visibility graphs attracted considerable attention in the context of time series analysis (there called time series visibility graphs) with various practical appli...

Comparing temporal graphs using dynamic time warping

Froese, Vincent ; Jain, Brijnesh ; Niedermeier, Rolf ; Renken, Malte (2020-06-29)

Within many real-world networks, the links between pairs of nodes change over time. Thus, there has been a recent boom in studying temporal graphs. Recognizing patterns in temporal graphs requires a proximity measure to compare different temporal graphs. To this end, we propose to study dynamic time warping on temporal graphs. We define the dynamic temporal graph warping (dtgw) distance to dete...

Human processing of transmitted speech varying in perceived quality

Uhrig, Stefan Josef (2021)

The present thesis addresses human information processing of technologically transmitted speech, especially the effects of varying speech transmission quality (e.g. due to background noise or limitations in transmission bandwidth). The concept of “perceived quality” refers to an evaluative perceptual feature that integrates a subset of more descriptive perceptual features or “perceptual quality...

Dynamical strengthening of covalent and non-covalent molecular interactions by nuclear quantum effects at finite temperature

Sauceda, Huziel E. ; Vassilev-Galindo, Valentin ; Chmiela, Stefan ; Müller, Klaus-Robert ; Tkatchenko, Alexandre (2021-01-19)

Nuclear quantum effects (NQE) tend to generate delocalized molecular dynamics due to the inclusion of the zero point energy and its coupling with the anharmonicities in interatomic interactions. Here, we present evidence that NQE often enhance electronic interactions and, in turn, can result in dynamical molecular stabilization at finite temperature. The underlying physical mechanism promoted b...

RHEEMix in the data jungle: a cost-based optimizer for cross-platform systems

Kruse, Sebastian ; Kaoudi, Zoi ; Contreras-Rojas, Bertty ; Chawla, Sanjay ; Naumann, Felix ; Quiané-Ruiz, Jorge-Arnulfo (2020-05-18)

Data analytics are moving beyond the limits of a single platform. In this paper, we present the cost-based optimizer of Rheem, an open-source cross-platform system that copes with these new requirements. The optimizer allocates the subtasks of data analytic tasks to the most suitable platforms. Our main contributions are: (i) a mechanism based on graph transformations to explore alternative exe...

Safe, intelligent and explainable self-adaptive systems

Klös, Verena (2021)

Intelligent cyber-physical systems, such as self-driving cars, smart homes or e-health solutions, will increasingly influence our daily lives. They will deal with increasingly uncertain and changing environments and simultaneously must adhere to strict safety requirements. In addition, we need to trust those systems, as we will hand over control of our daily lives to them. This increasing list o...

Towards speech quality assessment using a crowdsourcing approach: evaluation of standardized methods

Naderi, Babak ; Zequeira Jiménez, Rafael ; Hirth, Matthias ; Möller, Sebastian ; Metzger, Florian ; Hoßfeld, Tobias (2020-11-22)

Subjective speech quality assessment has traditionally been carried out in laboratory environments under controlled conditions. With the advent of crowdsourcing platforms, tasks that need human intelligence can be resolved by crowd workers over the Internet. Crowdsourcing also offers a new paradigm for speech quality assessment, promising higher ecological validity of the quality judgments at...

h-Index manipulation by undoing merges

Bevern, René van ; Komusiewicz, Christian ; Molter, Hendrik ; Niedermeier, Rolf (2020-12-30)

The h-index is an important bibliographic measure used to assess the performance of researchers. Dutiful researchers merge different versions of their articles in their Google Scholar profile even though this can decrease their h-index. In this article, we study the manipulation of the h-index by undoing such merges. In contrast to manipulation by merging articles, such manipulation is harder t...