Opening the machine learning black box with Layer-wise Relevance Propagation

Lapuschkin, Sebastian

dc.contributor.advisor	Müller, Klaus-Robert
dc.contributor.author	Lapuschkin, Sebastian
dc.contributor.grantor	Technische Universität Berlin	en
dc.contributor.referee	Müller, Klaus-Robert
dc.contributor.referee	Wiegand, Thomas
dc.contributor.referee	Principe, Jose C.
dc.date.accepted	2018-12-19
dc.date.accessioned	2019-01-30T10:00:47Z
dc.date.available	2019-01-30T10:00:47Z
dc.date.issued	2019
dc.description.abstract	Machine learning techniques such as (Deep) Neural Networks are successfully solving a plethora of tasks, e.g. in image recognition and text analysis, and provide novel predictive models for complex physical, biological and chemical systems. However, due to the nested, complex and non-linear structure of many machine learning models, this comes with the disadvantage of them acting as a black box, providing little or no information about the internal reasoning. This black box character hampers acceptance and application of non-linear methods in many application domains, where understanding individual model predictions and thus trust in the model’s decisions are critically important. In this thesis, we describe Layer-wise Relevance Propagation (LRP), a novel method for explaining non-linear classifier decisions by decomposing the prediction function. We apply our method to Neural Networks, kernelized Support Vector Machines (with non-linear kernels) and Bag of Words feature extraction pipelines and evaluate LRP theoretically, qualitatively and quantitatively in comparison to other recent methods for interpreting model predictions. Using our method as a tool for comparative analyses between various pre-trained models, we reveal different learned prediction strategies and flaws in datasets, predictors and the training thereof.	en
dc.description.abstract	Techniken des maschinellen Lernens wie (Tiefe) Neuronale Netze lösen eine Vielzahl an Aufgaben mit großem Erfolg, beispielsweise in der Bilderkennung und Textanalyse, und bieten neuartige Vorhersagemodelle für komplexe physikalische, biologische und chemische Zusammenhänge auf. Dies geht jedoch durch die verschachtelte und komplex-nichtlineare Struktur vieler Modelle des maschinellen Lernens mit dem Nachteil einher, dass diese Modelle sich wie Black Boxes verhalten und keine oder nur wenig Informationen über interne Schlussfolgerungen preisgeben. Dieser Black Box-Charakter beeinträchtigt die Anwendung und Akzeptanz von nichtlinearen Methoden in zahlreichen Anwendungsgebieten, in denen das Verstehen individueller Modellvorhersagen, und somit das Vertrauen in das Vorhersagemodell unumgänglich ist. Diese Dissertation behandelt eine neuartige Methode, genannt Layer-wise Relevance Propagation (LRP), zur Erklärung nichtlinearer Klassifikationsentscheidungen mittels der Zerlegung der Vorhersagefunktion. Wir wenden unsere Methode auf Neuronale Netze, Support Vector Maschinen (mit nichtlinearen Kernen) und Bag of Words Merkmalsextraktionssysteme an, und evaluieren LRP auf theoretischer, qualitativer und quantitativer Ebene im Vergleich zu weiteren aktuellen Methoden zur Interpretation von Modellvorhersagen. Unsere Methode als Analysewerkzeug nutzend decken wir vergleichend zwischen diversen vortrainierten Modellen verschiedene erlernte Vorhersagestrategien und Schwächen in Datensätzen, Prädiktionsmodellen und deren Training auf.	de
dc.identifier.uri	https://depositonce.tu-berlin.de/handle/11303/8813
dc.identifier.uri	http://dx.doi.org/10.14279/depositonce-7942
dc.language.iso	en	en
dc.rights.uri	https://creativecommons.org/licenses/by-nc-nd/4.0/	en
dc.subject.ddc	004 Datenverarbeitung; Informatik	de
dc.subject.ddc	006 Spezielle Computerverfahren	de
dc.subject.other	machine learning	en
dc.subject.other	Layer-wise Relevance Propagation	en
dc.subject.other	Taylor decomposition	en
dc.subject.other	spectral relevance analysis	en
dc.subject.other	explainable artificial intelligence	en
dc.subject.other	maschinelles Lernen	de
dc.subject.other	Taylor-Zerlegung	de
dc.subject.other	spektrale Relevanzanalyse	de
dc.subject.other	erklärbare künstliche Intelligenz	de
dc.title	Opening the machine learning black box with Layer-wise Relevance Propagation	en
dc.title.translated	Öffnen der Black Box des maschinellen Lernens mit Layer-wise Relevance Propagation	de
dc.type	Doctoral Thesis	en
dc.type.version	acceptedVersion	en
tub.accessrights.dnb	free	en
tub.affiliation	Fak. 4 Elektrotechnik und Informatik::Inst. Softwaretechnik und Theoretische Informatik::FG Maschinelles Lernen	de
tub.affiliation.faculty	Fak. 4 Elektrotechnik und Informatik	de
tub.affiliation.group	FG Maschinelles Lernen	de
tub.affiliation.institute	Inst. Softwaretechnik und Theoretische Informatik	de
tub.publisher.universityorinstitution	Technische Universität Berlin	en

Files

Original bundle

Name: lapuschkin_sebastian.pdf
Size: 17.05 MB
Format: Adobe Portable Document Format

License bundle

Name: license.txt
Size: 4.9 KB
Format: Item-specific license agreed upon to submission
