Computing high-dimensional value functions of optimal feedback control problems using the Tensor-train format

Sallandt, Leon Jasper

dc.contributor.advisor	Schneider, Reinhold
dc.contributor.author	Sallandt, Leon Jasper
dc.contributor.grantor	Technische Universität Berlin	en
dc.contributor.referee	Schneider, Reinhold
dc.contributor.referee	Kunisch, Karl
dc.contributor.referee	Breiten, Tobias
dc.date.accepted	2021-12-06
dc.date.accessioned	2022-01-13T14:41:22Z
dc.date.available	2022-01-13T14:41:22Z
dc.date.issued	2022
dc.description.abstract	We consider high-dimensional, non-linear functional equations, chiefly the Bellman equation known from optimal control and related fields. We handle the non-linearity with fixed-point iterations, for the most part the Policy Iteration algorithm, which reduce the problem to a sequence of linear problems. These linear problems suffer from the so-called curse of dimensionality. We apply hierarchical tensor formats, in particular tensor-trains, to represent the sought function. We also use an extension of the tensor-train format in which individual functions can be added to the function space. The linear problems are approximated by regression and minimal residual formulations, which give rise to high-dimensional integrals; we estimate these integrals with Monte Carlo methods. Within this framework, we compute feedback controllers for infinite and finite horizon optimal control problems. For the finite horizon case we also consider an algorithm based on open-loop control and provide a novel error propagation bound. We further consider stochastic exit-time control problems. Finally, we consider a regression approach for parabolic partial differential equations, which can be reformulated as backward stochastic differential equations. In this context, we apply the tensor-train model and compare it with state-of-the-art neural network methods with respect to run-time and accuracy. We observe numerically that for many problems low-rank approximations of the sought functions can be found, yielding close to optimal feedback controllers.	en
dc.description.abstract	We consider high-dimensional, non-linear functional equations, such as the Bellman equation known from the field of optimal control. We treat the occurring non-linearity with fixed-point iterations, in particular Policy Iteration, and thereby obtain a sequence of linear problems. In high dimensions these problems suffer from the so-called curse of dimensionality, which we address by using hierarchical tensor formats, in particular tensor-trains. We use these to represent the sought functions and also employ an extension of the concept in which individual functions are added to the function space. The resulting linear problems are then solved by regression and similar methods. The high-dimensional integrals that arise are approximated by Monte Carlo methods. With this approach we compute optimal feedback controls for various optimal control problems, ranging from (deterministic) problems with finite and infinite time horizon to stochastic problems with an exit condition. Finally, general semi-linear parabolic differential equations are solved by means of backward stochastic differential equations, and we compare the results with state-of-the-art neural network methods. Here we pay attention to the accuracy of the results and to the run-time of the algorithm. We observe numerically that for many problems good approximations of the sought functions, and hence also of the optimal feedback law, can be found with the tensor-train approach.	de
dc.identifier.uri	https://depositonce.tu-berlin.de/handle/11303/14013
dc.identifier.uri	http://dx.doi.org/10.14279/depositonce-12786
dc.language.iso	en	en
dc.rights.uri	http://rightsstatements.org/vocab/InC/1.0/	en
dc.subject.ddc	518 Numerical analysis	de
dc.subject.ddc	519 Probabilities and applied mathematics	de
dc.subject.other	Tensor train	en
dc.subject.other	feedback control	en
dc.subject.other	high-dimensional PDE	en
dc.subject.other	value function	en
dc.subject.other	BSDE	en
dc.subject.other	Tensor-Zug	de
dc.subject.other	Feedback-Steuerung	de
dc.subject.other	hochdimensionale PDE	de
dc.subject.other	Wertefunktion	de
dc.title	Computing high-dimensional value functions of optimal feedback control problems using the Tensor-train format	en
dc.title.translated	Berechnung von Wertefunktionen von optimalen Feedbacksteuerungsproblemen mit Nutzung des Tensor-Train-Formats	de
dc.type	Doctoral Thesis	en
dc.type.version	acceptedVersion	en
tub.accessrights.dnb	free	en
tub.affiliation	Fak. 2 Mathematik und Naturwissenschaften::Inst. Mathematik::FG Modellierung, Simulation und Optimierung in Natur- und Ingenieurwissenschaften	de
tub.affiliation.faculty	Fak. 2 Mathematik und Naturwissenschaften	de
tub.affiliation.group	FG Modellierung, Simulation und Optimierung in Natur- und Ingenieurwissenschaften	de
tub.affiliation.institute	Inst. Mathematik	de
tub.publisher.universityorinstitution	Technische Universität Berlin	en

Files

Original bundle

Name: sallandt_leon.pdf
Size: 1.9 MB
Format: Adobe Portable Document Format
