Stochastic Control for Bayesian Neural Network Training

dc.contributor.authorWinkler, Ludwig
dc.contributor.authorOjeda, César
dc.contributor.authorOpper, Manfred
dc.date.accessioned2022-09-19T11:07:02Z
dc.date.available2022-09-19T11:07:02Z
dc.date.issued2022-08-09
dc.date.updated2022-09-07T13:01:41Z
dc.description.abstractIn this paper, we propose to leverage the Bayesian uncertainty information encoded in parameter distributions to inform the learning procedure for Bayesian models. We derive a first principle stochastic differential equation for the training dynamics of the mean and uncertainty parameter in the variational distributions. On the basis of the derived Bayesian stochastic differential equation, we apply the methodology of stochastic optimal control on the variational parameters to obtain individually controlled learning rates. We show that the resulting optimizer, StochControlSGD, is significantly more robust to large learning rates and can adaptively and individually control the learning rates of the variational parameters. The evolution of the control suggests separate and distinct dynamical behaviours in the training regimes for the mean and uncertainty parameters in Bayesian neural networks.en
dc.description.sponsorshipDFG, 318763901, SFB 1294: Datenassimilation – Die nahtlose Verschmelzung von Daten und Modellenen
dc.description.sponsorshipBMBF, 01IS18025A, Verbundprojekt BIFOLD-BBDC: Berlin Institute for the Foundations of Learning and Dataen
dc.description.sponsorshipBMBF, 01IS18037A, Verbundprojekt BIFOLD-BZML: Berlin Institute for the Foundations of Learning and Dataen
dc.description.sponsorshipTU Berlin, Open-Access-Mittel – 2022
dc.identifier.eissn1099-4300
dc.identifier.urihttps://depositonce.tu-berlin.de/handle/11303/17492
dc.identifier.urihttp://dx.doi.org/10.14279/depositonce-16273
dc.language.isoenen
dc.rights.urihttps://creativecommons.org/licenses/by/4.0/en
dc.subject.ddc004 Datenverarbeitung; Informatikde
dc.subject.otherBayesian inferenceen
dc.subject.otherBayesian neural networksen
dc.subject.otherlearningen
dc.titleStochastic Control for Bayesian Neural Network Trainingen
dc.typeArticleen
dc.type.versionpublishedVersionen
dcterms.bibliographicCitation.articlenumber1097en
dcterms.bibliographicCitation.doi10.3390/e24081097en
dcterms.bibliographicCitation.issue8en
dcterms.bibliographicCitation.journaltitleEntropyen
dcterms.bibliographicCitation.originalpublishernameMDPIen
dcterms.bibliographicCitation.originalpublisherplaceBaselen
dcterms.bibliographicCitation.volume24en
tub.accessrights.dnbfreeen
tub.affiliationFak. 4 Elektrotechnik und Informatik::Inst. Softwaretechnik und Theoretische Informatik::FG Maschinelles Lernende
tub.affiliation.facultyFak. 4 Elektrotechnik und Informatikde
tub.affiliation.groupFG Maschinelles Lernende
tub.affiliation.instituteInst. Softwaretechnik und Theoretische Informatikde
tub.publisher.universityorinstitutionTechnische Universität Berlinen

Files

Original bundle
Now showing 1 - 1 of 1
Loading…
Thumbnail Image
Name:
entropy-24-01097-v3.pdf
Size:
843.7 KB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
4.86 KB
Format:
Item-specific license agreed upon to submission
Description:

Collections