Wilhelm Killing Kolloquium: Prof. Dr. Govind Menon (Brown University): Towards a geometric theory of deep learning
Thursday, 08.05.2025 14:15 im Raum M4
The mathematical core of deep learning is function approximation by neural networks trained on data using stochastic gradient descent.
I will present a collection of results on training dynamics for the deep linear network (DLN). The DLN is a phenomenological model of deep learning for linear functions that was introduced by computer scientists. It allows a comprehensive analysis that reveals interesting ties with several areas of mathematics and several lessons for 'true' deep learning.
This is joint work with several co-authors: Nadav Cohen (Tel Aviv), Kathryn Lindsey (Boston College), Alan Chen, Zsolt Veraszto and Tianmin Yu (Brown).
Angelegt am 18.03.2025 von Claudia Lückert
Geändert am 27.03.2025 von Claudia Lückert
[Edit | Vorlage]