|
Claudia Lückert

Wilhelm Killing Kolloquium: Prof. Dr. Govind Menon (Brown University): Towards a geometric theory of deep learning

Thursday, 08.05.2025 14:15 im Raum M4

Mathematik und Informatik

The mathematical core of deep learning is function approximation by neural networks trained on data using stochastic gradient descent.

I will present a collection of results on training dynamics for the deep linear network (DLN). The DLN is a phenomenological model of deep learning for linear functions that was introduced by computer scientists. It allows a comprehensive analysis that reveals interesting ties with several areas of mathematics and several lessons for 'true' deep learning.

This is joint work with several co-authors: Nadav Cohen (Tel Aviv), Kathryn Lindsey (Boston College), Alan Chen, Zsolt Veraszto and Tianmin Yu (Brown).



Angelegt am 18.03.2025 von Claudia Lückert
Geändert am 27.03.2025 von Claudia Lückert
[Edit | Vorlage]

Kolloquium Wilhelm Killing