Feature Learning in Two-layer Neural Networks: The Effect of Data Covariance

We study the effect of gradient-based optimization on feature learning in two-layer neural networks. We consider a setting where the number of samples is of the same order as the input dimension and show that, when the input data is isotropic, gradient descent always improves upon the initial random features model in terms of prediction risk, for a certain class of targets. Further leveraging the practical observation that data often contains additional structure, i.e., the input covariance has non-trivial alignment with the target, we prove that the class of learnable targets can be significantly extended, demonstrating a clear separation between kernel methods and two-layer neural networks in this regime.

https://www.youtube.com/watch?v=9J-tVrDSm44&list=PLhnghgyZINr9Z0wxdeBZhkbUjGuzI889A&index=1

Online

Lifelong Learning

Speaker Information

Murat Erdogdu, University of Toronto

