What Does a Neural Network Look Like If It Is Trained for a Long Time?

Watch Video

04 26, 2022

 

Speaker: Prof. Weijie Su, University of Pennsylvania

Title: What Does a Neural Network Look Like If It Is Trained for a Long Time?

Time& VenueApril 26, 2022 09:00-10:00 Tencent Meeting: 280-364-453

Abstract: The remarkable development of deep learning over the past decade relies heavily on sophisticated heuristics and tricks. To better exploit its potential in the coming decade, perhaps a rigorous framework for reasoning deep learning is needed, which however is not easy to build due to the intricate details of modern neural networks. For near-term purposes, a practical alternative is to develop a mathematically tractable surrogate model that yet maintains many characteristics of deep learning models. This talk introduces a model of this kind as a tool toward understanding deep learning. The effectiveness of this model, which we term the Layer-Peeled Model, is evidenced by two use cases. First, we use this model to explain an empirical pattern of deep learning recently discovered by David Donoho and his students. Moreover, this model predicts a hitherto unknown phenomenon that we term Minority Collapse in deep learning training. This is based on arXiv:2101.12699 and arXiv:2110.02796.

 

 

 

 

Contacts: Prof. Weijie Su

E-mail:

Copyright@2008,All Rights Reserved, Academy of Mathematics and Systems Science,CAS
Tel:86-10-82541777 Fax: 86-10-82541972 E-mail: contact@amss.ac.cn
京ICP备05002806-1号 京公网安备110402500020号