Abstract
Neural networks constitute a class of functions that are typically non-surjective, with high-dimensional fibers and complicated image. We prove two main results concerning the geometry of the loss landscape of a neural network. First, we provide an explicit effective bound on the sizes of the hidden layers so that the loss landscape has no spurious valleys, which guarantees the success of gradient descent methods. Second, we present a novel method for analyzing whether a given neural network architecture with monomial activation function can represent a target function of interest. The core of our analysis method is the study of a specific set of error values, and its behavior depending on different training datasets.
| Original language | English |
|---|---|
| Title of host publication | Proceedings of the 40th International Conference on Machine Learning |
| Editors | Andreas Krause, Emma Brunskill, Kyunghyun Cho, Barbara Engelhardt, Sivan Sabato, Jonathan Scarlett |
| Publisher | Proceedings of Machine Learning Research (PMLR) |
| Pages | 28079-28099 |
| Number of pages | 21 |
| Volume | 202 |
| Publication status | Published - Jul 2023 |
| Event | Fortieth International Conference on Machine Learning - Hawaiʻi Convention Center, 1801 Kalākaua Avenue , Honolulu, United States Duration: 23 Jul 2023 → 29 Jul 2023 Conference number: 40 https://icml.cc/Conferences/2023 |
Publication series
| Name | Proceedings of Machine Learning Research |
|---|---|
| ISSN (Electronic) | 2640-3498 |
Conference
| Conference | Fortieth International Conference on Machine Learning |
|---|---|
| Abbreviated title | ICML 2023 |
| Country/Territory | United States |
| City | Honolulu |
| Period | 23/07/23 → 29/07/23 |
| Internet address |
Bibliographical note
Funding Information:The results contained in this paper are part of the author's Master's thesis project, carried out at Bonn University with advisors Daniel Huybrechts and Emre Setöz. The paper was completed during the first year of the author's doctoral studies at Vrije Universiteit Brussel with supervisor Bart Bogaerts. This work was partially supported by Fonds Wetenschappelijk Onderzoek - Vlaanderen (project G0B2221N) and the Flemish Government (Onderzoeksprogramma Artificiële Intelligentie (AI) Vlaanderen).
Funding Information:
The results contained in this paper are part of the author’s Master’s thesis project, carried out at Bonn University with advisors Daniel Huybrechts and Emre Setöz. The paper was completed during the first year of the author’s doctoral studies at Vrije Universiteit Brussel with supervisor Bart Bo-gaerts. This work was partially supported by Fonds Weten-schappelijk Onderzoek – Vlaanderen (project G0B2221N) and the Flemish Government (Onderzoeksprogramma Arti-ficiële Intelligentie (AI) Vlaanderen).
Publisher Copyright:
© 2023 Proceedings of Machine Learning Research. All rights reserved.
Fingerprint
Dive into the research topics of 'Spurious Valleys and Clustering Behavior of Neural Networks'. Together they form a unique fingerprint.Research output
- 1 Poster
-
Spurious Valleys and Clustering Behavior of Neural Networks
Pollaci, S., 21 Sept 2023, (Unpublished).Research output: Unpublished contribution to conference › Poster
-
Fortieth International Conference on Machine Learning
Pollaci, S. (Participant)
23 Jul 2023 → 29 Jul 2023Activity: Participating in or organising an event › Participation in conference
-
Spurious Valleys and Clustering Behavior of Neural Networks
Pollaci, S. (Speaker)
26 Jul 2023Activity: Talk or presentation › Talk or presentation at a conference
Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver