Abstract
We develop a diagrammatic approach analyzing Gibbs measures (i.e., Bayesian posteriors) in deep shaped multilayer perceptrons at arbitrary temperature. This gives the first (perturbatively) solvable model of learning with nonlinear neural networks where the input dimension N0, depth L, width N, and number of samples P can all be large without severe assumptions on either initialization or training data statistics. The limits N0,N,L,P→∞ do not commute, resulting in a rich phase diagram of learning regimes that we study to first order in 1/N. We find in particular that the ratio LP/N defines a critical depth necessary for feature learning: if LP/N→0, then Bayesian posteriors coincide with those of a kernel method. Regimes where LP/N→λ>0, in contrast, correspond to learning with data-dependent deformations of these kernels, and we provide explicit formulas for the resulting features to first order in 1/N.
| Original language | English (US) |
|---|---|
| Article number | 067301 |
| Journal | Physical review letters |
| Volume | 136 |
| Issue number | 6 |
| DOIs | |
| State | Published - Feb 13 2026 |
| Externally published | Yes |
All Science Journal Classification (ASJC) codes
- General Physics and Astronomy
Fingerprint
Dive into the research topics of 'Gibbs Measures from Deep Shaped Multilayer Perceptrons'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver