Non-perturbative renormalization for the neural network-QFT correspondence

Erbin, H and Lahoche, V and Ousmane Samary, D (2022) Non-perturbative renormalization for the neural network-QFT correspondence. Machine Learning: Science and Technology, 3 (1). 015027. ISSN 2632-2153

[thumbnail of Erbin_2022_Mach._Learn.__Sci._Technol._3_015027.pdf] Text
Erbin_2022_Mach._Learn.__Sci._Technol._3_015027.pdf - Published Version

Download (1MB)

Abstract

In a recent work (Halverson et al 2021 Mach. Learn.: Sci. Technol. 2 035002), Halverson, Maiti and Stoner proposed a description of neural networks (NNs) in terms of a Wilsonian effective field theory. The infinite-width limit is mapped to a free field theory while finite N corrections are taken into account by interactions (non-Gaussian terms in the action). In this paper, we study two related aspects of this correspondence. First, we comment on the concepts of locality and power-counting in this context. Indeed, these usual space-time notions may not hold for NNs (since inputs can be arbitrary), however, the renormalization group (RG) provides natural notions of locality and scaling. Moreover, we comment on several subtleties, for example, that data components may not have a permutation symmetry: in that case, we argue that random tensor field theories could provide a natural generalization. Second, we improve the perturbative Wilsonian renormalization from Halverson et al (2021 Mach. Learn.: Sci. Technol. 2 035002) by providing an analysis in terms of the non-perturbative RG using the Wetterich-Morris equation. An important difference with usual non-perturbative RG analysis is that only the effective infrared 2-point function is known, which requires setting the problem with care. Our aim is to provide a useful formalism to investigate NNs behavior beyond the large-width limit (i.e. far from Gaussian limit) in a non-perturbative fashion. A major result of our analysis is that changing the standard deviation of the NN weight distribution can be interpreted as a renormalization flow in the space of networks. We focus on translations invariant kernels and provide preliminary numerical results.

Item Type: Article
Subjects: Digital Academic Press > Multidisciplinary
Depositing User: Unnamed user with email support@digiacademicpress.org
Date Deposited: 06 Jul 2023 04:17
Last Modified: 14 Sep 2024 04:04
URI: http://science.researchersasian.com/id/eprint/1663

Actions (login required)

View Item
View Item