2000 character limit reached
On the Neural Tangent Kernel of Equilibrium Models (2310.14062v1)
Published 21 Oct 2023 in cs.LG and cs.AI
Abstract: This work studies the neural tangent kernel (NTK) of the deep equilibrium (DEQ) model, a practical ``infinite-depth'' architecture which directly computes the infinite-depth limit of a weight-tied network via root-finding. Even though the NTK of a fully-connected neural network can be stochastic if its width and depth both tend to infinity simultaneously, we show that contrarily a DEQ model still enjoys a deterministic NTK despite its width and depth going to infinity at the same time under mild conditions. Moreover, this deterministic NTK can be found efficiently via root-finding.
- Zhili Feng (22 papers)
- J. Zico Kolter (151 papers)