On Size-Independent Sample Complexity of ReLU Networks (2306.01992v3)
Published 3 Jun 2023 in cs.LG and stat.ML
Abstract: We study the sample complexity of learning ReLU neural networks from the point of view of generalization. Given norm constraints on the weight matrices, a common approach is to estimate the Rademacher complexity of the associated function class. Previously, Golowich, Rakhlin, and Shamir (2020) obtained a bound independent of the network size (scaling with a product of Frobenius norms), except for a factor of the square root of the depth. We give a refinement which often has no explicit depth-dependence at all.
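For context, a schematic form of the earlier bound (a sketch using illustrative notation not taken from this paper: depth L, sample size n, input norm bound B, and Frobenius-norm bounds M_F(1), ..., M_F(L) on the weight matrices) is, up to constants and logarithmic factors,

\[
  \mathcal{R}_n \;\lesssim\; \frac{B\,\sqrt{L}\,\prod_{j=1}^{L} M_F(j)}{\sqrt{n}},
\]

where \(\mathcal{R}_n\) denotes the Rademacher complexity of the norm-constrained network class. The refinement announced in the abstract targets the explicit \(\sqrt{L}\) factor, removing it in many regimes; the precise statement and its conditions are given in the paper.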