Unified Neural Scaling Law Predicts Deep Learning Performance Across Tasks
A recent paper on arXiv introduces a Unified Neural Scaling Law (UNSL) that captures and predicts the scaling dynamics of deep neural networks across multiple dimensions, such as model parameters, dataset size, training and inference steps, compute, and hyperparameters. This functional model is applicable to various tasks and architectures in fields like vision, language, math, and reinforcement learning. According to the authors, UNSL provides significantly more precise extrapolations compared to current scaling laws. The paper falls under the category of Computer Science > Machine Learning and includes associated code, data, and media. Additionally, this arXiv submission is part of the arXivLabs initiative, which promotes community collaboration and openness.
Key facts
- The paper introduces a Unified Neural Scaling Law (UNSL).
- UNSL models scaling as multiple dimensions vary simultaneously.
- Dimensions include model parameters, dataset size, training steps, inference steps, compute, and hyperparameters.
- The law applies to vision, language, math, and reinforcement learning tasks.
- UNSL yields more accurate extrapolations than other functional forms.
- The paper is on arXiv under Computer Science > Machine Learning.
- Code, data, and media are associated with the article.
- The submission is part of arXivLabs, which values openness and community.
Entities
Institutions
- arXiv
- arXivLabs