There Will Be a Scientific Theory of Deep Learning
This paper posits the emergence of a "scientific theory of deep learning" that would systematically explain the training dynamics and performance of neural networks. It synthesizes five major research strands into a unified framework and coins the term "learning mechanics" for this burgeoning field. The call for theoretical unification will appeal to the HN crowd's desire for principled understanding beyond empirical observation in a rapidly evolving AI landscape.
The Lowdown
The paper "There Will Be a Scientific Theory of Deep Learning" by Jamie Simon et al. argues that a comprehensive scientific framework for understanding deep learning is beginning to take shape. It asserts that this theory, which they dub "learning mechanics," will elucidate the intricate dynamics of neural network training, representation, and performance.
- The authors define a scientific theory of deep learning as one that characterizes crucial properties and statistics across the training process, hidden layers, final weights, and overall network performance.
- They identify five key research areas contributing to this nascent theory: solvable idealized models, tractable limits that yield fundamental insights, simple mathematical laws for macroscopic behaviors, theories that disentangle the effects of individual hyperparameters, and the identification of universal behaviors.
- These research directions collectively prioritize the dynamics of the training process, focus on coarse aggregate statistics, and emphasize the importance of falsifiable quantitative predictions.
- The paper introduces "learning mechanics" as the proposed name for this emerging theory, positioning it as a fundamental mechanics of the learning process.
- It anticipates a synergistic relationship between this "learning mechanics" and the field of mechanistic interpretability, where a deeper theoretical understanding can inform how we interpret network internals.
- The authors also address common criticisms regarding the feasibility or importance of a fundamental deep learning theory, offering rebuttals and outlining future research directions.
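To make the "simple mathematical laws" and "falsifiable quantitative predictions" strands concrete: one widely studied example of such a law is the neural scaling law, where test loss follows an approximate power law in model size N, L(N) ≈ a·N^(−b). The sketch below is purely illustrative (it is not from the paper): it fits the exponent on synthetic data via a log-log linear fit, then extrapolates to an unseen scale, which is exactly the kind of falsifiable quantitative prediction the authors have in mind.

```python
import numpy as np

# Illustrative only: synthetic losses drawn from a known power law
# L(N) = a * N^(-b) with a=5.0, b=0.3, plus small multiplicative noise.
rng = np.random.default_rng(0)
N = np.array([1e6, 3e6, 1e7, 3e7, 1e8, 3e8, 1e9])
true_a, true_b = 5.0, 0.3
loss = true_a * N ** (-true_b) * np.exp(rng.normal(0.0, 0.01, size=N.shape))

# A power law is linear in log-log space: log L = log a - b * log N,
# so a straight-line fit recovers the exponent and prefactor.
slope, intercept = np.polyfit(np.log(N), np.log(loss), 1)
b_hat, a_hat = -slope, np.exp(intercept)

print(f"fitted exponent b ~ {b_hat:.3f}, prefactor a ~ {a_hat:.2f}")

# The fitted law yields a falsifiable prediction at an unseen scale N=1e10:
# train the larger model, measure its loss, and check the law against it.
predicted = a_hat * 1e10 ** (-b_hat)
print(f"predicted loss at N=1e10: {predicted:.4f}")
```

The point of the exercise is methodological, not the specific constants: a coarse aggregate statistic (test loss) obeys a simple law whose extrapolations can be checked and potentially refuted at larger scale.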
Ultimately, this paper makes a compelling case for moving beyond empirical observation in deep learning to establish a rigorous, predictive scientific theory, providing both a roadmap for its development and a new conceptual lens through which to view the field.