Code Release for "Broken Neural Scaling Laws" (BNSL) paper (arxiv.org/abs/2210.14891)

Read Appendix A.6 of arXiv version of this paper for more details on how to use this code.

To reproduce the Fitting and Extrapolation of BNSL on 4 Digit Addition from Figure 5 Left, run

python fit_bnsl_and_extrapolate__4_digit_addition__dataset_size_x-axis.py

To reproduce the Fitting and Extrapolation of BNSL on a noiseless simulation of the scaling behavior of 4 Digit Addition from Figure 5 Right, run

python fit_bnsl_and_extrapolate__4_digit_addition__dataset_size_x-axis__noiseless_simulation.py

To reproduce the Decomposition of BNSL into Power Law Segments from Figure 1, run

python make_figure_1__decomposition_of_bnsl_into_power_law_segments.py

Note:

🚨🚨🚨

When you fit a BNSL to your own scaling data, you may need to adjust the grid search range and resolution to get a good fit.

🚨🚨🚨

Here is some bibtex to use for citation:

@inproceedings{
caballero2023broken,
title={Broken Neural Scaling Laws},
author={Ethan Caballero and Kshitij Gupta and Irina Rish and David Krueger},
booktitle={The Eleventh International Conference on Learning Representations },
year={2023},
url={https://arxiv.org/abs/2210.14891}
}

Name		Name	Last commit message	Last commit date
Latest commit History 65 Commits
README.md		README.md
figure_1.png		figure_1.png
fit_bnsl_and_extrapolate__4_digit_addition__dataset_size_x-axis.py		fit_bnsl_and_extrapolate__4_digit_addition__dataset_size_x-axis.py
fit_bnsl_and_extrapolate__4_digit_addition__dataset_size_x-axis__noiseless_simulation.py		fit_bnsl_and_extrapolate__4_digit_addition__dataset_size_x-axis__noiseless_simulation.py
make_figure_1__decomposition_of_bnsl_into_power_law_segments.py		make_figure_1__decomposition_of_bnsl_into_power_law_segments.py
plot__bnsl__fit_and_extrapolate__4_digit_addition__dataset_size_x-axis.png		plot__bnsl__fit_and_extrapolate__4_digit_addition__dataset_size_x-axis.png
plot__bnsl__fit_and_extrapolate__4_digit_addition__dataset_size_x-axis__noiseless_simulation.png		plot__bnsl__fit_and_extrapolate__4_digit_addition__dataset_size_x-axis__noiseless_simulation.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Code Release for "Broken Neural Scaling Laws" (BNSL) paper (arxiv.org/abs/2210.14891)

Note:

Here is some bibtex to use for citation:

About

Releases

Packages

Contributors 2

Languages

ethancaballero/broken_neural_scaling_laws

Folders and files

Latest commit

History

Repository files navigation

Code Release for "Broken Neural Scaling Laws" (BNSL) paper (arxiv.org/abs/2210.14891)

Note:

Here is some bibtex to use for citation:

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages