
Inductive bias of neural networks on 1D regression: an empirical examination

This repository houses the code for my MPhil dissertation. The full write-up is available here; the abstract follows:

Modern network architectures generalise well even when the size of the network is very large relative to the amount of data it is trained on. This contradicts the received wisdom from statistical learning theory that models with high capacity overfit the training data and do poorly on test data. One explanation for why neural networks generalise so well comes in the form of an implicit bias of the optimisation process. Recent theoretical results pinpoint the bias of gradient descent optimisation toward a class of smooth functions called interpolating splines. In this paper, we conduct a large-scale empirical evaluation of these results in the univariate regression case, varying the training set-up along several hyperparameters commonly tweaked in practice. We find that these results are robust for shallow networks, but that the bias appears to change as network depth increases. We additionally highlight several areas that could be explored further in order to better understand this bias and to generate practical recommendations for future machine learning systems.
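
To make the setting concrete, below is a minimal sketch (not the repository's actual code; all hyperparameters and function choices are illustrative) of the kind of experiment the abstract describes: train a wide, shallow ReLU network to interpolate a handful of 1D points with gradient-based optimisation, then compare the learned function against a cubic spline fitted through the same points. It assumes PyTorch and SciPy are available.

```python
# Illustrative sketch only: shallow 1D regression vs. a cubic spline.
import numpy as np
import torch
import torch.nn as nn
from scipy.interpolate import CubicSpline

torch.manual_seed(0)

# A handful of 1D training points sampled from a smooth target function.
x_train = torch.linspace(-1.0, 1.0, 8).unsqueeze(1)
y_train = torch.sin(3.0 * x_train)

# A shallow (one-hidden-layer) ReLU network, wide relative to the data.
model = nn.Sequential(nn.Linear(1, 512), nn.ReLU(), nn.Linear(512, 1))
opt = torch.optim.Adam(model.parameters(), lr=1e-3)

# Train until the network (nearly) interpolates the training set.
for _ in range(5000):
    opt.zero_grad()
    loss = nn.functional.mse_loss(model(x_train), y_train)
    loss.backward()
    opt.step()

# Compare the learned function with a cubic spline through the same points
# on a dense grid between the training inputs.
x_dense = np.linspace(-1.0, 1.0, 200)
spline = CubicSpline(x_train.squeeze().numpy(), y_train.squeeze().numpy())
with torch.no_grad():
    x_t = torch.tensor(x_dense, dtype=torch.float32).unsqueeze(1)
    y_net = model(x_t).squeeze().numpy()
print("max |network - spline| on dense grid:", np.abs(y_net - spline(x_dense)).max())
```

The dissertation's experiments vary this basic set-up (depth, width, optimiser and other hyperparameters) and measure how closely the learned interpolant tracks the spline.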

About

Shallow neural networks interpolate like cubic/polyharmonic splines.
