Highlights
- Pro
Pinned Loading
-
early_stopping_double_descent
early_stopping_double_descent PublicForked from MLI-lab/early_stopping_double_descent
Code for reproducing figures and results in the paper ``Early stopping in deep networks: Double descent and how to eliminate it''
Jupyter Notebook
-
pytorch-original-transformer
pytorch-original-transformer PublicForked from gordicaleksa/pytorch-original-transformer
My implementation of the original transformer model (Vaswani et al.). I've additionally included the playground.py file for visualizing otherwise seemingly hard concepts. Currently included IWSLT p…
Jupyter Notebook
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.