Skip to content

Source code for the experiments on the layer-wise linear mode connectivity in deep neural networks

Notifications You must be signed in to change notification settings

link-er/layer-wise-lmc

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 

Repository files navigation

Layer-wise linear mode connectivity

This is a repository with the source code for the paper "Layer-wise linear mode connectivity".

Structure of the code:

  1. LLMC:
  • data folder will contain all the downloaded datasets
  • traces folder will have stored checkpoints for further evaluations
  • federated_train.py and single_train_exp.py are for training models and saving checkpoints
  • layerwise_nonconvex_compute.py and layerwise_nonconvex_heatmap.py are for computing and plotting layer-wise barriers
  • layer_cumulative_compute.py and layer_cumulative_heatmap.py are for computing and plotting cumulative barriers
  • layerwise_robustness_eval.py and layerwise_robustness_plot.py are for computing and plotting robustness in averaging and random directions
  • eval_interpolation_multiple_layers.py and eval_interpolation_all.sh are scripts for robustness evaluation on ViTis from Sharpness and generalization
  1. SubspaceExperiments contains code for subspace noise robustness experiments

Note: for non-iid data separation we used scripts from PFL non iid

Note: for training LLMs locally we used LLM baselines

Note: for the robustness tests on ViTs (eval_interpolation_multiple_layers.py and eval_interpolation_all.sh) the missing files (like data.py, models.py, utils.py) can be found in the original repo Sharpness and generalization

About

Source code for the experiments on the layer-wise linear mode connectivity in deep neural networks

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published