(ICLR 2022 Spotlight) Official PyTorch implementation of "How Do Vision Transformers Work?"
Create animations for the optimization trajectory of neural nets
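Trajectory and landscape visualizations like the one above generally work by evaluating the loss along chosen directions in weight space. A minimal NumPy sketch of the underlying idea, a 1-D loss-landscape slice (the function names and the toy quadratic loss are illustrative, not taken from any of the listed repos):

```python
import numpy as np

def loss_slice(loss_fn, w, direction=None, radius=1.0, steps=41):
    """Evaluate loss_fn along the line w + alpha * d in parameter space.

    This is the basic recipe behind 1-D loss-landscape plots:
    pick a direction d, sweep a scalar alpha, record the loss.
    """
    rng = np.random.default_rng(0)
    d = direction if direction is not None else rng.standard_normal(w.shape)
    d = d / np.linalg.norm(d)                 # unit-length direction
    alphas = np.linspace(-radius, radius, steps)
    return alphas, np.array([loss_fn(w + a * d) for a in alphas])

# Toy quadratic loss with its minimum at the origin.
quad = lambda w: 0.5 * float(w @ w)
alphas, losses = loss_slice(quad, np.zeros(10))
# Slicing a quadratic through its minimum gives a parabola,
# so the smallest value sits at alpha = 0, the center of the sweep.
```

Animating an optimization trajectory amounts to repeating such evaluations on a 2-D grid spanned by two directions and overlaying the iterates frame by frame.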
[TMLR] "Can You Win Everything with Lottery Ticket?" by Tianlong Chen, Zhenyu Zhang, Jun Wu, Randy Huang, Sijia Liu, Shiyu Chang, Zhangyang Wang
[NeurIPS 2021] Towards Better Understanding of Training Certifiably Robust Models against Adversarial Examples | ⛰️
[Int. J. Comput. Vis. 2024] Revisiting Deep Ensemble for Out-of-Distribution Detection: A Loss Landscape Perspective
Worth-reading papers and related resources on deep learning optimization algorithms.
Surrogate Gap Guided Sharpness-Aware Minimization (GSAM) implementation for keras/tensorflow 2
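GSAM extends Sharpness-Aware Minimization (SAM): the base SAM update it builds on perturbs the weights toward a local worst case before taking the descent step. A minimal NumPy sketch of that base step only (GSAM's surrogate-gap term is omitted; the toy loss and all names here are illustrative):

```python
import numpy as np

def loss(w):
    # Toy non-convex loss: a quadratic bowl plus small ripples.
    return 0.5 * float(w @ w) + 0.1 * float(np.sum(np.sin(5.0 * w)))

def grad(w):
    # Analytic gradient of the toy loss above.
    return w + 0.5 * np.cos(5.0 * w)

def sam_step(w, lr=0.05, rho=0.05):
    """One base SAM update: ascend to a worst-case neighbor within
    radius rho, then descend using the gradient evaluated there."""
    g = grad(w)
    eps = rho * g / (np.linalg.norm(g) + 1e-12)  # ascent direction
    return w - lr * grad(w + eps)                # descend with "sharp" gradient

w = np.full(4, 2.0)
for _ in range(200):
    w = sam_step(w)
```

In a Keras/TensorFlow implementation the same two-pass structure appears as two gradient computations per batch inside a custom training step.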
This project builds on recent research that explores the phenomenon of Grokking. The goal is to investigate when, why, and how grokking occurs, focusing on transformers under various batch sizes.