-
Notifications
You must be signed in to change notification settings - Fork 3.4k
Issues: Lightning-AI/pytorch-lightning
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
Deepspeed ZERO MiCS support
feature
Is an improvement or enhancement
needs triage
Waiting to be triaged by maintainers
#20378
opened Oct 31, 2024 by
hehepig4
Custom Subcommand without Model arg
bug
Something isn't working
needs triage
Waiting to be triaged by maintainers
ver: 2.4.x
#20374
opened Oct 29, 2024 by
enrico-stauss
FSDP checkpoint loading fails
bug
Something isn't working
needs triage
Waiting to be triaged by maintainers
ver: 2.4.x
#20373
opened Oct 29, 2024 by
Nilabhra
metrics csv in ddp mode
bug
Something isn't working
needs triage
Waiting to be triaged by maintainers
ver: 2.2.x
#20371
opened Oct 29, 2024 by
ruyanyinian
FutureWarning: Something isn't working
needs triage
Waiting to be triaged by maintainers
ver: 2.4.x
torch.cuda.amp.custom_bwd(args...)
is deprecated. Please use torch.amp.custom_bwd(args..., device_type='cuda')
instead.
bug
#20370
opened Oct 28, 2024 by
loretoparisi
Wandb 1.x step handling
feature
Is an improvement or enhancement
needs triage
Waiting to be triaged by maintainers
#20368
opened Oct 28, 2024 by
edmcman
Training stuck at the first iter can't get corresponding pid
bug
Something isn't working
needs triage
Waiting to be triaged by maintainers
#20367
opened Oct 28, 2024 by
yejr0229
Tuner.scale_batch_size(max_val=1024)
feature
Is an improvement or enhancement
needs triage
Waiting to be triaged by maintainers
#20364
opened Oct 24, 2024 by
edmcman
Resume training from checkpoints
docs
Documentation related
needs triage
Waiting to be triaged by maintainers
#20361
opened Oct 23, 2024 by
ArkashJ
load data sequence is confusing
bug
Something isn't working
needs triage
Waiting to be triaged by maintainers
ver: 2.4.x
#20360
opened Oct 22, 2024 by
workhours
load data sequence is confusing
bug
Something isn't working
needs triage
Waiting to be triaged by maintainers
ver: 2.4.x
#20359
opened Oct 22, 2024 by
workhours
load data sequence is confusing
bug
Something isn't working
needs triage
Waiting to be triaged by maintainers
ver: 2.4.x
#20358
opened Oct 22, 2024 by
workhours
Type annotation for Something isn't working
needs triage
Waiting to be triaged by maintainers
ver: 2.4.x
BasePredictionWriter
subclass
bug
#20356
opened Oct 22, 2024 by
saiden89
LearningRateFinder creates errors for schedulers in Something isn't working
needs triage
Waiting to be triaged by maintainers
ver: 2.4.x
val
stage
bug
#20355
opened Oct 21, 2024 by
DeanLa
Gradient accumulation calcluation may be incorrect
bug
Something isn't working
needs triage
Waiting to be triaged by maintainers
ver: 2.4.x
#20350
opened Oct 19, 2024 by
tyler-rt
Add support S3 as a storage option for profiling results
feature
Is an improvement or enhancement
needs triage
Waiting to be triaged by maintainers
#20348
opened Oct 18, 2024 by
kimminw00
Can't resume automatically a job, ckpt_path="hpc" throws ValueError from the start
bug
Something isn't working
needs triage
Waiting to be triaged by maintainers
ver: 2.4.x
#20347
opened Oct 18, 2024 by
F-Barto
tensorboard step and self.global_step do not correspond under accumulate_grad
bug
Something isn't working
needs triage
Waiting to be triaged by maintainers
ver: 2.4.x
#20346
opened Oct 18, 2024 by
wuzhiyue111
Everything prints fine, but the loss doesn't descent
bug
Something isn't working
needs triage
Waiting to be triaged by maintainers
ver: 2.3.x
#20344
opened Oct 15, 2024 by
2catycm
Impove how argument passing via CLI and config file is handled in regards to argument linking
feature
Is an improvement or enhancement
needs triage
Waiting to be triaged by maintainers
#20341
opened Oct 14, 2024 by
MrWhatZitToYaa
DDP and BackboneFinetuning: model weights get out of sync when unfreezing layers for training
bug
Something isn't working
needs triage
Waiting to be triaged by maintainers
ver: 2.4.x
#20340
opened Oct 13, 2024 by
ksikka
PyTorchProfiler: not showing CPU memory used even with Something isn't working
needs triage
Waiting to be triaged by maintainers
ver: 2.4.x
profile_memory=True
bug
#20339
opened Oct 13, 2024 by
Jack12xl
restore_training_state before on_fit_start?
feature
Is an improvement or enhancement
needs triage
Waiting to be triaged by maintainers
#20338
opened Oct 12, 2024 by
lampuiho
LightningCLI
doesn't fail when config.yaml
contains invalid arguments
bug
#20337
opened Oct 11, 2024 by
adosar
Unreadable font color theme of YAML files
docs
Documentation related
needs triage
Waiting to be triaged by maintainers
#20335
opened Oct 10, 2024 by
MrWhatZitToYaa
Previous Next
ProTip!
Add no:assignee to see everything that’s not assigned.