Issues: Lightning-AI/pytorch-lightning

Issues list

ModelCheckpoint does not work when using the monitor [bug] [needs triage] [ver: 2.2.x]
#19929 opened Jun 1, 2024 by QianhangFeng
Lightning Fabric: generic method to get the full state dict [feature] [needs triage]
#19923 opened May 30, 2024 by Xynonners
forward method missing required positional argument ‘masks’ in PyTorch Lightning [bug] [needs triage] [ver: 1.8.x]
#19921 opened May 30, 2024 by YuyaWake
The training process will stop unexpectedly [bug] [needs triage]
#19920 opened May 30, 2024 by 5huanghuai
XLA FSDP strategy has undocumented requirement for using activation checkpointing [bug] [needs triage]
#19919 opened May 29, 2024 by ebreck
LR_FIND() does not work in DDP anymore, RuntimeError: No backend type associated with device type cpu [bug] [needs triage]
#19912 opened May 27, 2024 by asusdisciple
"save_last" could not save a complete checkpoint [bug] [needs triage] [ver: 1.9.x]
#19909 opened May 27, 2024 by kxgong
AttributeError: type object 'Trainer' has no attribute 'add_argparse_args' [bug] [needs triage] [ver: 2.0.x] [ver: 2.1.x]
#19905 opened May 24, 2024 by Park-yebin
Creating A Second Comet Logger Disables The First [bug] [needs triage] [ver: 2.1.x]
#19900 opened May 23, 2024 by EtayLivne
Fabric: Incorrect num_replicas (ddp/fsdp) when number of GPUs on each node is different [bug] [needs triage]
#19898 opened May 23, 2024 by shaibagon
Error when fast_dev_run=True or num_sanity_val_steps=0 and using torchmetrics MetricTracker [bug] [needs triage] [ver: 2.2.x]
#19895 opened May 22, 2024 by MoustHolmes
MisconfigurationException: Do not set gradient_accumulation_steps in the DeepSpeed config [bug] [needs triage]
#19891 opened May 22, 2024 by mxkrn
Is "Prepare a config file for the CLI" out of date? [bug] [needs triage]
#19890 opened May 22, 2024 by zengchang233
MLFlowLogger fails when logging hyperparameters as Trainer already does automatically [bug] [needs triage] [ver: 2.1.x]
#19889 opened May 22, 2024 by CristoJV
Lightning stalls with 2 GPUs on 1 node with SLURM (and apptainer) [bug] [needs triage]
#19883 opened May 20, 2024 by sorenwacker
can't fit with ddp_notebook on a Vertex AI Workbench instance (CUDA initialized) [bug] [needs triage]
#19880 opened May 16, 2024 by jasonbrancazio
Using the MLflow logger produces Inconsistent metric plots [bug] [needs triage]
#19874 opened May 16, 2024 by gboeer
Error loading a saved model to run inference (using ddp_notebook strategy) [bug] [needs triage] [ver: 2.1.x]
#19869 opened May 15, 2024 by carlos-havier
Possible bug in recognizing mps accelerator even though PyTorch seems to register the mps device? [bug] [needs triage]
#19868 opened May 14, 2024 by adam2392
Resume training, how to change learning scheduler? [bug] [needs triage] [ver: 2.2.x]
#19865 opened May 13, 2024 by jzhanghzau