Lightning-AI / pytorch-lightning Public


Lightning-AI pytorch-lightning Discussions

Pinned Discussions

Welcome to Lightning Discussions!
General williamFalcon

Categories

code help: CV
code help: NLP / ASR / TTS
code help: RL / MetaLearning
DDP / multi-GPU / multi-node
General
Idea pool
Lightning App API: LightningApp, LightningFlow, LightningWork
Lightning Trainer API: Trainer, LightningModule, LightningDataModule
Polls
Show off your work

Discussions


What is trainer/global_step in wandb logging?

edmcman asked Oct 30, 2024 in Lightning Trainer API: Trainer, LightningModule, LightningDataModule · Unanswered

1

Gradient checkpointing with DDP in a loop

shivammehta25 asked Nov 11, 2021 in Lightning Trainer API: Trainer, LightningModule, LightningDataModule · Unanswered

8

Optuna HPO & Lightning Multi-GPU Training using DDP on SLURM - ValueError: World Size does not Match

eTuDpy asked May 30, 2024 in DDP / multi-GPU / multi-node · Unanswered

4

Exact behaviour of .prepare_data() and .setup() in LightningDataModule when passing DM or dataloaders to Trainer in respect to internal hooks.

tiefenthaler asked Oct 29, 2024 in Lightning Trainer API: Trainer, LightningModule, LightningDataModule · Unanswered

0

How to change auto-requeue hpc.ckpt path

arijit-hub started Oct 22, 2024 in General · Closed

1

Active Learning Trainer

BerAnton asked Oct 25, 2024 in Lightning Trainer API: Trainer, LightningModule, LightningDataModule · Unanswered

0

Collect Threads after Trainer.fit

42elenz asked Oct 23, 2024 in DDP / multi-GPU / multi-node · Unanswered

0

Logging in Multi - GPU and new on_validation_step function

42elenz asked Oct 23, 2024 in Lightning Trainer API: Trainer, LightningModule, LightningDataModule · Unanswered

0

Both GPUs are getting same data from IterableDataset with DDP

akshat-suwalka asked Jul 22, 2024 in DDP / multi-GPU / multi-node · Unanswered

1

Collective mismatch at end of training epoch

valtsblukis asked Aug 3, 2022 in DDP / multi-GPU / multi-node · Answered

3

Using sklearn data pre-processing pipelines inside LightningDataModule

tiefenthaler asked Apr 23, 2024 in Lightning Trainer API: Trainer, LightningModule, LightningDataModule · Unanswered

1

could not find the monitored key in the returned metrics

morestart started Oct 17, 2023 in General · Closed

4

Issue with ToTensor Transform and Multiprocessing in PyTorch Lightning

DaniMlk asked Oct 15, 2024 in DDP / multi-GPU / multi-node · Unanswered

0

Dataloader reinsertion for recursive predictions

leonardcaquot94 started Oct 11, 2024 in General

0

Conflict between Lightning and Huggingface Transformers (device_map).

richarddwang started Jun 20, 2023 in General

4

I want to apply custom learning rate scheduler.

sooftware asked Jun 19, 2021 in Lightning Trainer API: Trainer, LightningModule, LightningDataModule · Answered

11

AttributeError: 'Trainer' object has no attribute 'val_dataloader'

kiddycharles asked Mar 5, 2024 in Lightning Trainer API: Trainer, LightningModule, LightningDataModule · Closed · Unanswered

2

Multi-GPU Inference
distributed · callback: prediction writer · trainer: predict
ricardorei asked Sep 1, 2021 in Lightning Trainer API: Trainer, LightningModule, LightningDataModule · Unanswered

27

Timed out initializing process group in store based....

EvanZ asked Nov 9, 2022 in Lightning Trainer API: Trainer, LightningModule, LightningDataModule · Unanswered

5

Custom train loop to perform partial batch updates

leonardcaquot94 asked Oct 4, 2024 in Lightning Trainer API: Trainer, LightningModule, LightningDataModule · Unanswered

0

Alternate "prediction" loops

kaboroevich asked Oct 4, 2024 in Lightning Trainer API: Trainer, LightningModule, LightningDataModule · Closed · Answered

2

val loss and accuracy not logging

gkrampah asked Oct 2, 2024 in Lightning Trainer API: Trainer, LightningModule, LightningDataModule · Unanswered

0

All time taken by {method 'enable' of '_lsprof.Profiler' objects} in Advanced profiler output
profiler
SeguinBe asked Feb 7, 2022 in Lightning Trainer API: Trainer, LightningModule, LightningDataModule · Unanswered

9

The trainers runs a single validation step after resume (not sanity)

cdancette asked Jul 18, 2023 in Lightning Trainer API: Trainer, LightningModule, LightningDataModule · Unanswered

13

Configuring OneCycleLR from yaml file lightning CLI

zarkoivkovicc asked Mar 23, 2024 in Lightning Trainer API: Trainer, LightningModule, LightningDataModule · Unanswered

2
