-
Notifications
You must be signed in to change notification settings - Fork 2.7k
Issues: NVIDIA/NeMo
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
Error in saving nemo checkpoint with Llama-3.1-70B SFT. /opt/NeMo/nemo/utils/callbacks/nemo_model_checkpoint.py
bug
Something isn't working
stale
#12157
opened Feb 12, 2025 by
songwang41
Possible bug in ASRDecoderTimeStamps - math.ceil on fractional tokens_per_chunk leads to timestamps displacements on long files
bug
Something isn't working
#11604
opened Dec 15, 2024 by
bene-ges
when i use container to do sft for any model, it has context not found error
bug
Something isn't working
#11825
opened Jan 11, 2025 by
munger1985
Pickling error when trying to save checkpoints with custom checkpointIO
bug
Something isn't working
#11955
opened Jan 24, 2025 by
jdnurme
Add option for prefetch factor of data loader to config
stale
#11977
opened Jan 28, 2025 by
shengshiqi-google
ASR: How to convert .ckpt to nemo correctly?
ASR
bug
Something isn't working
#12003
opened Jan 31, 2025 by
ican24
llava-like dataset implementation "LazySupervisedDataset" likely fails to handle large dataset
#12034
opened Feb 3, 2025 by
bernardhan33
AttributeError: 'HFDatasetDataModule' object has no attribute 'tokenizer'
bug
Something isn't working
#12080
opened Feb 6, 2025 by
j40903272
Bug when generating confidence scores with timestamps for a buffered rnnt model
ASR
bug
Something isn't working
#11456
opened Dec 3, 2024 by
aanchan
Fail to convert trained checkpoint to HF format
bug
Something isn't working
stale
#12124
opened Feb 10, 2025 by
Zhihan1996
I am trying to train the FastConformer 120M model from scratch, but it is not converging?
ASR
help wanted
Extra attention is needed
#12167
opened Feb 13, 2025 by
PhamDangNguyen
Update TE version for support of Something isn't working
pad_between_seqs=True
bug
#12174
opened Feb 13, 2025 by
cyanguwa
HiFiGAN Finetune "Cannot re-initialize CUDA in forked subprocess."
bug
Something isn't working
#12178
opened Feb 13, 2025 by
Fournogo
Support configuration of num_workers and max_samples_per_sequence in llava_next_pretrain
#12195
opened Feb 14, 2025 by
bernardhan33
Pre-Training Neva under pipeline parallel set to 2.
bug
Something isn't working
#12205
opened Feb 16, 2025 by
takuya576
loss divergence when CP>1 and MBS>1
bug
Something isn't working
#12210
opened Feb 17, 2025 by
hawkoli1987
HUGE Inconsistency between logged tokens_per_second_per_GPU and actual wall time and Global Step is Not Monotonically Increasing
bug
Something isn't working
#12727
opened Mar 21, 2025 by
aflah02
Previous Next
ProTip!
Type g p on any issue or pull request to go back to the pull request listing page.