Issues: mosaicml/llm-foundry
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
Observing 1/2 the throughput on AMD MI250
bug
Something isn't working
#1153
opened Apr 30, 2024 by
staghado
Fine-tune dbrx-instruct on a single VM with 8 H100s
question
Further information is requested
#1105
opened Apr 10, 2024 by
hosseinsarshar
Installation issue from habana_alpha branch
bug
Something isn't working
#1090
opened Apr 4, 2024 by
palash04
Setting Dropout in MPT Prefix-LM after Exporting to HuggingFace Crashes during Fine-tuning
bug
Something isn't working
#1046
opened Mar 21, 2024 by
timsteuer
Composer crashes when attempting to load sharded checkpoint
bug
Something isn't working
#998
opened Feb 27, 2024 by
growlix
How to support multi-threaded parallel data preprocessing?
enhancement
New feature or request
#870
opened Jan 14, 2024 by
YixinSong-e
Any plan for supporting DPO?
enhancement
New feature or request
#846
opened Jan 8, 2024 by
lorabit110
Converted PrefixLM HF snapshot must enable cache for generation in config
bug
Something isn't working
#780
opened Dec 6, 2023 by
timsteuer
eval.py
hangs when config yaml's model hparams don't match model checkpoint hparams
bug
#755
opened Nov 21, 2023 by
growlix
Converting a composer seq2seq t5 model throws an exception
bug
Something isn't working
#754
opened Nov 21, 2023 by
timsteuer
PrefixLM is loaded as CausalLM after HuggingFace export
bug
Something isn't working
#739
opened Nov 15, 2023 by
timsteuer
Benchmarking GLUE tasks for in-context learning
question
Further information is requested
#707
opened Oct 31, 2023 by
ashim95
mosaicml-turbo: Where to find the repo?
question
Further information is requested
#565
opened Aug 29, 2023 by
agarvic
[Bug] Different batch_size return different evaluating result
bug
Something isn't working
#541
opened Aug 21, 2023 by
SingL3
[FEATURE] Return attention_mask for GPTQ to work
enhancement
New feature or request
#491
opened Jul 27, 2023 by
casper-hansen
Incorrect use of Something isn't working
dtype
resulting into incorrect token_ids
bug
#452
opened Jul 11, 2023 by
damin604
Gradient checkpointing issue when running QLoRA finetuning
question
Further information is requested
#413
opened Jul 1, 2023 by
tytung2020
Unify export script arguments after next version release
enhancement
New feature or request
#394
opened Jun 29, 2023 by
codestar12
StreamingTextDataset
's default dtype for binarized data should be int32
enhancement
#321
opened Jun 14, 2023 by
vancoykendall
Previous Next
ProTip!
Add no:assignee to see everything that’s not assigned.