Skip to content

Issues: mosaicml/llm-foundry

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Author
Filter by author
Label
Filter by label
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Milestones
Filter by milestone
Assignee
Filter by who’s assigned
Sort

Issues list

Observing 1/2 the throughput on AMD MI250 bug Something isn't working
#1153 opened Apr 30, 2024 by staghado
Fine-tune dbrx-instruct on a single VM with 8 H100s question Further information is requested
#1105 opened Apr 10, 2024 by hosseinsarshar
Installation issue from habana_alpha branch bug Something isn't working
#1090 opened Apr 4, 2024 by palash04
Composer crashes when attempting to load sharded checkpoint bug Something isn't working
#998 opened Feb 27, 2024 by growlix
Any plan for supporting DPO? enhancement New feature or request
#846 opened Jan 8, 2024 by lorabit110
Converting a composer seq2seq t5 model throws an exception bug Something isn't working
#754 opened Nov 21, 2023 by timsteuer
PrefixLM is loaded as CausalLM after HuggingFace export bug Something isn't working
#739 opened Nov 15, 2023 by timsteuer
Benchmarking GLUE tasks for in-context learning question Further information is requested
#707 opened Oct 31, 2023 by ashim95
mosaicml-turbo: Where to find the repo? question Further information is requested
#565 opened Aug 29, 2023 by agarvic
Finetuning Models bug Something isn't working
#562 opened Aug 27, 2023 by ak2028
[Bug] Different batch_size return different evaluating result bug Something isn't working
#541 opened Aug 21, 2023 by SingL3
[FEATURE] Return attention_mask for GPTQ to work enhancement New feature or request
#491 opened Jul 27, 2023 by casper-hansen
eval.py error while benchmarking T5 bug Something isn't working
#460 opened Jul 14, 2023 by sigjhl
Incorrect use of dtype resulting into incorrect token_ids bug Something isn't working
#452 opened Jul 11, 2023 by damin604
Gradient checkpointing issue when running QLoRA finetuning question Further information is requested
#413 opened Jul 1, 2023 by tytung2020
ProTip! Add no:assignee to see everything that’s not assigned.