Issues: CarperAI/trlx
Issues list
[New Feature Request] Add KTO
feature request
New feature or request
#590
opened Jan 27, 2024 by
1485840691-eng
Integration of Self-Play Fine-Tuning (SPIN) Method for Enhancing Large Language Models
feature request
New feature or request
#588
opened Jan 11, 2024 by
SeungyounShin
Runtime error when running examples (ilql_sentiments_t5.py)
bug
Something isn't working
#587
opened Jan 8, 2024 by
youxiho1
Multi-GPU training errors with peft
bug
Something isn't working
#581
opened Nov 20, 2023 by
AliengirlLiv
Issue since most recent transformers update
bug
Something isn't working
#580
opened Nov 11, 2023 by
siddharthverma314
multigpu support for summarization ppo example
bug
Something isn't working
#571
opened Oct 21, 2023 by
sayan1101
Problem with LLama training with LoRA
bug
Something isn't working
#567
opened Oct 17, 2023 by
freQuensy23-coder
Question about saving peft checkpoint
bug
Something isn't working
#565
opened Oct 13, 2023 by
nhanph
How to generate reward-labeled dataset
feature request
New feature or request
#561
opened Sep 20, 2023 by
mikkelmedm
Increasing max new tokens for generation arguments lead to errors
bug
Something isn't working
#553
opened Sep 4, 2023 by
wise-east
Add support for Falcon 7B/40B
feature request
New feature or request
#532
opened Jul 19, 2023 by
cvetanovskaa
Implement Asynchronous PPO
feature request
New feature or request
#531
opened Jul 18, 2023 by
Dahoas
ppo using GLM2-6b as a backbone?
feature request
New feature or request
#523
opened Jul 17, 2023 by
fanxinyun1991
support base model + multi adapter for actor, critic, ref and reward model
feature request
New feature or request
#518
opened Jul 7, 2023 by
akk-123
Add support for safetensors
feature request
New feature or request
#505
opened Jun 14, 2023 by
glerzing