Fine-tuning LLMs using conditional training to learn two human preferences. UCL Module Project: Statistical Natural Language Processing (COMP0087).
-
Updated
Aug 9, 2023 - Python
Fine-tuning LLMs using conditional training to learn two human preferences. UCL Module Project: Statistical Natural Language Processing (COMP0087).
Add a description, image, and links to the fine-tuning-nlp topic page so that developers can more easily learn about it.
To associate your repository with the fine-tuning-nlp topic, visit your repo's landing page and select "manage topics."