[CVPR'24] RLHF-V: Towards Trustworthy MLLMs via Behavior Alignment from Fine-grained Correctional Human Feedback
Python · Updated Sep 11, 2024