On UI, basically just add an upload image button in the chat box. On backend side, need to integrate with multimodal models, e.g. LLaVa, MiniGPT4.
On UI, basically just add an upload image button in the chat box. On backend side, need to integrate with multimodal models, e.g. LLaVa, MiniGPT4.