Yolov_LLM Video Object Detection (Yolov5) and Multimodal Vision Language Model (Llava 13b) HuggingFace Hub uploaded a fine-tuned model checkpoints- https://huggingface.co/AgamP/LLM_Custom_1/tree/main The proposal, Project report and summary is available above, follow to understand the context.