mllm
Here are 50 public repositories matching this topic...
This is the official implementation (code, data) of the paper "MOSSBench: Is Your Multimodal Language Model Oversensitive to Safe Queries?""
-
Updated
Jul 5, 2024 - JavaScript
Awesome list for attacks on large language models.
-
Updated
Mar 1, 2024
使用LLaMA-Factory微调多模态大语言模型的示例代码 Demo of Finetuning Multimodal LLM with LLaMA-Factory
-
Updated
Jun 28, 2024 - Python
MOSSBench: A webpage for an oversensitivity benchmark
-
Updated
Jul 5, 2024 - JavaScript
Composition of Multimodal Language Models From Scratch
-
Updated
Jul 2, 2024 - Jupyter Notebook
Code for the MultipanelVQA benchmark "Muffin or Chihuahua? Challenging Large Vision-Language Models with Multipanel VQA"
-
Updated
Apr 11, 2024 - Jupyter Notebook
Official implementation of Leveraging Visual Tokens for Extended Text Contexts in Multi-Modal Learning
-
Updated
Jun 6, 2024 - Python
Datasets, case studies and benchmarks for extracting structured information from PDFs, HTML files or images, created by the Parsee.ai team. Datasets also on Hugging Face: https://huggingface.co/parsee-ai
-
Updated
May 15, 2024 - Jupyter Notebook
AUITestAgent is the first automatic, natural language-driven GUI testing tool for mobile apps, capable of fully automating the entire process of GUI interaction and function verification.
-
Updated
Jul 16, 2024
MIKO: Multimodal Intention Knowledge Distillation from Large Language Models for Social-Media Commonsense Discover
-
Updated
Mar 5, 2024 - Python
Open Source + Multilingual MLLM + Fine-tuning + Distillation + More efficient models and learning + ?
-
Updated
Mar 27, 2023 - C
This repository includes the official implementation of our paper "Sight Beyond Text: Multi-Modal Training Enhances LLMs in Truthfulness and Ethics"
-
Updated
Sep 15, 2023 - Python
A Video Chat Agent with Temporal Prior
-
Updated
Feb 28, 2024 - Python
中文医学多模态大模型 Large Chinese Language-and-Vision Assistant for BioMedicine
-
Updated
May 22, 2024 - Python
Evaluation framework for paper "VisualWebBench: How Far Have Multimodal LLMs Evolved in Web Page Understanding and Grounding?"
-
Updated
May 31, 2024 - Python
Undergraduate Dissertation of Guilin University of Electronic Technology
-
Updated
May 24, 2024 - Python
Improve this page
Add a description, image, and links to the mllm topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the mllm topic, visit your repo's landing page and select "manage topics."