OpenGVLab/Ask-Anything

🦜 Ask-Anything [Paper]

Demos:

  β€’ Open in OpenXLab
  β€’ Open in Spaces: VideoChat-7B-8Bit, an end-to-end chatbot for video and image.
  β€’ VideoChat2-7B, an end-to-end chatbot for video and image.

README in Chinese and Chinese discussion group | Paper

πŸš€: We have updated video_chat with instruction tuning for video and image chatting! Find the details here. We release the instruction data at InternVideo. The old version of video_chat has moved to video_chat_with_chatGPT.

⭐️: We are also working on an updated version, stay tuned!

Demo videos: english.mp4, intro.mp4

πŸ”₯ Updates

  • 2024/05/22: πŸ“’ We release VideoChat2_mistral, which shows better capacity on diverse tasks (60.4% on MVBench, 78.6% on NExT-QA, 63.8% on STAR, 46.4% on TVQA, 54.4% on EgoSchema-full and 80.5% on IntentQA). More details will be updated in the paper. Have a try! πŸƒπŸ»β€β™€οΈπŸƒπŸ»

  • 2024/04/05 MVBench is selected as Poster (Highlight)!

  • 2024/2/27 MVBench is accepted by CVPR2024.

  • 2023/11/29 VideoChat2 and MVBench are released.

  • 2023/05/11 End-to-end VideoChat and its technical report.

    • VideoChat: Instruction tuning for video chatting (also supports image one).
    • Paper: We present how we craft VideoChat with two versions (via text and embed) along with some discussions on its background, applications, and more.
  • 2023/04/25 Watch videos longer than one minute with chatGPT

  • 2023/04/21 Chat with MOSS

  • 2023/04/20: Chat with StableLM

  • 2023/04/19: Code release & Online Demo

πŸ”¨ Getting Started

Build video chat with:
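The concrete setup steps live in each subproject's own README. As a rough sketch only (the repository URL is real, but the `video_chat2` subdirectory name, the `requirements.txt` file, and the `demo.py` entry point are assumptions, not confirmed by this README), a typical local setup might look like:

```shell
# Hypothetical setup sketch -- consult the subproject's README for the actual steps.
git clone https://github.com/OpenGVLab/Ask-Anything.git
cd Ask-Anything/video_chat2        # assumed subdirectory for VideoChat2
pip install -r requirements.txt    # assumed dependency file
python demo.py                     # assumed entry point for the local Gradio demo
```

Each subproject (video_chat, video_chat2, video_chat_with_chatGPT) ships its own dependencies and demo script, so follow the corresponding directory's instructions rather than this sketch.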

πŸ“„ Citation

If you find this project useful in your research, please consider citing it:

@article{2023videochat,
  title={VideoChat: Chat-Centric Video Understanding},
  author={Li, Kunchang and He, Yinan and Wang, Yi and Li, Yizhuo and Wang, Wenhai and Luo, Ping and Wang, Yali and Wang, Limin and Qiao, Yu},
  journal={arXiv preprint arXiv:2305.06355},
  year={2023}
}

⏳ Ongoing

Our team continually works on general video understanding and long-term video reasoning, including:

  • Strong video foundation model.
  • Video-text dataset and video reasoning benchmark.
  • Video-language system with LLMs.
  • Artificial Intelligence Generated Content (AIGC) for Video.
  • ...

🌀️ Discussion Group

If you have any questions during trial, running, or deployment, or if you have ideas or suggestions for the project, feel free to join our WeChat discussion group!


We are hiring researchers, engineers and interns in General Vision Group, Shanghai AI Lab. If you are interested in working with us, please contact Yi Wang (wangyi@pjlab.org.cn).