Do you plan to release training code? #1
We will release our model soon. You can also train your own model with MMDU yourself. The training code depends on which model you are using.
MMDU can be applied to various LVLMs.
Haha, we are preparing to do this.
The max image number is 20 in MMDU. In fact, if I use llama3-clip-l14-336 (max token length is 8k), I think I need to use token compression. Have you done any research in this area?
One of the purposes of MMDU-45k is to enhance the dialogue capabilities of LVLMs in long multi-modal contexts involving text and images. The maximum token length for MMDU-45k is 17k. During the finetuning of the model, we generally use lengths of 16k or 32k to train the model, without considering the issue of token compression.
The main data length distribution of MMDU-45k and the MMDU benchmark is around 8k. Therefore, using MMDU-45k to finetune an 8k-LVLM is also feasible. |
I tried fine-tuning clip_l14_336-llama3-8b using MMDU, and even with a batch size of 1, it still runs out of memory on an 80GB A100.
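A back-of-envelope calculation suggests why an 80GB card runs out of memory here even at batch size 1. The figures below are assumptions for illustration, not from this thread: Llama-3-8B-scale dimensions (8B parameters, 32 layers, 32 attention heads), standard mixed-precision AdamW, and a naive (non-FlashAttention) attention implementation at the 17k max sequence length mentioned above.

```python
# Rough memory estimate for full fine-tuning at long context.
# All model dimensions below are assumed (Llama-3-8B-scale), for illustration only.

PARAMS = 8e9          # assumed parameter count
SEQ = 17_000          # max token length of MMDU-45k (from the thread)
HEADS = 32            # assumed number of attention heads
GIB = 2**30

# Mixed-precision AdamW: fp16 weights (2 B) + fp32 master copy (4 B)
# + fp32 first moment (4 B) + fp32 second moment (4 B) per parameter.
bytes_weights_opt = PARAMS * (2 + 4 + 4 + 4)
print(f"weights + optimizer states: {bytes_weights_opt / GIB:.0f} GiB")

# Naive attention materializes a (seq x seq) fp16 score matrix per head,
# per layer, before gradients or other activations are even counted.
attn_scores = SEQ * SEQ * HEADS * 2
print(f"naive attention scores, one layer: {attn_scores / GIB:.1f} GiB")
```

Under these assumptions, the optimizer states alone exceed 80 GiB, so full fine-tuning at 16k+ context on a single A100 would plausibly need some combination of FlashAttention, gradient checkpointing, ZeRO/FSDP sharding across GPUs, or parameter-efficient methods such as LoRA; the thread does not say which of these the authors used.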
No description provided.