Model EVO-2 takes a long time for generating moderately long sequences with Vortex (e.g. 16,384)

Model take a long time for generating moderately long sequences. How can I make use of multiple GPUs for this case?
`from torch.nn.parallel import DistributedDataParallel` also didn't help.

12 Minutes for generating 16,384 long sequence.

Is there any way to improve this using multiple GPUs?

<img width="1201" height="736" alt="Image" src="https://github.com/user-attachments/assets/159e2927-3140-4ff7-9d2b-5557ccf5493c" />

<img width="949" height="666" alt="Image" src="https://github.com/user-attachments/assets/26634946-8b63-4625-a926-e67a34e3982c" />


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Model EVO-2 takes a long time for generating moderately long sequences with Vortex (e.g. 16,384) #69

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Model EVO-2 takes a long time for generating moderately long sequences with Vortex (e.g. 16,384) #69

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions