About the equal token number calculation

Thanks for the excellent work!

In the codes for equal-token-number calculation, is it correct to use the mean number of visual output tokens across all layers? I think it would be more appropriate to use the mean number of visual input tokens across layers instead. For instance, if pruning occurs after layer 0 (which reduces to N visual tokens), the original calculation method will record the token number in layer 0 as N, not the initial 576.

Thanks for the excellent work again!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

About the equal token number calculation #35

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

About the equal token number calculation #35

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions