Conversation

@bluebread
Make sure to read the contributing guidelines before submitting a PR

@sfallah
Some people might be interested in tiny/small modes, which compress hundreds of tokens into only 64 or 100 tokens (gundam mode generates at least 256 tokens). The current implementation adds a crop_mode hyperparameter and automatically switches to the closest resolution based on the given image size. I don't know whether we're allowed to add a new CLI argument, but I think that would be a better way to implement this feature.
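
For illustration, here is a minimal sketch of what "switch to the closest resolution" could look like. It is not the code from this PR. The mode table, the side lengths (512/640/1024), the "base" entry, and the `pick_mode` helper are all assumptions; only the 64 and 100 token counts for tiny/small come from the comment above. It also assumes "closest" is measured on the longer image side, which may differ from the actual implementation.

```python
# Hypothetical mode table: (name, square side in pixels, vision tokens).
# The 64/100 token counts are from the comment; side lengths and the
# "base" row are assumptions made for this sketch.
MODES = [
    ("tiny", 512, 64),
    ("small", 640, 100),
    ("base", 1024, 256),
]


def pick_mode(width, height):
    """Return the mode whose side length is closest to the image's longer side.

    On an exact tie, the earlier (smaller) mode in MODES wins, because
    min() keeps the first minimal element.
    """
    longest = max(width, height)
    return min(MODES, key=lambda mode: abs(mode[1] - longest))


if __name__ == "__main__":
    for size in [(500, 400), (700, 600), (2000, 1000)]:
        name, side, tokens = pick_mode(*size)
        print(f"{size[0]}x{size[1]} -> {name} ({side}px, {tokens} tokens)")
```

An explicit CLI argument, as suggested above, would replace the `pick_mode` heuristic with a user-selected row of the same table.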

@bluebread changed the base branch from master to sf/deepseek-ocr November 22, 2025 16:06
@sfallah merged commit 7941f5d into sfallah:sf/deepseek-ocr Nov 23, 2025