Feature Request: add mtmd functions to get vision image size and patch size

### Prerequisites

- [x] I am running the latest code. Mention the version if possible as well.
- [x] I carefully followed the [README.md](https://github.com/ggml-org/llama.cpp/blob/master/README.md).
- [x] I searched using keywords relevant to my issue to make sure that I am creating a new issue that is not already open (or closed).
- [x] I reviewed the [Discussions](https://github.com/ggml-org/llama.cpp/discussions), and have a new and useful enhancement to share.

### Feature Description

It would be very helpful to be able to determine the expected image size and batch size for vision models.

This information is already available, just not exposed via a convenient function the way that `mtmd_get_audio_bitrate` does.

I propose adding 2 new functions:

```c
// get vision image size in pixels, for example 1024
// return -1 if vision is not supported
MTMD_API int mtmd_get_vision_image_size(mtmd_context * ctx);

// get vision patch size, for example 14
// return -1 if vision is not supported
MTMD_API int mtmd_get_vision_patch_size(mtmd_context * ctx);
```

### Motivation

This will make it easier to do any image preprocessing before calling into the projector/model.

### Possible Implementation

```c
int mtmd_get_vision_image_size(mtmd_context * ctx) {
    if (!ctx->ctx_v) {
        return -1;
    }

    return clip_get_image_size(ctx->ctx_v);
}

int mtmd_get_vision_patch_size(mtmd_context * ctx) {
    if (!ctx->ctx_v) {
        return -1;
    }

    return clip_get_patch_size(ctx->ctx_v);
}
```

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Feature Request: add mtmd functions to get vision image size and patch size #16703

Prerequisites

Feature Description

Motivation

Possible Implementation

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Feature Request: add mtmd functions to get vision image size and patch size #16703

Description

Prerequisites

Feature Description

Motivation

Possible Implementation

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions