Skip to content

Eval bug: image encode time slow on mobile device #11856

@perp

Description

@perp

Name and Version

Model:NexaAIDev/OmniVLM-968M
In function clip_image_batch_encode, it takes about 18 seconds to encode the image? far more beyond the llm part ,it task about 1.5 seconds.
ggml_compute_forward compute the model graph about 1102 nodes, how to accelerate?
Deivice: Android, SamSung S23

Image

Operating systems

Other? (Please let us know in description)

GGML backends

CPU

Hardware

Android, SamSung S23

Models

Model:NexaAIDev/OmniVLM-968M

Problem description & steps to reproduce

load time ,encode image too long

First Bad Commit

Image

Relevant log output

no log

Metadata

Metadata

Assignees

No one assigned

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions