Performance (prefill, decode, peak memory) is good and didn't regress for Llava and Llama for 1.0 release #15041

@mergennachin

Description

On CPU via XNNPACK
