model going over max context #982

@acoliver

What happened?

(main*) Context: 136,565/131,134 tokens | TPM: 45.00k | Wait: 0ms | 18:29:59
~/projects/llxprt/branch-1/llxprt-code [no sandbox (see /docs)] qwen3-next-80b-a3b-instruct-mlx | Tokens: 5,797,852 | ✖ 1 error (ctrl+o for details)

Compression was not triggered. It is just going over the limit and leaving lmstudio to handle it.

What did you expect to happen?

There is a compression ratio that should have triggered compression long ago. Also, there is a guard that is supposed to prevent the context from going way over the limit.
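
To make the expected behavior concrete, here is a minimal sketch of the two checks described above: a soft trigger that asks for compression once usage crosses a fraction of the window, and a hard guard that refuses to go past the limit. Every name here (`ContextPolicy`, `checkContext`, `compressionThreshold`) and the 0.8 value are assumptions for illustration, not taken from the llxprt-code source. Only the 136,565 and 131,134 token figures come from the status line in this report.

```typescript
// Hypothetical sketch: these names are illustrative and NOT from llxprt-code.
type ContextAction = "ok" | "compress" | "block";

interface ContextPolicy {
  contextLimit: number;         // model's maximum context window, in tokens
  compressionThreshold: number; // fraction of the limit that triggers compression (assumed value)
}

function checkContext(
  usedTokens: number,
  pendingTokens: number,
  policy: ContextPolicy,
): ContextAction {
  const projected = usedTokens + pendingTokens;
  // Hard guard: never let the projected context exceed the window.
  if (projected > policy.contextLimit) return "block";
  // Soft trigger: compress once usage crosses the configured fraction.
  if (projected >= policy.contextLimit * policy.compressionThreshold) return "compress";
  return "ok";
}

// Figures from the status line in this report; 0.8 is an assumed threshold.
const policy: ContextPolicy = { contextLimit: 131_134, compressionThreshold: 0.8 };
const overage = 136_565 - policy.contextLimit;

console.log(checkContext(100_000, 0, policy)); // below the threshold
console.log(checkContext(105_000, 0, policy)); // past the threshold, under the limit
console.log(checkContext(136_565, 0, policy)); // the state reported here
console.log(overage);                          // tokens over the limit
```

Under these assumptions the reported state is 5,431 tokens past the window, so both checks should have fired before the request was sent.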

Client information

v0.8.0-nightly.260102.9e1b7f4f6
Was running the lmstudio provider alias with the model listed above (qwen3-next-80b-a3b-instruct-mlx).

Login information

No response

Anything else we need to know?

No response
