Skip to content

Conversation

@codelion
Copy link
Member

Added '-mlx-' to the list of MLX model patterns in should_use_mlx for broader matching. Reduced max_tokens from 32768 to 8192 in get_llm_response within eval_math500_benchmark.py to limit token usage.

codelion added 2 commits June 30, 2025 14:43
Added '-mlx-' to the list of MLX model patterns in should_use_mlx for broader matching. Reduced max_tokens from 32768 to 8192 in get_llm_response within eval_math500_benchmark.py to limit token usage.
Update version number in __init__.py and setup.py to 0.1.18 for new release.
@codelion codelion merged commit 50f5f7a into main Jun 30, 2025
1 check passed
@codelion codelion deleted the fix-mlx-model-id branch June 30, 2025 06:45
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants