Fix bug mps #203

codelion · 2025-06-24T02:21:58Z

Add support for mlx based inference on apple silicon devices

This reverts commit 8287454.

Introduces a _robust_mlx_generate method that attempts MLX text generation using several parameter combinations to handle different MLX-LM versions. Improves error handling and logging for easier debugging, and ensures token counting is robust to different response types.

Update __version__ in optillm/__init__.py and version in setup.py to 0.1.16 for a new release.

codelion added 5 commits June 17, 2025 16:00

fix mps

8287454

Revert "fix mps"

34b57c9

This reverts commit 8287454.

j

c50394e

Bump version to 0.1.16

c27a095

Update __version__ in optillm/__init__.py and version in setup.py to 0.1.16 for a new release.

codelion merged commit 2e4c0da into main Jun 24, 2025
1 check passed

codelion deleted the fix-bug-mps branch June 24, 2025 02:25

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fix bug mps #203

Fix bug mps #203

Uh oh!

codelion commented Jun 24, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Fix bug mps #203

Fix bug mps #203

Uh oh!

Conversation

codelion commented Jun 24, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants