
Try a stack-based DFS for eval #980

Merged: awni merged 3 commits into main from stack_dfs on Apr 11, 2024




@awni awni commented Apr 10, 2024

Benchmarks:

100 layer MNIST on the CPU (for maximum graph overhead):

Pre: 4.206 (s)
Post: 4.143 (s)

TPS for 4-bit Mistral is unaffected:

MLX_MAX_OPS_PER_BUFFER=50 python -m mlx_lm.generate --model mlx-community/NeuralBeagle14-7B-4bit-mlx --prompt "Write a story about Einstein" --temp 0.0 --max-tokens 256
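
For readers unfamiliar with the technique named in the title, here is a minimal sketch of a depth-first traversal that uses an explicit stack instead of recursion to order a graph's nodes so that each node comes after its inputs. It is an illustration only and not the code from this PR: the `Node` class and the `topo_order` and `deep_chain` functions are hypothetical names, and the real change lives in MLX's C++ eval path.

```python
class Node:
    """Hypothetical graph node: a name plus the nodes it depends on."""

    def __init__(self, name, inputs=None):
        self.name = name
        self.inputs = list(inputs) if inputs else []


def topo_order(root):
    """Return the nodes reachable from root, each after all of its inputs.

    Assumes the graph is acyclic. Uses an explicit stack, so the depth of
    the graph is limited by heap memory rather than by the call stack.
    """
    order = []
    visited = {id(root)}
    # Each stack entry is (node, index of the next input to look at).
    stack = [(root, 0)]
    while stack:
        node, i = stack[-1]
        if i < len(node.inputs):
            # Remember where to resume, then descend into the next input.
            stack[-1] = (node, i + 1)
            child = node.inputs[i]
            if id(child) not in visited:
                visited.add(id(child))
                stack.append((child, 0))
        else:
            # All inputs have been emitted, so the node itself can be emitted.
            stack.pop()
            order.append(node)
    return order


def deep_chain(depth):
    """Build a linear chain x0 <- x1 <- ... of the given length (depth >= 1)."""
    node = Node("x0")
    for k in range(1, depth):
        node = Node(f"x{k}", [node])
    return node
```

A recursive version of the same traversal would raise `RecursionError` on a chain deeper than Python's default recursion limit, whereas `topo_order(deep_chain(20_000))` completes. The analogous concern in compiled code is overflowing a small thread stack on a very deep graph.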

Awni Hannun added 2 commits April 10, 2024 14:23
@awni awni marked this pull request as ready for review April 10, 2024 23:16
@awni awni requested review from angeloskath and jagrit06 April 10, 2024 23:16

@angeloskath angeloskath left a comment


Perfectly done and needed for a while!

@awni awni merged commit 8580d99 into main Apr 11, 2024
@awni awni deleted the stack_dfs branch April 11, 2024 00:05
davidkoski added a commit to davidkoski/mlx-swift that referenced this pull request Apr 11, 2024
- this doesn't include any new API (coming later)
- includes two fixes important for smaller stack sizes (e.g. running on an async thread)
	- ml-explore/mlx#980
	- ml-explore/mlx#982
davidkoski added a commit to ml-explore/mlx-swift that referenced this pull request Apr 12, 2024
* update to pick up mlx v0.10.0

- this doesn't include any new API (coming later)
- includes two fixes important for smaller stack sizes (e.g. running on an async thread)
	- ml-explore/mlx#980
	- ml-explore/mlx#982