@ggerganov
Member

ref #17211 (comment)

std::function-less std::sort

| Backend | GGML op | Op parameters | Bandwidth (GB/s) master | Bandwidth (GB/s) gg/ggml-argsort-better | Speedup |
| --- | --- | --- | --- | --- | --- |
| CPU | ARGSORT | type=f32,ne=[1024,1,1,1],order=0 | 0.17 | 0.59 | 3.58 |
| CPU | ARGSORT | type=f32,ne=[1024,1,1,1],order=1 | 0.24 | 0.71 | 2.95 |
| CPU | ARGSORT | type=f32,ne=[16,10,10,10],order=0 | 5.16 | 10.07 | 1.95 |
| CPU | ARGSORT | type=f32,ne=[16,10,10,10],order=1 | 5.46 | 7.40 | 1.36 |
| CPU | ARGSORT | type=f32,ne=[16384,1,1,1],order=0 | 0.09 | 0.12 | 1.36 |
| CPU | ARGSORT | type=f32,ne=[16384,1,1,1],order=1 | 0.09 | 0.13 | 1.43 |
| CPU | ARGSORT | type=f32,ne=[2,8,8192,1],order=0 | 1.51 | 7.02 | 4.65 |
| CPU | ARGSORT | type=f32,ne=[2,8,8192,1],order=1 | 1.45 | 5.50 | 3.79 |
| CPU | ARGSORT | type=f32,ne=[60,10,10,10],order=0 | 3.14 | 5.21 | 1.66 |
| CPU | ARGSORT | type=f32,ne=[60,10,10,10],order=1 | 3.33 | 6.13 | 1.84 |
| CPU | ARGSORT | type=f32,ne=[8,1,1,1],order=0 | 0.02 | 0.02 | 0.99 |
| CPU | ARGSORT | type=f32,ne=[8,1,1,1],order=1 | 0.02 | 0.02 | 0.84 |
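
For readers without the diff at hand, here is a rough sketch of what a "std::function-less" argsort could look like. It is not the code from this branch. The names `cmp_asc`, `cmp_desc` and `argsort_row` are hypothetical, and the assumption is only that the comparator is passed to `std::sort` as a concrete functor type, so the compiler can inline the comparison instead of calling through a type-erased `std::function`.

```cpp
#include <algorithm>
#include <cstdint>
#include <numeric>
#include <vector>

// Hypothetical comparator functors (not ggml identifiers). Each has a
// concrete type, so std::sort is instantiated per comparator and the
// comparison can be inlined.
struct cmp_asc {
    const float * data;
    bool operator()(int32_t a, int32_t b) const { return data[a] < data[b]; }
};

struct cmp_desc {
    const float * data;
    bool operator()(int32_t a, int32_t b) const { return data[a] > data[b]; }
};

// Returns the indices that sort one row of n floats.
// The branch on the order happens once, outside the sort, rather than
// storing the comparator in a std::function and dispatching per comparison.
static std::vector<int32_t> argsort_row(const float * data, int32_t n, bool descending) {
    std::vector<int32_t> idx(n);
    std::iota(idx.begin(), idx.end(), 0); // 0, 1, ..., n-1

    if (descending) {
        std::sort(idx.begin(), idx.end(), cmp_desc{data});
    } else {
        std::sort(idx.begin(), idx.end(), cmp_asc{data});
    }
    return idx;
}
```

Calling `argsort_row(row, n, false)` on each row of a tensor would give the per-row index order. Note that `std::sort` is not stable, so the relative order of equal elements is unspecified.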

@ggerganov ggerganov requested a review from slaren as a code owner November 13, 2025 06:43
@github-actions github-actions bot added the ggml changes relating to the ggml tensor library for machine learning label Nov 13, 2025
@ggerganov
Member Author

Didn't see that #17222 is already open

@ggerganov ggerganov closed this Nov 13, 2025
@ggerganov ggerganov deleted the gg/ggml-argsort-better branch November 13, 2025 09:27