I see: https://boydjohnson.dev/blog/concurrency-matrix-multiplication/ Note that rayon collect() preserves the original order. Thus, it is not necessary to use par_sort_by(). See https://dev.to/eblocha/parallel-matrix-multiplication-in-rust-39f6 Excuse me if I'm mistaken!