feat(multithread): add rayon & par_iter for multithreaded execution #180

NishantJoshi00 · 2023-01-26T10:31:42Z

Description

This is a feature-gated implementation of multi-threading for this project. This utilizes rayon for performing multi-threading.

This PR is raised against #128.

Reasoning

Following are some of the decisions taken while building the solution. Feel free to comment on any of them, as I might have missed something during the process.

Reason for using #[cfg(...)]

In the current implementation of the code, all the smart pointers used aren't thread-safe. This being the case, thread-safety wasn't a requirement for the implementation as these pointers are much more performant than their thread-safe alternatives Rc<T> -> Arc<T>. While some might require multithreading others might want to go with the legacy implementation and I all the smart pointers are made to be thread-safe this could hamper the performance of the legacy implementation. This is avoided by using #[cfg(... gates on variables and arguments.

Reason for TargetDirPool

Adding multi-threading to the entire implementation isn't difficult the main problem is to tackle the locking mechanism that cargo has in place. In order to do so multiple target directories were created and they were assigned for execution, as well as reused to take advantage of caching

Reason for rayon

While using the traditional multi-threading model was always into consideration, rayon had some added benefits. Being lightweight and guaranteeing data-race freedom, with the added benefit of having great abstraction.

Ignored tests: 2

Here I have ignored 2 test cases as the order of the output was not what was expected in the assertion. I wasn't able to implement tests to replace them. Please, feel free to modify them as necessary

…nto multithreaded

taiki-e · 2023-01-29T13:33:39Z

Thanks for the PR!

I would prefer to make this an option that can be opt-in with an environment variable or CLI option, rather than a feature flag. If the user needs to reinstall cargo-hack to try parallelized cargo-hack, that would be a pain.
What does the output from parallelized cargo-hack look like? I'm concerned that if warnings/errors from different threads are alternately displayed, the messages may not be readable to the user.

NishantJoshi00 · 2023-02-01T17:35:00Z

I agree with what you are presenting, I actually started working on the second issue that you suggested, a little while before raising this PR, but sadly I wasn't able to include those changes in this PR at the moment.

However, I am working on it and will add those changes to this branch. Speaking about the first issue that you mentioned, I was also trying to achieve the same, but to achieve the current performance on the binary, some of the functionality used cannot support multi-threading, I do acknowledge that this tradeoff had to be made but, wasn't able to test how much a performance bottleneck is caused by using Atomically safe wrappers.

I'll try to benchmark it in the meantime, but theoretically, that will cause some performance bottleneck if the application is not running on the multithreaded mode as that is the intention of that change. Still, I'll try to find other ways to achieve something similar or to find a workaround for the feature-gated solution.
Thanks for the feedback

taiki-e · 2023-02-05T06:02:30Z

performance bottleneck

I haven't measured this, but

As for Rc -> Arc, the number of clones is not very large and I think it should not cause problems.
As for the TargetDirPool, that's not a problem, since we don't need to access it if parallelization is not enabled.
Mutex for Progress may cause a performance bottleneck because the lock is held even during outputting commands.
However, the only field that changes after parallelization in Progress is count, so instead of using Mutex entire Progress, using Arc<AtomicUsize> on count and using fetch_add(1, Relaxed) to increment counter should probably be enough to lower its cost.

Pat-Lafon · 2023-04-28T20:17:25Z

Hello! I was wondering about this kind of functionality and came across this pr. I'm curious about it's status? I'm sure it would be super useful for projects with larger sets of features(where cargo-hack is needed the most!).

NishantJoshi00 · 2023-05-04T05:38:46Z

Hey @Pat-Lafon, I wasn't able to pick up on the comments left by @taiki-e, but I am planning to pick it up as soon as possible. I was able to come up with a solution to unify the logs that the application will emit. But, haven't implemented it in this branch just yet. Will try to pick it up in the very near future.

NishantJoshi00 added 5 commits December 16, 2022 00:25

feat(multithread): add rayon & par_iter for multithreaded execution

5d6dac6

Merge branch 'main' of https://github.com/NishantJoshi00/cargo-hack i…

ca9e9fb

…nto multithreaded

feat: make multithread as a feature

74009a2

Merge branch 'main' of https://github.com/NishantJoshi00/cargo-hack i…

ed19786

…nto multithreaded

chore: post merge cleanup

c593e32

Pat-Lafon mentioned this pull request Apr 30, 2023

Move cargo hack to it's own workflow and cleanup lalrpop/lalrpop#794

Merged

taiki-e force-pushed the main branch from d6d9c0a to 9e29733 Compare October 10, 2023 14:15

Pat-Lafon mentioned this pull request Apr 5, 2024

Multithreaded2 #248

Open

ryoqun mentioned this pull request Jul 16, 2024

Add --partition M/N to distribute builds #253

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(multithread): add rayon & par_iter for multithreaded execution #180

feat(multithread): add rayon & par_iter for multithreaded execution #180

NishantJoshi00 commented Jan 26, 2023

taiki-e commented Jan 29, 2023

NishantJoshi00 commented Feb 1, 2023 •

edited

Loading

taiki-e commented Feb 5, 2023

Pat-Lafon commented Apr 28, 2023

NishantJoshi00 commented May 4, 2023

feat(multithread): add rayon & par_iter for multithreaded execution #180

Are you sure you want to change the base?

feat(multithread): add rayon & par_iter for multithreaded execution #180

Conversation

NishantJoshi00 commented Jan 26, 2023

Description

Reasoning

taiki-e commented Jan 29, 2023

NishantJoshi00 commented Feb 1, 2023 • edited Loading

taiki-e commented Feb 5, 2023

Pat-Lafon commented Apr 28, 2023

NishantJoshi00 commented May 4, 2023

NishantJoshi00 commented Feb 1, 2023 •

edited

Loading