ft[torch]: How can we exploit cpu/gpu-parallelization with fabrics. #130

Draft
wants to merge 9 commits into base: develop
Conversation

maxspahn
Copy link
Collaborator

The idea behind this PR is simple:

  1. Export the planner as native C code.
  2. Parse the C function into a numpy function using a custom translation.
  3. Enjoy parallelization with the generated numpy function.

A very similar approach should be applicable to torch. A sketch of step 1 is shown below.
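
For step 1, the export boils down to CasADi's built-in C code generation. A minimal sketch, with a placeholder function standing in for the real fabrics planner (the real one would come from the concretized planner):

```python
import casadi as ca

# Placeholder CasADi function; in fabrics this would be the planner's
# acceleration map obtained after concretization.
x = ca.SX.sym("x", 7)
xdot = ca.SX.sym("xdot", 7)
acc = -2.0 * x - 0.5 * xdot  # dummy dynamics, not the real planner
casadi_planner_function = ca.Function("planner", [x, xdot], [acc])

# Step 1: export as native C code (writes planner.c to disk).
casadi_planner_function.generate("planner.c")
```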

@saraybakker1 @AndreuMatoses

@saraybakker1
Copy link
Collaborator

Thanks @maxspahn, will have a look at it! :)

@AndreuMatoses
Copy link
Member

Thanks, I will have a look, too!

@maxspahn maxspahn marked this pull request as draft April 15, 2024 10:28
@maxspahn
Copy link
Collaborator Author

maxspahn commented Apr 18, 2024

Adds a simple comparison between looping and numpy parallelization.
It turns out that for 100 samples, you get a speed-up of around 120x for the
planner from the panda.py example.
@AndreuMatoses
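
For reference, a minimal sketch of the kind of comparison meant here. Both planner handles are trivial stand-ins so the snippet runs on its own; the real ones are the looped CasADi function and the generated batched numpy function from this PR:

```python
import time
import numpy as np

# Stand-ins for the real functions (same math, different call convention).
def planner_casadi(q, qdot):       # evaluates a single sample
    return -2.0 * q - 0.5 * qdot

def planner_np(q, qdot):           # evaluates a whole (N, dof) batch at once
    return -2.0 * q - 0.5 * qdot

N, dof = 100, 7
q, qdot = np.random.rand(N, dof), np.random.rand(N, dof)

# Looped evaluation: one call per sample.
t0 = time.perf_counter()
acc_loop = np.stack([planner_casadi(q[i], qdot[i]) for i in range(N)])
t_loop = time.perf_counter() - t0

# Batched evaluation: one call for all samples.
t0 = time.perf_counter()
acc_batch = planner_np(q, qdot)
t_batch = time.perf_counter() - t0

print(f"loop: {t_loop:.5f}s  batched: {t_batch:.5f}s")
```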

@AndreuMatoses
Copy link
Member

Okay, a 120x increase is indeed relevant. I will see if I can implement it for my case and potentially try to port it to torch. Thanks!

@maxspahn
Copy link
Collaborator Author

Also, the speed-up scales with the number of samples, so for 1000 environments, the speed-up is even bigger. But I wasn't patient enough to wait for the result :D

@AndreuMatoses
Copy link
Member

I have added the translator from the .c function to torch code. I also added some examples using the dingo+kinova setup and cubic obstacles.
CAREFUL: the generated torch code may have an issue when using torch.fmax(a, b) if, for example, b is a float and not a tensor. This seems to always happen because one of the first variables CasADi declares is something like a1 = 0.000, and then a2 is always used in the fmax functions. For now, I just changed the variable by hand in the resulting Python script to a1 = torch.tensor(0.0, device='cuda:0'), but this should be done in a more consistent way if someone wants to use this for any problem.
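
A minimal sketch of the issue and of the manual fix described above (the device string and variable name are just examples, not taken from the generated script):

```python
import torch

device = "cuda:0" if torch.cuda.is_available() else "cpu"
x = torch.rand(1000, device=device)

# Problem: the generated code declares plain float constants ...
a1 = 0.000
# ... which torch.fmax rejects, since both arguments must be tensors:
# torch.fmax(a1, x)  # raises a TypeError

# Manual fix applied in the generated script: wrap the constant as a tensor
# on the same device as the batched inputs.
a1 = torch.tensor(0.0, device=device)
y = torch.fmax(a1, x)  # works, broadcasts the scalar over the batch
```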

@AndreuMatoses
Copy link
Member

To get an idea of the difference in performance between the different options (casadi function, parallelized numpy function, parallelized torch function), check these computation times for the dinova example. Take them just as a reference, as the performance could change depending on many factors.

Casadi (looped) vs Numpy

[Plots: computation_time, average_computation_time_per_N]

Numpy vs Torch

[Plots: computation_time_torchVSnp, average_computation_time_torchVSnp_per_N]

Conclusion:

For fewer than ~100 N, looping the casadi function is best. Between 100 and 10k N, numpy is better as it has less overhead than torch. Above 10k N, torch becomes better, especially for very large N, as the computation time basically stays around 300 ms as long as you have enough VRAM.
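
A small sketch of how one could dispatch between the three options based on these rough crossover points. All three planner handles are hypothetical stand-ins (here trivial lambdas so the snippet runs), not part of this PR:

```python
import numpy as np

# Stand-ins for the three variants discussed above.
planner_casadi_loop = lambda q, qdot: np.stack([-q[i] for i in range(q.shape[0])])
planner_np = lambda q, qdot: -q
planner_torch = lambda q, qdot: -q  # real version would move data to the GPU

def evaluate_planner(q, qdot):
    """Pick the backend from the batch size N, using the crossover points above."""
    n = q.shape[0]
    if n < 100:          # small batches: looping the casadi function wins
        return planner_casadi_loop(q, qdot)
    elif n < 10_000:     # mid-range: numpy has less overhead than torch
        return planner_np(q, qdot)
    else:                # very large N: torch on the GPU stays roughly constant
        return planner_torch(q, qdot)
```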
