Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[feature request] - implementing a pd external in cuda using c++ #1866

Closed
ghost opened this issue Jan 11, 2023 · 4 comments
Closed

[feature request] - implementing a pd external in cuda using c++ #1866

ghost opened this issue Jan 11, 2023 · 4 comments
Labels
invalid not a bug/feature request/... wontfix nobody wants to spend their volunteer time to fix/implement this

Comments

@ghost
Copy link

ghost commented Jan 11, 2023

what if this could be the starting point to create a new pd external using cuda and c++:

@umlaeute umlaeute added the wontfix nobody wants to spend their volunteer time to fix/implement this label Jan 11, 2023
@umlaeute
Copy link
Contributor

umlaeute commented Jan 11, 2023

it would be cool, but i don't see what can be done on the Pd side.

specifically, i doubt that @millerpuckette would add glue to natively support some proprietary acceleration framework (however fast), but of course that is his decision.

but that shouldn't keep you from creating your own framework (if that is required at all to create CUDA-based externals).

for now i'm going to close this a wontfix 😞

@umlaeute
Copy link
Contributor

and just to add. back in the days (when the first CUDA-like devices came out), it was accepted knowledge that while the processing itself is super fast, there's a bottleneck when it comes to transferring data between main memory and GPU memory. this might not be a problem if you work with large amounts of data (as the throughput is high), but in Pd we have small blocks that should return fast (and unfortunately the latency is high as well).

i don't think that things have changed considerably, as shown in this article from 2021:

GPUs are a latency hiding device, so the naive approach of transferring 4MB files over PCIe from the CPU to the GPU for further processing would cripple workflow performance.

@ghost
Copy link
Author

ghost commented Jan 11, 2023

this could be interesting for instance for an intelligent counterpoint based harmonizer, or some control voltage crazy behaviour emulation

@Spacechild1
Copy link
Contributor

Spacechild1 commented Jan 11, 2023

This is AI generated code without any concrete purpose, copy pasted and posted as a "feature request" to three different repositories: (here, csound/csound#1688, supercollider/supercollider#5947).

Please don't do this.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
invalid not a bug/feature request/... wontfix nobody wants to spend their volunteer time to fix/implement this
Projects
None yet
Development

No branches or pull requests

2 participants