New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Try new rand(Float32) in kernel #7
Comments
@maleadt Just to let you know: I just tried the implementation of |
That's good, although the memory usage of the new implementation will be (much) larger for now. I plan to revisit it before releasing CUDA 3.0 though. The performance difference with 1.5 is a problem, would be good if you could reduce to something I can have a look at 🙂 |
Yes, I will look into that and provide a minimal example! |
Awesome, thanks. |
Still not fully functional, see JuliaGPU/CUDA.jl#788 (comment) |
This is
rand
fromRandom
, made compatible by redefiningdefault_rng
on the device and using our own RNG. So yes, this is definitely the path forward, but we'll obviously have to flesh out the implementation by making sure the necessary APIs are GPU compatible and either overriding more calls or extending the RNG.Originally posted by @maleadt in JuliaGPU/CUDA.jl#772 (comment)
The text was updated successfully, but these errors were encountered: