Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[WIP] More of CUDA operations #69

Merged
merged 40 commits into from Jul 20, 2019

Conversation

@raver119
Copy link

commented Jul 19, 2019

This PR adds bunch of CUDA ops implementations

raver119 and others added 30 commits Jul 12, 2019
initial commit
Signed-off-by: raver119 <raver119@gmail.com>
Yurii
- gruCell_bp further
Signed-off-by: Yurii <yurii@skymind.io>
Yurii
- further work on gruCell_bp
Signed-off-by: Yurii <yurii@skymind.io>
Implemented sequence_mask cuda helper. Eliminated waste printf with m…
…atrix_inverse implementation. Added proper tests.
Yurii
- further work on gruCell_bp (ff/cuda)
Signed-off-by: Yurii <yurii@skymind.io>
Yurii
comment one test for gruCell_bp
Signed-off-by: Yurii <yurii@skymind.io>
Yurii
- provide cuda static_rnn
Signed-off-by: Yurii <yurii@skymind.io>
Yurii
- provide cuda scatter_update
Signed-off-by: Yurii <yurii@skymind.io>
Yurii
- finally gruCell_bp is completed
Signed-off-by: Yurii <yurii@skymind.io>

@raver119 raver119 merged commit b382e13 into master Jul 20, 2019

@raver119 raver119 deleted the r119_cuda_operations branch Jul 20, 2019

AlexDBlack added a commit that referenced this pull request Jul 20, 2019
[WIP] More of CUDA operations (#69)
* initial commit

Signed-off-by: raver119 <raver119@gmail.com>

* - gruCell_bp further

Signed-off-by: Yurii <yurii@skymind.io>

* - further work on gruCell_bp

Signed-off-by: Yurii <yurii@skymind.io>

* Inverse matrix cublas implementation. Partial working revision.

* Separation of segment ops helpers. Max separation.

* Separated segment_min ops.

* Separation of segment_mean/sum/prod/sqrtN ops heleprs.

* Fixed diagonal processing with LUP decomposition.

* Modified inversion approach using current state of LU decomposition.

* Implementation of matrix_inverse op with cuda kernels. Working revision.

* Implemented sequence_mask cuda helper. Eliminated waste printf with matrix_inverse implementation. Added proper tests.

* - further work on gruCell_bp (ff/cuda)

Signed-off-by: Yurii <yurii@skymind.io>

* comment one test for gruCell_bp

Signed-off-by: Yurii <yurii@skymind.io>

* - provide cuda static_rnn

Signed-off-by: Yurii <yurii@skymind.io>

* Refactored random_shuffle op to use new random generator.

* Refactored random_shuffle op helper.

* Fixed debug tests with random ops tests.

* Implement random_shuffle op cuda kernel helper and tests.

* - provide cuda scatter_update

Signed-off-by: Yurii <yurii@skymind.io>

* Implementation of random_shuffle for linear case with cuda kernels and tests.

* Implemented random_shuffle with cuda kernels. Final revision.

* - finally gruCell_bp is completed

Signed-off-by: Yurii <yurii@skymind.io>

* Dropout op cuda helper implementation.

* Implemented dropout_bp cuda helper.

* Implemented alpha_dropout_bp with cuda kernel helpers.

* Refactored helper.

* Implementation of suppresion helper with cuda kernels.

* - provide cpu code fot hsvToRgb, rgbToHsv, adjustHue

Signed-off-by: Yurii <yurii@skymind.io>

* Using sort by value method.

* Implementation of image.non_max_suppression op cuda-based helper.

* - correcting and testing adjust_hue, adjust_saturation cpu/cuda code

Signed-off-by: Yurii <yurii@skymind.io>

* Added cuda device prefixes to declarations.

* Implementation of hashcode op with cuda helper. Initital revision.

* rnn cu impl removed

Signed-off-by: raver119 <raver119@gmail.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
2 participants
You can’t perform that action at this time.