Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[WIP] reverse improvements #115

merged 4 commits into from Dec 5, 2019

[WIP] reverse improvements #115

merged 4 commits into from Dec 5, 2019


Copy link

raver119 commented Dec 5, 2019

This PR adds new CUDA kernel for Reverse op, in order to improve performance in certain cases.

raver119 added 4 commits Dec 5, 2019
Signed-off-by: raver119 <>
Signed-off-by: raver119 <>
Signed-off-by: raver119 <>
Signed-off-by: raver119 <>
@raver119 raver119 merged commit 355c6b6 into master Dec 5, 2019
@raver119 raver119 deleted the r119_cuda_reverse branch Dec 5, 2019
AlexDBlack pushed a commit that referenced this pull request Dec 10, 2019
* cleaned up bert iterator tests (#110)

Signed-off-by: eraly <>

* Various pre-release fixes (#111)

* Various fixes

Signed-off-by: AlexDBlack <>

* Fix default dtypes for MaxPoolWithArgmax

Signed-off-by: AlexDBlack <>

* Small pre-release tweak (#112)

* Log UI address on launch as in previous Play-based UI

Signed-off-by: AlexDBlack <>

* Logging level tweak for UI

Signed-off-by: AlexDBlack <>

* http not https

Signed-off-by: AlexDBlack <>

* datavec python ensure host (#113)

* ensure host

* one more host ensure

* info->debug

* [WIP] reverse improvements (#115)

* initial commit

Signed-off-by: raver119 <>

* reverse draft

Signed-off-by: raver119 <>

* reverse kernel

Signed-off-by: raver119 <>

* reverse kernel

Signed-off-by: raver119 <>

* 2 micro fixes

Signed-off-by: raver119 <>

* Shugeo resize fix5 (#102)

* Refactored resize images ops to use TF-like bool args as input.

* Refactored helpers for cpu implementation of resize_bilinear and resize_nearest_neighbor ops.

* Refactored cuda implementation for image.resize_bilinear and image.resize_nearest_neighbor ops helpers.

* Refactored nearest_neighbor resize op.

* Added a pair of tests for special case of resize_bilinear algorithm.

* Fixed issue with resize_bilinear op.

* Refactored cpu implementation for helpers with resize_nearest_neighbor op.

* Final fixed for resize ops to conform TF v.1.5

* Refactored cuda helpers for resize_neares_neighbor op.

* Fixed resize_bilinear to accept proper data.

* Fixed issue with non-float input for resize_bilinear op.

* Refactored cuda helper for resize_bilinear to proper process non-float inputs.

* Added tests for resize_bilinear to int inputs.

* Fixed ResizeBilinear wrapper

* Tests fixed

* Fixed float and bool constant to avoid overflow for some kind of compilers.

* Corrected float constants with float data type.

* Added f suffix for float constants.

* Corrected float constant to avoid overflow with initializing lists.

* Corrected float initializing list with float input.

* Corrected bool constant with initalizing list.

* Corrected float and bool values with initializing lists.

* Fixed wrong constant.

* Fixed issue with 1x1 input picture for resize.

* ResizeBilinear default values on import fix

Signed-off-by: raver119 <>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
None yet
1 participant
You can’t perform that action at this time.