This repository has been archived by the owner on May 3, 2024. It is now read-only.

enabled miopen pooling #6

Closed
wants to merge 1 commit into from

Conversation

dagamayank
Contributor

This fix enables Caffe to use MIOpen for MAX pooling. MIOpen was disabled for this case, apparently because of a limitation that cuDNN has, or used to have, with max pooling. Judging from NVIDIA's GitHub repository, I believe they have since fixed this limitation.

I highly recommend running training with this fix to make sure it does not cause accuracy issues. I suspect that all this time we have not been using MIOpen for pooling in our training runs.

cc @ashishfarmer: this might also be the reason for the discrepancy you saw between OCL and HIP Caffe for pooling.

@ashishfarmer

@dagamayank - Yes, I noticed that MAX pooling is disabled on the cuDNN path, and after digging in a bit, this is what I found:
BVLC/caffe#3574
BVLC/caffe#2015

It looks like this was added as a workaround for an indexing problem that arises when a layer after max pooling computes in place. I am still not sure why this is an issue or whether it would affect MIOpen pooling. We need some testing, especially training runs on the scenario mentioned in the original issue.

I am not sure whether cuDNN has fixed that problem; the workaround that turns off max pooling dates from Jan 2016.
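
To make the in-place concern concrete, here is a minimal pure-Python sketch. It is not Caffe, cuDNN, or MIOpen code. It assumes a pooling backward pass that recovers the max location by comparing the input with the pooled output, and contrasts it with a backward pass that uses an index mask saved during the forward pass. Whether MIOpen behaves like either variant is not established in this thread. All function names and the in-place scaling step are hypothetical.

```python
def max_pool_1d(x, window):
    """Non-overlapping 1-D max pooling; returns (pooled values, argmax indices)."""
    tops, mask = [], []
    for start in range(0, len(x) - window + 1, window):
        seg = x[start:start + window]
        best = max(range(window), key=lambda i: seg[i])
        tops.append(seg[best])
        mask.append(start + best)
    return tops, mask


def backward_with_mask(top_diff, mask, n):
    """Route each gradient to the index saved during the forward pass."""
    bottom_diff = [0.0] * n
    for grad, idx in zip(top_diff, mask):
        bottom_diff[idx] += grad
    return bottom_diff


def backward_by_comparison(x, top, top_diff, window):
    """Assumed alternative: find the max location by matching x against top."""
    bottom_diff = [0.0] * len(x)
    for j, (y, grad) in enumerate(zip(top, top_diff)):
        for i in range(j * window, (j + 1) * window):
            if x[i] == y:
                bottom_diff[i] += grad
                break
    return bottom_diff


x = [-3.0, -1.0, 2.0, 0.5]
top, mask = max_pool_1d(x, 2)
top_diff = [1.0, 1.0]

# Hypothetical in-place layer after pooling: it overwrites the pooled output.
overwritten_top = [2.0 * v for v in top]

grad_mask = backward_with_mask(top_diff, mask, len(x))
grad_clean = backward_by_comparison(x, top, top_diff, 2)
grad_broken = backward_by_comparison(x, overwritten_top, top_diff, 2)

print(grad_mask, grad_clean, grad_broken)
```

In this sketch the mask-based backward and the comparison-based backward agree while the pooled output is intact, but once the output has been overwritten the comparison finds no match and the gradient is silently dropped. A training run with an in-place layer directly after max pooling would be the scenario to test.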

@parallelo
Contributor

Thank you for the PR, Mayank. We'll take a look at training accuracy with these mods.

@parallelo
Contributor

A slightly modified version of this PR has been integrated into the 'tuning' branch. Closing. Thanks!

@parallelo parallelo closed this Jun 22, 2017

3 participants