Skip to content
This repository was archived by the owner on May 3, 2024. It is now read-only.

Conversation

@dagamayank
Copy link
Contributor

This fix enables caffe to use MIOpen for MAX_POOLING. This was disabled for some reason which cuDNN does not / did not support. Looking at NVIDIA's github itself makes me believe that they have fixed this limitation.

Highly recommend to run training with this fix to ensure this does not manifest in accuracy issues. I am suspecting all this while we were not using MIOpen for pooling in our training runs.

cc\ @ashishfarmer This might also be the reason for the discrepancy you saw between ocl and hip caffe for pooling.

@ashishfarmer
Copy link

@dagamayank - Yes, I noticed that cuDNN disables MAX pool, and after digging in a bit, this is what I found:
BVLC/caffe#3574
BVLC/caffe#2015

Looks like it was added as a workaround for indexing for in place computation in a layer after max pooling. I am still not sure why this is an issue or would affect miopen pool. We need some testing, especially training runs on the scenerio mentioned in the original issue.

I am not sure if cuDNN fixed that problem, the date of the work-around to turn off max pooling is from Jan 2016.

@parallelo
Copy link
Contributor

Thank you for the PR, Mayank. We'll take a look at training accuracy with these mods.

@parallelo
Copy link
Contributor

A slightly modified version of this PR has been integrated into the 'tuning' branch. Closing. Thanks!

@parallelo parallelo closed this Jun 22, 2017
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants