Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

fast FwdMaxUnpooling GPU kernel #17

Open
ghost opened this issue Sep 6, 2018 · 1 comment
Open

fast FwdMaxUnpooling GPU kernel #17

ghost opened this issue Sep 6, 2018 · 1 comment

Comments

@ghost
Copy link

ghost commented Sep 6, 2018

in file https://github.com/HyeonwooNoh/caffe/blob/master/src/caffe/layers/unpooling_layer.cpp
you asked: TODO(Yangqing): Is there a faster way to do unpooling in the channel-first

I tried also to implement a fast ForwardMaxUnPooling.
I only made the GPU version. See at the bottom of this file:
https://github.com/3DdeepAI/3DdeepAI/blob/3DdeepAI/NNkernels.h
kernel "FwdMaxUnpooling"

In my implementation there are less divisions: 2 instead of 6

https://github.com/HyeonwooNoh/caffe/blob/master/src/caffe/layers/unpooling_layer.cu

@ghost
Copy link
Author

ghost commented Sep 6, 2018

I updated the previous post and changed the code:
https://github.com/3DdeepAI/3DdeepAI/blob/3DdeepAI/NNkernels.h
Now it works. And it only uses 2 divisions (instead of 6 in your code)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

0 participants