General support for Float16 and other DTypes #2302
May I ask how half precision is supported on the CPU?
I am following the way
I have a 1080 coming in next week. Let's try to merge in the common ones so we can have benchmark numbers released ASAP.
If you start working on one, just post it here so that we don't duplicate effort.
I updated the list a bit.
I need to mention one thing: simply supporting the half_t type is not enough to make things faster with fp16. Usually explicit vectorization of the code is needed, so unless values are operated on together in a Packet structure with intrinsics, there is unlikely to be a speedup.
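To see why plain half_t support doesn't speed things up on its own, here is a minimal sketch (using NumPy as a stand-in, not MXNet's internals: on most x86 CPUs float16 arithmetic is emulated by widening to float32, so without vectorized hardware or intrinsic support the fp16 path is typically no faster, and often slower):

```python
import timeit
import numpy as np

# Same data in half and single precision.
a16 = np.random.rand(1_000_000).astype(np.float16)
b16 = np.random.rand(1_000_000).astype(np.float16)
a32 = a16.astype(np.float32)
b32 = b16.astype(np.float32)

# Without vectorized half-precision hardware support, the fp16 add is
# emulated (widen -> add -> narrow) and is usually the slower of the two.
t16 = timeit.timeit(lambda: a16 + b16, number=100)
t32 = timeit.timeit(lambda: a32 + b32, number=100)
print(f"float16: {t16:.3f}s  float32: {t32:.3f}s")
```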
It sounds like the underlying mshadow needs optimization for data alignment. Another thing I'm thinking about is whether we should add an option to run the backward computation in higher precision (float) for the half_t type. The half_t type cannot represent very small gradients, and enabling this will be messy in the code.
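To illustrate the small-gradient problem concretely (a minimal NumPy sketch; float16's smallest positive subnormal is about 6e-8, so anything smaller flushes to zero):

```python
import numpy as np

# A gradient that fp32 represents fine but fp16 cannot.
tiny_grad = np.float32(1e-8)
print(np.float16(tiny_grad))         # 0.0 -- the update is silently lost
print(np.float16(np.float32(1e-4)))  # ~0.0001 -- still representable
```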
@Godricly Hi, do you have some examples of using fp16? Is it used in training or inference?
Not yet. Basically, you can insert some cast layers to transform the input (data and label) into fp16, so the network runs in fp16. Currently, mxnet has compatibility issues with fp16, so I cancelled my previous PR #2564. If you are interested in fp16, you can follow my branch to enable fp16 param init and single-machine training; the multi-machine one depends on ps-lite, which is a little bit hard to get working. You also need to make some modifications to the optimizer, which uses float type to update weights and converts them back to fp16 in the network. For the lstm case, the provided data type is needed, which is painful. If you have any better solution, please let me know. 😆 BTW, the DType BN is only functional using cudnn.
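A minimal sketch of the optimizer modification described above (a NumPy stand-in, not MXNet's API; the function name and shapes are hypothetical): keep a float32 master copy of each weight, apply the update there, and cast back to fp16 for the network.

```python
import numpy as np

def sgd_step_fp16(weight_fp16, grad_fp16, master_fp32, lr=0.01):
    # Accumulate in float32 so tiny updates survive rounding/underflow.
    master_fp32 -= lr * grad_fp16.astype(np.float32)
    # Cast back down for the fp16 forward/backward pass.
    weight_fp16[...] = master_fp32.astype(np.float16)

# Usage: the float32 master copy persists across steps.
w = np.random.rand(4).astype(np.float16)
master = w.astype(np.float32)
g = np.full(4, 1e-4, dtype=np.float16)
sgd_step_fp16(w, g, master)
```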
@Godricly Thanks very much.
Pooling and Dropout have been merged.
Updated it. What is your current status on BatchNorm? #2562 is my latest state, but you mentioned that you made some updates?
There is a branch under my mxnet fork. If you are only using cuDNN BN, it should be good enough to start with.
Considering these two issues, I didn't submit it.
What's the status of fp16 support in mxnet?
@lygstate For inference or training acceleration on mobile or embedded devices, etc.?
I mean the progress of fp16 support. If it's not finished, what can I do to help?
I want to do training in fp32 but predict in fp16. Is that possible?
@lygstate
@Godricly Yeah, I want to use fp16 with cudnn for performance reasons :) Thanks a lot
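For the train-in-fp32, predict-in-fp16 question above, the simple route is a one-time cast of the trained parameters (a hedged NumPy sketch with hypothetical layer names, not MXNet's API; inputs must be cast too so the whole forward pass stays in half precision):

```python
import numpy as np

# Hypothetical fp32 parameters from training, keyed by name.
params_fp32 = {
    "fc1_weight": np.random.rand(128, 64).astype(np.float32),
    "fc1_bias": np.zeros(128, dtype=np.float32),
}

# One-time cast for deployment; inference then runs in fp16 end to end.
params_fp16 = {k: v.astype(np.float16) for k, v in params_fp32.items()}
x = np.random.rand(64).astype(np.float16)
y = params_fp16["fc1_weight"] @ x + params_fp16["fc1_bias"]
print(y.dtype)  # float16
```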
Quick fix for monitoring weights on float16: #8506
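The usual workaround for monitoring fp16 weights (a sketch of the general idea, assuming a hypothetical stat function, not the actual content of #8506) is to upcast to float32 before reducing, since summing many float16 values overflows quickly (float16 max is about 65504):

```python
import numpy as np

def weight_norm(w):
    # Reduce in float32: an fp16 sum of squares hits inf long before a
    # realistically sized weight tensor is fully summed.
    w32 = w.astype(np.float32)
    return float(np.sqrt(np.sum(w32 * w32)))

print(weight_norm(np.ones(100_000, dtype=np.float16)))  # ~316.23
```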
Seems like crop, slice_channel, and softmax_activation are all deprecated operators; maybe we can skip FP16 support for those?
@eric-haibin-lin Since we have quite a few separate (specific) requests for FP16 support, do we merge them together and close out the redundant ones, or keep the issues the way they are?
@ChaiBapchya this list might actually be outdated now...
Do you recommend closing this issue in that case?
I do see that many ops are going to be deprecated. Closing it now. Please file a separate GitHub issue when an unsupported fp16 op is encountered.
So from what I can tell, the following operators currently don't support anything other than real_t (i.e. Float32). I am going to work on fixing the ones important for my research, and I would welcome any help. I feel that having comprehensive support for other datatypes is important for MXNet.

Up for grabs:
crop
slice_channel
softmax_activation
matrix_op
l2_normalization
make_loss
identity_attach_KL_sparse_reg
broadcast_reduce
embedding
smooth_l1_unary (depending on a resolution to dmlc/mshadow#125)
leaky_relu: [RFC][DTypes] pooling and LeakyReLU #2280
regression_output: DType regression #3018
lrn
batch_norm

Done:
roi_pooling: OP ROIPooling CPU fix and DType support #3011
deconvolution: Enable DTypes in deconvolution #2322
dropout
pooling
reshape: DTypes for Concat, UpSampling, Reshape, BlockGrad, SwapAxis and ElementWiseSum #2380
swapaxis: DTypes for Concat, UpSampling, Reshape, BlockGrad, SwapAxis and ElementWiseSum #2380
elementwise_sum: DTypes for Concat, UpSampling, Reshape, BlockGrad, SwapAxis and ElementWiseSum #2380
upsampling: DTypes for Concat, UpSampling, Reshape, BlockGrad, SwapAxis and ElementWiseSum #2380
concat: DTypes for Concat, UpSampling, Reshape, BlockGrad, SwapAxis and ElementWiseSum #2380
block_grad: DTypes for Concat, UpSampling, Reshape, BlockGrad, SwapAxis and ElementWiseSum #2380