ResNet50 inference model does not seem to support FP16 #173
Comments
@chenming22 we will work on adding …
@chenming22 we currently have AMP FP16 TF support based on the PR upstream here: tensorflow/tensorflow#62817. Since the script already supports FP16 input, installing …
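For reference, a minimal sketch (not the Model Zoo script itself) of what enabling FP16 compute in stock TensorFlow can look like, assuming TF >= 2.16.1 with the AMP FP16 support from the PR above; the Keras ResNet50 and input shape are placeholders for illustration:

```python
# Minimal sketch: FP16 mixed-precision inference with stock TensorFlow.
# Assumes TF >= 2.16.1 with the AMP FP16 support from tensorflow/tensorflow#62817;
# the Keras ResNet50 and input shape are illustrative placeholders.
import numpy as np
import tensorflow as tf

# Run compute in float16 while keeping variables in float32.
tf.keras.mixed_precision.set_global_policy("mixed_float16")

model = tf.keras.applications.ResNet50(weights=None)  # placeholder model
images = np.random.rand(1, 224, 224, 3).astype(np.float32)

# One warm-up inference; combine with oneDNN verbose logging to see whether
# FP16 kernels are actually dispatched on the CPU.
_ = model(images, training=False)
```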
Thanks @sramakintel, I will give it a try.
Hi @sramakintel, I noticed the above PR has been merged into TF 2.16.1, so I installed TF 2.16.1 from pip and then ran inference.sh for ResNet50 V1.5 inference with precision=fp16, ONEDNN_MAX_CPU_ISA=AVX512_CORE_AMX_FP16, and model resnet50_v1.pb. However, the log and the EMON metrics showed no AMX instructions being used. Are any other environment variables or configurations needed?
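One way to check which ISA and data type oneDNN actually dispatches is oneDNN's verbose logging. Below is a minimal sketch assuming ONEDNN_VERBOSE and ONEDNN_MAX_CPU_ISA behave as documented for oneDNN 3.x; the exact verbose output format varies by version, so treat the expected strings as illustrative:

```python
# Minimal sketch of one way to verify which ISA / data type oneDNN dispatches.
# ONEDNN_VERBOSE and ONEDNN_MAX_CPU_ISA are documented oneDNN environment
# variables, but the verbose output format varies by oneDNN version.
import os

# These must be set before TensorFlow (and thus oneDNN) is first imported.
os.environ["ONEDNN_VERBOSE"] = "1"
os.environ["ONEDNN_MAX_CPU_ISA"] = "AVX512_CORE_AMX_FP16"

import tensorflow as tf  # noqa: E402

tf.keras.mixed_precision.set_global_policy("mixed_float16")
model = tf.keras.applications.ResNet50(weights=None)  # placeholder model
_ = model(tf.random.uniform((1, 224, 224, 3)), training=False)

# In the verbose output, look at the ISA reported in the info header (it should
# mention AMX with float16 support) and at the convolution/matmul primitive
# lines, whose data types should be f16 rather than f32.
```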
I want to evaluate inference performance using AMX-FP16 on a new CPU, but the default Docker image / AI Tools package does not support AMX-FP16 because of an old oneDNN version (3.2.0). So I built oneDNN 3.3.0 and Intel Extension for TensorFlow 2.15.0 from source and used the following model:
FP32, FP16 and BFloat16 pretrained model:
However, the oneDNN log showed that the precision of this model was FP32, not FP16.
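A hedged sketch of forcing FP16 auto mixed precision through Intel Extension for TensorFlow (ITEX) on a frozen FP32 graph follows. The ITEX_AUTO_MIXED_PRECISION* variables are based on my reading of the ITEX advanced AMP documentation, and the graph path and tensor names are placeholders, so verify all of them against your ITEX 2.15.0 build and model:

```python
# Hedged sketch, not a verified recipe: FP16 auto mixed precision via Intel
# Extension for TensorFlow (ITEX) on a frozen FP32 graph. The ITEX_* variables
# follow my reading of the ITEX advanced AMP docs; the .pb path and tensor
# names are placeholders.
import os

os.environ["ITEX_AUTO_MIXED_PRECISION"] = "1"              # enable ITEX advanced AMP
os.environ["ITEX_AUTO_MIXED_PRECISION_DATA_TYPE"] = "FLOAT16"
os.environ["ONEDNN_VERBOSE"] = "1"                          # show per-primitive data types

import numpy as np       # noqa: E402
import tensorflow as tf  # noqa: E402
# When installed, ITEX is expected to load automatically as a TensorFlow plugin.

# Load the frozen FP32 graph; AMP should rewrite eligible ops to FP16 at runtime.
with tf.io.gfile.GFile("resnet50_v1.pb", "rb") as f:
    graph_def = tf.compat.v1.GraphDef()
    graph_def.ParseFromString(f.read())

graph = tf.Graph()
with graph.as_default():
    tf.compat.v1.import_graph_def(graph_def, name="")

with tf.compat.v1.Session(graph=graph) as sess:
    # "input_tensor" / "softmax_tensor" are placeholder names; inspect
    # graph.get_operations() to find the real ones for your .pb file.
    inp = graph.get_tensor_by_name("input_tensor:0")
    out = graph.get_tensor_by_name("softmax_tensor:0")
    images = np.random.rand(1, 224, 224, 3).astype(np.float32)
    _ = sess.run(out, feed_dict={inp: images})
```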