This example load a language translation model and confirm its accuracy and speed based on GLUE data.
git clone -b dnnl_ep --depth 1 https://github.com/intel/neural-compressor.git
cd neural-compressor
pip install -e ./
cd examples/onnxrt/nlp/huggingface_model/text_classification/mix_precision/
pip install -r requirements.txt
Note: Validated ONNX Runtime Version.
Supported model identifier from huggingface.co:
Model Identifier |
---|
Intel/bert-base-uncased-mrpc |
Intel/roberta-base-mrpc |
Intel/xlm-roberta-base-mrpc |
Intel/camembert-base-mrpc |
distilbert-base-uncased-finetuned-sst-2-english |
Alireza1044/albert-base-v2-sst2 |
Intel/MiniLM-L12-H384-uncased-mrpc |
philschmid/MiniLM-L6-H384-uncased-sst2 |
bert-base-cased-finetuned-mrpc |
Intel/electra-small-discriminator-mrpc |
M-FAC/bert-mini-finetuned-mrpc |
Intel/xlnet-base-cased-mrpc |
Intel/bart-large-mrpc |
optimum-cli export onnx --model Intel/bert-base-uncased-mrpc --task text-classification <path to export onnx model>
Download the GLUE data with prepare_data.sh
script.
export GLUE_DIR=/path/to/glue_data
export TASK_NAME=MRPC # or SST
bash prepare_data.sh --data_dir=$GLUE_DIR --task_name=$TASK_NAME
If the hardware doesn't support bf16 instruction, please set flag as below to force bf16 conversion (this way will be deprecated):
export FORCE_BF16=1
bash run.sh --input_model=path/to/model \ # model path as *.onnx
--output_model=path/to/model_tune \ # model path as *.onnx
Please make sure DnnlExecutionProvider is in available providers list to execute evaluation.
bash eval.sh --input_model=path/to/model \ # model path as *.onnx
--output_model=path/to/model_tune \ # model path as *.onnx
--dataset_location=path/to/glue/data \
--batch_size=batch_size \ # optional