Skip to content

Commit

Permalink
Verify notus-7b-v1 (#301)
Browse files Browse the repository at this point in the history
This should close #271 , I have updated the model list and added tests
with reference from #259

---------

Signed-off-by: Lim, Kuan Xian <[email protected]>
  • Loading branch information
kuanxian1 authored Mar 13, 2024
1 parent 900a60a commit 84f3974
Show file tree
Hide file tree
Showing 2 changed files with 28 additions and 1 deletion.
26 changes: 26 additions & 0 deletions .github/workflows/causal_lm_cpp.yml
Original file line number Diff line number Diff line change
Expand Up @@ -186,6 +186,32 @@ jobs:
source ./ov/setupvars.sh
convert_tokenizer ./Phi-2/pytorch/dldt/FP16/ --output ./Phi-2/pytorch/dldt/FP16/ --with-detokenizer --trust-remote-code
timeout 50s ./build/beam_search_causal_lm ./Phi-2/pytorch/dldt/FP16/ 69 > ./pred.txt
cpp-beam_search_causal_lm-notus-7b-v1:
runs-on: ubuntu-20.04-16-cores
steps:
- uses: actions/checkout@v4
with:
submodules: recursive
- uses: actions/setup-python@v4
with:
python-version: 3.8
- name: Install OpenVINO
run: |
mkdir ./ov/
curl https://storage.openvinotoolkit.org/repositories/openvino/packages/nightly/2024.1.0-14645-e6dc0865128/l_openvino_toolkit_ubuntu20_2024.1.0.dev20240304_x86_64.tgz | tar --directory ./ov/ --strip-components 1 -xz
sudo ./ov/install_dependencies/install_openvino_dependencies.sh
- name: Download, convert and build
run: |
source ./ov/setupvars.sh
python -m pip install --upgrade-strategy eager "optimum>=1.14" -r ./llm_bench/python/requirements.txt ./thirdparty/openvino_tokenizers/[transformers] --extra-index-url https://download.pytorch.org/whl/cpu && python ./llm_bench/python/convert.py --model_id argilla/notus-7b-v1 --output_dir ./notus-7b-v1/ --precision FP16 &
cmake -DCMAKE_BUILD_TYPE=Release -S ./text_generation/causal_lm/cpp/ -B ./build/
cmake --build ./build/ --config Release -j
wait
- name: Compare
run: |
source ./ov/setupvars.sh
convert_tokenizer ./notus-7b-v1/pytorch/dldt/FP16/ --output ./notus-7b-v1/pytorch/dldt/FP16/ --with-detokenizer --trust-remote-code
timeout 50s ./build/beam_search_causal_lm ./notus-7b-v1/pytorch/dldt/FP16/ 69 > ./pred.txt
3 changes: 2 additions & 1 deletion text_generation/causal_lm/cpp/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -122,6 +122,7 @@ To enable Unicode characters for Windows cmd open `Region` settings from `Contro
7. Dolly
1. https://huggingface.co/databricks/dolly-v2-3b
8. Phi
1. https://huggingface.co/microsoft/phi-2
1. https://huggingface.co/microsoft/phi-2
9. [notus-7b-v1](https://huggingface.co/argilla/notus-7b-v1)

This pipeline can work with other similar topologies produced by `optimum-intel` with the same model signature.

0 comments on commit 84f3974

Please sign in to comment.