added qwen1.5-7b to model list #316

mengbingrock · 2024-03-21T01:59:36Z

I've verified support of Qwen1.5-7B by OpenVINO and then added it to the github workflow and readme.md

(base) root@8tvt:~/openvino.genai/llm_bench/python# ../../text_generation/causal_lm/cpp/build/greedy_causal_lm qwen/pytorch/dldt/FP32/ "Why is the Sun yellow?"

 

The Sun does not actually appear yellow to us when we look at it. In fact, it appears white because it emits light across a wide range of wavelengths, including all the colors of the visible spectrum. When this light reaches our eyes, our eyes combine the different colors to create the perception of white.

p-wysocki · 2024-03-22T08:46:52Z

cc @Wovchena @pavel-esir

text_generation/causal_lm/cpp/README.md

pavel-esir

looks good to me. thx @mengbingrock

ilya-lavrenov · 2024-04-05T10:34:58Z

.github/workflows/causal_lm_cpp.yml

+        run: |
+          source ./ov/setupvars.sh
+          convert_tokenizer ./Qwen1.5-7B-Chat/pytorch/dldt/FP16/ --output ./Qwen1.5-7B-Chat/pytorch/dldt/FP16/ --with-detokenizer --trust-remote-code
+          timeout 50s ./build/beam_search_causal_lm ./Qwen1.5-7B-Chat/pytorch/dldt/FP16/ "你好！" > ./pred_qwen15.txt


could you please also add comparison with Hugging Face?
Example is

openvino.genai/.github/workflows/causal_lm_cpp.yml

Lines 67 to 80 in f973f62

python -c "

import transformers

with open('pred.txt', 'r') as file:

predictions = file.read()

tokenizer = transformers.LlamaTokenizer.from_pretrained('TinyLlama/TinyLlama-1.1B-Chat-v1.0')

tokenized = tokenizer('69', return_tensors='pt')

for beam in transformers.LlamaForCausalLM.from_pretrained('TinyLlama/TinyLlama-1.1B-Chat-v1.0').generate(**tokenized, num_beam_groups=3, num_beams=15, num_return_sequences=15, diversity_penalty=1.0, max_new_tokens=20, early_stopping=False, length_penalty=1.0, no_repeat_ngram_size=9**9, do_sample=False):

ref = ': ' + tokenizer.decode(beam[tokenized['input_ids'].numel():], skip_special_tokens=True) + '\n'

idx = predictions.find(ref)

if -1 == idx:

raise RuntimeError(f'Missing "{ref=}" from predictions')

predictions = predictions[:idx] + predictions[idx + len(ref):]

"

echo 69 passed

add qwen1.5-7b to model list

f2d0229

mengbingrock mentioned this pull request Mar 21, 2024

[Good First Issue]: Verify qwen1.5-7b-chat with GenAI text_generation #266

Closed

p-wysocki linked an issue Mar 21, 2024 that may be closed by this pull request

[Good First Issue]: Verify qwen1.5-7b-chat with GenAI text_generation #266

Closed

p-wysocki requested a review from Wovchena March 21, 2024 07:54

pavel-esir self-requested a review March 22, 2024 10:33

pavel-esir reviewed Mar 22, 2024

View reviewed changes

text_generation/causal_lm/cpp/README.md Outdated Show resolved Hide resolved

Update text_generation/causal_lm/cpp/README.md

e517701

pavel-esir approved these changes Mar 22, 2024

View reviewed changes

pavel-esir merged commit a9ab37e into openvinotoolkit:master Mar 22, 2024
9 checks passed

ilya-lavrenov reviewed Apr 5, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

added qwen1.5-7b to model list #316

added qwen1.5-7b to model list #316

mengbingrock commented Mar 21, 2024

p-wysocki commented Mar 22, 2024

pavel-esir left a comment

ilya-lavrenov Apr 5, 2024

	python -c "
	import transformers
	with open('pred.txt', 'r') as file:
	predictions = file.read()
	tokenizer = transformers.LlamaTokenizer.from_pretrained('TinyLlama/TinyLlama-1.1B-Chat-v1.0')
	tokenized = tokenizer('69', return_tensors='pt')
	for beam in transformers.LlamaForCausalLM.from_pretrained('TinyLlama/TinyLlama-1.1B-Chat-v1.0').generate(tokenized, num_beam_groups=3, num_beams=15, num_return_sequences=15, diversity_penalty=1.0, max_new_tokens=20, early_stopping=False, length_penalty=1.0, no_repeat_ngram_size=99, do_sample=False):
	ref = ': ' + tokenizer.decode(beam[tokenized['input_ids'].numel():], skip_special_tokens=True) + '\n'
	idx = predictions.find(ref)
	if -1 == idx:
	raise RuntimeError(f'Missing "{ref=}" from predictions')
	predictions = predictions[:idx] + predictions[idx + len(ref):]
	"
	echo 69 passed

added qwen1.5-7b to model list #316

added qwen1.5-7b to model list #316

Conversation

mengbingrock commented Mar 21, 2024

p-wysocki commented Mar 22, 2024

pavel-esir left a comment

Choose a reason for hiding this comment

ilya-lavrenov Apr 5, 2024

Choose a reason for hiding this comment