
Allow custom answer generation function in WWB #507

Merged: 1 commit merged into openvinotoolkit:master from ea/custom_answer_gen on Jun 20, 2024

Conversation

@eaidova (Collaborator) commented Jun 14, 2024

This functionality will help enable model APIs that have a different interface for generating answers (e.g. OpenVINO GenAI).

Example with GenAI:

from transformers import AutoModelForCausalLM, AutoTokenizer
import huggingface_hub as hf_hub
import whowhatbench
import openvino_genai

model_id = "databricks/dolly-v2-3b"
base_model = AutoModelForCausalLM.from_pretrained(model_id)
ov_model_dir = "./dolly-v2-3b-int4-ov"

hf_hub.snapshot_download("OpenVINO/dolly-v2-3b-int4-ov", local_dir=ov_model_dir)
optimized_model = openvino_genai.LLMPipeline(ov_model_dir, "CPU")
tokenizer = AutoTokenizer.from_pretrained(model_id)

# Hook passed as gen_answer_fn to evaluator.score(); WWB calls it with
# (model, tokenizer, question, max_new_tokens, skip_question).
def genai_gen_answer(model, tokenizer, question, max_new_tokens, skip_question):
    out = model.generate(question, max_new_tokens=max_new_tokens)
    return out.texts[0]

evaluator = whowhatbench.Evaluator(base_model=base_model, tokenizer=tokenizer)
metrics_per_prompt, metrics = evaluator.score(optimized_model, gen_answer_fn=genai_gen_answer)
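
The same hook shape also covers HF-style generate APIs (for example a model loaded through optimum-intel). The sketch below is only illustrative; in particular, the handling of skip_question assumes it means the prompt should be stripped from the returned text:

# Minimal sketch, not part of this PR: model here could be e.g. an
# optimum-intel OVModelForCausalLM or a plain transformers model.
def hf_gen_answer(model, tokenizer, question, max_new_tokens, skip_question):
    # Standard transformers-style generation: tokenize, generate, decode.
    inputs = tokenizer(question, return_tensors="pt")
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    answer = tokenizer.decode(output_ids[0], skip_special_tokens=True)
    # Rough prompt stripping; assumes skip_question asks for the answer only.
    return answer[len(question):] if skip_question else answer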

github-actions bot added the category: llm_bench label (tool/llm_bench folder) on Jun 14, 2024
eaidova force-pushed the ea/custom_answer_gen branch from 73deda5 to ab0970b on June 14, 2024 08:45
eaidova force-pushed the ea/custom_answer_gen branch from ab0970b to 659b327 on June 14, 2024 08:50
@andreyanufr (Contributor) commented

@eaidova Maybe it would be better to move gen_answer_fn into the whowhatbench.Evaluator() init? In the current implementation, base_model and the optimized model will use different inference paths.
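
For illustration only, the suggestion might look roughly like this (a gen_answer_fn keyword on the constructor is hypothetical and not what this PR adds):

# Hypothetical alternative, not the merged API: register the hook once at
# construction so base and optimized answers go through the same code path.
# Note: the GenAI-style hook would then also have to fit the HF base model.
evaluator = whowhatbench.Evaluator(
    base_model=base_model,
    tokenizer=tokenizer,
    gen_answer_fn=genai_gen_answer,  # hypothetical keyword argument
)
metrics_per_prompt, metrics = evaluator.score(optimized_model)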

@eaidova (Collaborator, Author) commented Jun 14, 2024

@andreyanufr It is expected that they may have different inference. I want to compare model answers between GenAI and the original model or optimum-intel; in this case, the base and the optimized model may be used via different APIs.

Do you think I should implement this for the base model too?
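
If it were extended to the base model as well, one possible shape (purely hypothetical, base_gen_answer_fn is not part of this PR) could be:

# Hypothetical extension: a separate hook for the reference model, e.g. when the
# reference answers should also come from a GenAI pipeline. The FP16 model path
# below is an assumption for illustration.
base_pipeline = openvino_genai.LLMPipeline("./dolly-v2-3b-fp16-ov", "CPU")

evaluator = whowhatbench.Evaluator(
    base_model=base_pipeline,
    tokenizer=tokenizer,
    base_gen_answer_fn=genai_gen_answer,  # hypothetical keyword argument
)
metrics_per_prompt, metrics = evaluator.score(optimized_model, gen_answer_fn=genai_gen_answer)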

eaidova merged commit c7c592d into openvinotoolkit:master on Jun 20, 2024
28 checks passed
eaidova deleted the ea/custom_answer_gen branch on June 20, 2024 07:41