Clean up VLMPipeline #68

Closed · wants to merge 46 commits

Conversation

@Wovchena (Owner) commented Oct 7, 2024

Address comments in openvinotoolkit#912

```diff
 );

 /// @brief Default destructor.
 ~VLMPipeline();

 /// @brief Generate a response given a prompt and any number of
-/// uint8 RGB images.
+/// uint8 RGB images with [HWC] layout.
```


if I pass NHWC, what will happen?

@Wovchena (Owner, Author):

1HWC is actually expected. Should I enable both?


I suppose yes. Maybe in the PR with multiple images you can add support for NHWC, where ov::Tensor represents multiple images and N > 1?

@Wovchena (Owner, Author):

Done in 7441a18. @popovaan, please add tests for [NHWC] ([1HWC], [2HWC]) and [HWC] image layouts.
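
For context, a minimal sketch of how a caller could build such a [1, H, W, C] (1HWC) uint8 tensor from Python; the `generate()` keyword names and the model directory below are illustrative assumptions, not the exact bindings added in this PR:

```python
# Minimal sketch: wrap an RGB image as a [1, H, W, C] uint8 ov.Tensor (1HWC layout).
# The generate() keyword names and the model path are assumptions for illustration.
import numpy as np
import openvino as ov
import openvino_genai
from PIL import Image

def read_image(path: str) -> ov.Tensor:
    """Load an RGB image and wrap it as a [1, H, W, C] uint8 tensor."""
    pic = Image.open(path).convert("RGB")
    data = np.asarray(pic, dtype=np.uint8)   # [H, W, C]
    return ov.Tensor(data[np.newaxis, ...])  # [1, H, W, C]

pipe = openvino_genai.VLMPipeline("./miniCPM-V-2_6", "CPU")  # hypothetical model dir
print(pipe.generate("Describe this image.",
                    image=read_image("cat.png"),
                    max_new_tokens=100))
```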

@Wovchena mentioned this pull request Oct 8, 2024
AlexKoff88 and others added 3 commits October 8, 2024 08:34
Use the latest Optimum from GitHub:
https://github.com/huggingface/optimum.git

Some examples to try:
```sh
# Export FP16 model
optimum-cli export openvino -m SimianLuo/LCM_Dreamshaper_v7 --weight-format fp16 sd-lcm-fp16
# Export INT8 WOQ model
optimum-cli export openvino -m SimianLuo/LCM_Dreamshaper_v7 --weight-format int8 sd-lcm-int8
# Collect the references
wwb --base-model sd-lcm-fp16 --gt-data lcm_test/sd_xl.json --model-type sd-lcm
# Compute the metric
wwb --target-model sd-lcm-int8 --gt-data lcm_test/sd_xl.json --model-type sd-lcm
```

In addition, you can compare against the SD LCM model quantized in Hybrid quantization mode, as follows:
```python
import datasets
import transformers
from pathlib import Path
from optimum.intel import OVConfig, OVQuantizer, OVWeightQuantizationConfig, OVLatentConsistencyModelPipeline
from optimum.intel.openvino.configuration import OVQuantizationMethod

transformers.logging.set_verbosity_error()
datasets.logging.set_verbosity_error()

MODEL_ID = "SimianLuo/LCM_Dreamshaper_v7"
DATASET_NAME = "jxie/coco_captions"
NUM_SAMPLES = 200

int8_model_path = Path(f"models/{MODEL_ID}").with_name("sd-lcm-int8")

# Stream a calibration subset of COCO captions and keep only the prompts.
dataset = datasets.load_dataset(DATASET_NAME, split="train", streaming=True).shuffle(seed=42)
print(next(iter(dataset)))

def preprocess_fn(example):
    return {"prompt": example["caption"]}

dataset = dataset.take(NUM_SAMPLES)
calibration_dataset = dataset.map(preprocess_fn, remove_columns=dataset.column_names)

# Export the LCM pipeline to OpenVINO and quantize it in Hybrid mode.
int8_pipe = OVLatentConsistencyModelPipeline.from_pretrained(model_id=MODEL_ID, export=True)
quantization_config = OVWeightQuantizationConfig(bits=8, num_samples=NUM_SAMPLES, quant_method=OVQuantizationMethod.HYBRID)
quantizer = OVQuantizer(int8_pipe)
quantizer.quantize(
    ov_config=OVConfig(quantization_config=quantization_config),
    calibration_dataset=calibration_dataset,
    save_directory=int8_model_path,
)
```

And compute the metric the same way:
```sh
wwb --target-model sd-lcm-int8 --gt-data lcm_test/sd_xl.json --model-type sd-lcm
```
ilya-lavrenov and others added 22 commits October 9, 2024 10:21
- Added VLM bindings.
- Added Python VLM chat sample.
- Added initialization of performance metrics with zeros.
Tickets: CVS-153174, CVS-153173, CVS-153626

PR to miniCPM-V-2_6 branch:
#62

---------

Co-authored-by: wenyi5608 <[email protected]>
Co-authored-by: Yang,Su <[email protected]>
Co-authored-by: Wovchena <[email protected]>
Co-authored-by: Yaroslav Tarkan <[email protected]>
Co-authored-by: Alina Kladieva <[email protected]>
Co-authored-by: Pavel Esir <[email protected]>
Co-authored-by: Pavel Esir <[email protected]>
Co-authored-by: Artur Paniukov <[email protected]>
Co-authored-by: Ekaterina Aidova <[email protected]>
Co-authored-by: Ilya Lavrenov <[email protected]>
Co-authored-by: Mikhail Ryzhov <[email protected]>
TODO:
- [ ] Remove `ov::Core` from constructors.
- [ ] Hide files and API.

---------

Co-authored-by: wenyi5608 <[email protected]>
Co-authored-by: Yang,Su <[email protected]>
Co-authored-by: Yaroslav Tarkan <[email protected]>
Co-authored-by: Alina Kladieva <[email protected]>
Co-authored-by: Pavel Esir <[email protected]>
Co-authored-by: Pavel Esir <[email protected]>
Co-authored-by: Artur Paniukov <[email protected]>
Co-authored-by: Ekaterina Aidova <[email protected]>
Co-authored-by: Ilya Lavrenov <[email protected]>
Co-authored-by: Mikhail Ryzhov <[email protected]>
@ilya-lavrenov

Can be closed

@Wovchena closed this Oct 11, 2024