Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merge releases/2024/3 into master #720

Merged

Conversation

Wovchena
Copy link
Collaborator

No description provided.

Wovchena and others added 30 commits July 15, 2024 13:48
Workaround Python_VERSION_MAJOR and MINOR not being set by replasing
Python3 with Python

Disable generation of some of the COMPONENTs not needed for GenAI. There
are still unwanted empty archives, but they are generated
uncounditionally by rapidjson.
…envinotoolkit#604)

That allows LLMPipeline to create ContinuousBatchingPipeline as a
backend. There's also a constructor accepting ireq, which can be used if
the model was already transformed appropriately for
ContinuousBatchingPipeline. But it feels it's going to be misleading and
it simpler just to throw if such constructor is called with
ContinuousBatchingPipeline backend.
Updated default configurations based on results from CVS-143530.

(cherry picked from commit f460002)
Co-authored-by: Yaroslav Tarkan <[email protected]>
…#642)

OpenVINOGenAITargets.cmake was excluded from packaging because
CPACK_COMPONENTS_ALL is custom now and doesn't install Unspecified
component
…oop for greedy sampling (openvinotoolkit#607)

Searching for max element in a custom loop gives better performance than
using std::max_element
- Added Readme for python tests
- Added `--model_ids` option to run selectively only on specific models

---------

Co-authored-by: Zlobin Vladimir <[email protected]>
Symbols that cause errors:
- `\u0643`
- `\u25aa`
… optional plugin_config in tokenizer (openvinotoolkit#669)

This improves performance of CB lib when tested within OVMS.
Already merged to master:
openvinotoolkit#651
This is cherry-pick
…oolkit#670)

[mixtral-8x7b-instruct-v0.1-int4-ov](https://huggingface.co/OpenVINO/mixtral-8x7b-instruct-v0.1-int4-ov/)
didn't have `generation_config.json` therefore generation continued
ininitely. EOS_TOKEN_ID was red correctly but during generation it was
not met.

Updated docs so in every generate call max_new_tokens is set either in
arguments or via default generation config
`pipe.set_generation_config({'max_new_tokens': 100, 'num_beam_groups':
3, ...)`

tickets: CVS-146933 CVS-146324
Co-authored-by: Anastasiia Pnevskaia <[email protected]>
src/README.md Outdated Show resolved Hide resolved
samples/python/benchmark_genai/README.md Outdated Show resolved Hide resolved
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

do we have CI runs for these new samples? I don't see it

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

No. It was last hour merge

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

do we have a task for it?

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Created 148650

src/README.md Outdated Show resolved Hide resolved
src/cpp/include/openvino/genai/llm_pipeline.hpp Outdated Show resolved Hide resolved
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Are they applicable only for Stateful pipeline or works will for static / CB as well?

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Only for Stateful. I suspect uninitialized perf metrics are returned for CB backend.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Let's create a task for it.
We need unified response for all backends.

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Created 148651

@@ -16,7 +16,7 @@ struct SchedulerConfig {
std::size_t num_kv_blocks = 0;

// total size of KV cache in GB
std::size_t cache_size = 0;
std::size_t cache_size = 1;
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

why? it's explicitly set to 0, so users have to set some meaningful values on their own

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

It's now possible to construct the struct with all the default values which is used to create CB backend in LLMPipeline.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

ok, let's fix it together with CB device name via passing scheduler config as device property

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@olpipi, please, address it

- Microsoft Visual Studio 2019 or higher, version 16.3 or later
- Python 3.8 or higher
- Git for Windows
#### Option 2 - setting environment variables manually:
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

why do we need to recommend such approach? setupvars is default and we don't need to expose variables manually (which is implementation detail), because we will have to support it in future in case of any changes in OpenVINO scripts.

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Developers tend to stop at build step and skip installation part. @TolyaTalamanov is an example.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

it's a developer problems that they cannot read readme step by step

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

They can. But they aren't explained that it's not recommended to skip it and why.

@Wovchena Wovchena added this pull request to the merge queue Aug 1, 2024
@Wovchena Wovchena removed this pull request from the merge queue due to a manual request Aug 1, 2024
@Wovchena Wovchena added this pull request to the merge queue Aug 1, 2024
Merged via the queue into openvinotoolkit:master with commit 47fbb5e Aug 1, 2024
27 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.