forked from kserve/kserve
-
Notifications
You must be signed in to change notification settings - Fork 0
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Enhance ci env #9
Closed
Closed
Conversation
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
sivanantha321
force-pushed
the
enhance-ci-env
branch
25 times, most recently
from
February 13, 2024 05:57
1923acf
to
ce387d0
Compare
…rve#3431) Signed-off-by: Sivanantham Chinnaiyan <[email protected]>
sivanantha321
force-pushed
the
enhance-ci-env
branch
2 times, most recently
from
February 14, 2024 11:09
d3b4fcd
to
f060d7c
Compare
* Fix parameters value type conversion Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Add test Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Fix datatype check in infer type Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Fix datatype check in infer type Signed-off-by: Sivanantham Chinnaiyan <[email protected]> --------- Signed-off-by: Sivanantham Chinnaiyan <[email protected]>
This is a small fix to the test_raw_logger.py/test_kserve_logger E2E test. This E2E test was creating a `message-dumper-raw` that was dependant on a Serverless setup, leading to the E2E to fail on an environment where Istio nor KNative are available. The fix is to add the right annotation to `message-dumper-raw` to deploy it in Raw mode, and adjust the logger URL of the `isvc-logger-raw` to point to the Kubernetes Service, rather than the KNative service. Signed-off-by: Edgar Hernández <[email protected]>
Some code of the E2Es assume the environment is GitHub, because it is referring to GitHub-specific variables. This PR focuses on references to the `kserve/image-transformer` image. This image is built in the CI flow and made available to the runner, so that a pull from an external registry is not needed. The references to this image are changed to an environment variable that is more agnostic to the runner, in an effort to make E2Es more compatible in other environments. The e2e-test.yml is modified to declare the new variable. Signed-off-by: Edgar Hernández <[email protected]>
* Initial commit to support vllm as alternative backend Signed-off-by: Gavrish Prabhu <[email protected]> * include minor fixes and readme changes Signed-off-by: Gavrish Prabhu <[email protected]> * fix poetry lock issues Signed-off-by: Gavrish Prabhu <[email protected]> * fix lint issues Signed-off-by: Gavrish Prabhu <[email protected]> * use_vllm support True as default Signed-off-by: Gavrish Prabhu <[email protected]> * refactor code and fix review comments Signed-off-by: Gavrish Prabhu <[email protected]> * build failure - fix tests and install vllm part of dockerfile Signed-off-by: Gavrish Prabhu <[email protected]> * fix poetry lock issue Signed-off-by: Gavrish Prabhu <[email protected]> * include string constants Signed-off-by: Gavrish Prabhu <[email protected]> * linting fix Signed-off-by: Gavrish Prabhu <[email protected]> * fix review comments Signed-off-by: Gavrish Prabhu <[email protected]> * fix tests Signed-off-by: Gavrish Prabhu <[email protected]> * fix review comments Signed-off-by: Gavrish Prabhu <[email protected]> * add support in vllm for locally downloaded models Signed-off-by: Gavrish Prabhu <[email protected]> * Update Readme Signed-off-by: Gavrish Prabhu <[email protected]> * Update Readme Signed-off-by: Gavrish Prabhu <[email protected]> * Update python/huggingfaceserver/README.md Signed-off-by: Dan Sun <[email protected]> --------- Signed-off-by: Gavrish Prabhu <[email protected]> Signed-off-by: Dan Sun <[email protected]> Co-authored-by: Dan Sun <[email protected]>
Expose qpext aggregate metrics port Signed-off-by: Sivanantham Chinnaiyan <[email protected]>
* Bump paddlepaddle to 2.6.0 Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Bump transformers to version 4.37.2 Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Bump cryptography to version 42.0.2 Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Bump fastapi to version 0.109.2 Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Bump pillow to version 10.2.0 Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Bump aiohttp to version 3.9.3 Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Revert fastapi bump Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Bump ray serve to 2.9.2 Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Pin alibi to >=0.9.4 Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Fix alibi version Signed-off-by: Sivanantham Chinnaiyan <[email protected]> --------- Signed-off-by: Sivanantham Chinnaiyan <[email protected]>
Signed-off-by: Dan Sun <[email protected]>
…ve#3436) * feat: Allow to disable ingress creation for raw deployment mode Signed-off-by: Yuan Tang <[email protected]> * test Signed-off-by: Yuan Tang <[email protected]> * Add configs Signed-off-by: Yuan Tang <[email protected]> * codegen Signed-off-by: Yuan Tang <[email protected]> * python docs Signed-off-by: Yuan Tang <[email protected]> * update helm docs Signed-off-by: Yuan Tang <[email protected]> * undo changes in pkg/apis/serving/v1beta1/zz_generated.deepcopy.go Signed-off-by: Yuan Tang <[email protected]> * remove unnecessary changes Signed-off-by: Yuan Tang <[email protected]> * Empty-Commit Signed-off-by: Yuan Tang <[email protected]> * remove hf change Signed-off-by: Yuan Tang <[email protected]> * Update config/configmap Signed-off-by: Yuan Tang <[email protected]> * fix chart syntax Signed-off-by: Yuan Tang <[email protected]> * Empty-Commit Signed-off-by: Yuan Tang <[email protected]> --------- Signed-off-by: Yuan Tang <[email protected]>
* build: Skip already downloaded Istio in quick_install.sh Signed-off-by: Yuan Tang <[email protected]> * Update quick_install.sh Signed-off-by: Yuan Tang <[email protected]> --------- Signed-off-by: Yuan Tang <[email protected]>
…rver (kserve#3449) * fix: HuggingFace predictor should not be recognized as multi-model server Signed-off-by: Yuan Tang <[email protected]> * add Signed-off-by: Yuan Tang <[email protected]> --------- Signed-off-by: Yuan Tang <[email protected]>
* Do not inject model agent for huggingface runtime Signed-off-by: Dan Sun <[email protected]> * Add option to return token type ids Signed-off-by: Dan Sun <[email protected]> * Add option to return token type ids Signed-off-by: Dan Sun <[email protected]> * Default model dir Signed-off-by: Dan Sun <[email protected]> --------- Signed-off-by: Dan Sun <[email protected]>
Add padding and truncation in huggingface tokenizer Signed-off-by: Kevin Mingtarja <[email protected]> Signed-off-by: Kevin Mingtarja <[email protected]>
kserve#3456) * Fix model dir does not work with vllm Signed-off-by: Dan Sun <[email protected]> * Fix extra comma Signed-off-by: Dan Sun <[email protected]> --------- Signed-off-by: Dan Sun <[email protected]>
Signed-off-by: Sivanantham Chinnaiyan <[email protected]>
Signed-off-by: Sivanantham Chinnaiyan <[email protected]>
Signed-off-by: Sivanantham Chinnaiyan <[email protected]>
Signed-off-by: Sivanantham Chinnaiyan <[email protected]>
Signed-off-by: Sivanantham Chinnaiyan <[email protected]>
Signed-off-by: Sivanantham Chinnaiyan <[email protected]>
Signed-off-by: Sivanantham Chinnaiyan <[email protected]>
Signed-off-by: Sivanantham Chinnaiyan <[email protected]>
Signed-off-by: Sivanantham Chinnaiyan <[email protected]>
Signed-off-by: Sivanantham Chinnaiyan <[email protected]>
Signed-off-by: Sivanantham Chinnaiyan <[email protected]>
Signed-off-by: Sivanantham Chinnaiyan <[email protected]>
Signed-off-by: Sivanantham Chinnaiyan <[email protected]>
Signed-off-by: Sivanantham Chinnaiyan <[email protected]>
Signed-off-by: Sivanantham Chinnaiyan <[email protected]>
Signed-off-by: Sivanantham Chinnaiyan <[email protected]>
Signed-off-by: Sivanantham Chinnaiyan <[email protected]>
sivanantha321
force-pushed
the
enhance-ci-env
branch
from
February 20, 2024 08:14
a942bad
to
5d83a10
Compare
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
What this PR does / why we need it:
Which issue(s) this PR fixes (optional, in
fixes #<issue number>(, fixes #<issue_number>, ...)
format, will close the issue(s) when PR gets merged):Fixes #
Type of changes
Please delete options that are not relevant.
Feature/Issue validation/testing:
Please describe the tests that you ran to verify your changes and relevant result summary. Provide instructions so it can be reproduced.
Please also list any relevant details for your test configuration.
Test A
Test B
Logs
Special notes for your reviewer:
Checklist:
Release note: