You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
As part of our collaboration with Google Cloud and following up #2 to create the Deep Learning Containers (DLCs) for both Google Kubernetes Engine (GKE) and Vertex AI, we want to create dedicated examples per each alternative offered.
The examples to be created and included within this repository are listed below and divided in two categories:
PyTorch via transformers using a Custom Prediction Routine (CPR) in Vertex AI with any relevant model for any supported HF_TASK other than text-generation (could be either CPU, GPU or both)
TGI via a pre-built DLC (on GPU) from the Hugging Face Hub
TGI via a pre-built DLC (on GPU) from a GCS Bucket
TEI via a pre-built DLC (could be either CPU, GPU or both)
TEI via a pre-built DLC (could be either CPU, GPU or both) from a GCS Bucket
GKE
TGI via a pre-built DLC with a custom Kubernetes configuration for GKE Autopilot mode (on GPU) from the Hugging Face Hub Add examples/gke/tgi-deployment #41
TEI via a pre-built DLC with a custom Kubernetes configuration for GKE Autopilot mode (could be either CPU, GPU or both) Add examples/gke/tei-deployment #43
As part of our collaboration with Google Cloud and following up #2 to create the Deep Learning Containers (DLCs) for both Google Kubernetes Engine (GKE) and Vertex AI, we want to create dedicated examples per each alternative offered.
The examples to be created and included within this repository are listed below and divided in two categories:
Edit: updated as of Philipp's comment below!
Training
examples/vertex-ai/...
for LLM fine-tuning with TRL #44accelerate
Addexamples/vertex-ai/...
for LLM fine-tuning with TRL #44examples/gke/...
for LLM fine-tuning with TRL #53examples/gke/...
for LLM fine-tuning with TRL #53Inference
examples/vertex-ai/notebooks
only missing review / rewrite) Reviewexamples/vertex-ai/notebooks/*.ipynb
#55transformers
using a Custom Prediction Routine (CPR) in Vertex AI with any relevant model for any supportedHF_TASK
other thantext-generation
(could be either CPU, GPU or both)examples/gke/tgi-deployment
#41examples/gke/{tei,tgi}-from-gcs-deployment
#42examples/gke/tei-deployment
#43examples/gke/{tei,tgi}-from-gcs-deployment
#42Note
This issue assumes that the DLCs are already created and can be used as containers for the examples described above.
The text was updated successfully, but these errors were encountered: