Skip to content

Latest commit

 

History

History
51 lines (37 loc) · 2.31 KB

README_gmc.md

File metadata and controls

51 lines (37 loc) · 2.31 KB

Deploy CodeTrans in a Kubernetes Cluster

This document outlines the deployment process for a Code Translation (CodeTran) application that utilizes the GenAIComps microservice components on Intel Xeon servers and Gaudi machines.

Please install GMC in your Kubernetes cluster, if you have not already done so, by following the steps in Section "Getting Started" at GMC Install. We will soon publish images to Docker Hub, at which point no builds will be required, further simplifying install.

If you have only Intel Xeon machines you could use the codetrans_xeon.yaml file or if you have a Gaudi cluster you could use codetrans_gaudi.yaml In the below example we illustrate on Xeon.

Required Models

By default, the LLM model is set to a default value as listed below:

Service Model
LLM HuggingFaceH4/mistral-7b-grok

Change the MODEL_ID in codetrans_xeon.yaml for your needs.

Deploy the RAG application

  1. Create the desired namespace if it does not already exist and deploy the application

    export APP_NAMESPACE=CT
    kubectl create ns $APP_NAMESPACE
    sed -i "s|namespace: codetrans|namespace: $APP_NAMESPACE|g"  ./codetrans_xeon.yaml
    kubectl apply -f ./codetrans_xeon.yaml
  2. Check if the application is up and ready

    kubectl get pods -n $APP_NAMESPACE
  3. Deploy a client pod for testing

    kubectl create deployment client-test -n $APP_NAMESPACE --image=python:3.8.13 -- sleep infinity
  4. Check that client pod is ready

    kubectl get pods -n $APP_NAMESPACE
  5. Send request to application

    export CLIENT_POD=$(kubectl get pod -n $APP_NAMESPACE -l app=client-test -o jsonpath={.items..metadata.name})
    export accessUrl=$(kubectl get gmc -n $APP_NAMESPACE -o jsonpath="{.items[?(@.metadata.name=='codetrans')].status.accessUrl}")
    kubectl exec "$CLIENT_POD" -n $APP_NAMESPACE -- curl $accessUrl -X POST -d '{"language_from": "Golang","language_to": "Python","source_code": "package main\n\nimport \"fmt\"\nfunc main() {\n    fmt.Println(\"Hello, World!\");\n}"}' -H 'Content-Type: application/json' > $LOG_PATH/gmc_codetrans.log