NetApp's GenAI Toolkit

A self-managed cloud native solution with an easy to use UI and API to get started with GenAI, RAG workflows, Chatbots and AI assistants building with unstructured data on Azure NetApp Files or Google Cloud NetApp Volumes. Use it standalone (it has a great UI) or as a component in custom workflows via its API.

Provides

Enterprise level Document Search through LLM vector embeddings (auto embeds with built in PGVector DB)
Chatbot (RAG) UI/API
Tooling for Chatbots
RAG model evaluations
Exportable Smart Prompts/Chatbot endpoints
(Assistants/Agents TBD)

Deploying the Toolkit

The toolkit consists of Kubernetes YAML files, packaged as Helm charts, which can be installed on any cluster using Helm. Below are instructions on how to deploy the toolkit to a local Kubernetes cluster, AKS, and GKE. If your environment is already set up, you can use the Quickstart section for each deployment method.

AKS

AKS Quickstart

helm install genai-toolkit genai-toolkit-helmcharts --set cloudProvider="anf",nfs.volumes="1.2.3.4:/path1\,5.6.7.8:/path2"

Note the escaped comma in the nfs.volumes list. All commas have to be escaped (for now)

AKS Requirements

For ANF, the toolkit requires an AKS cluster, at least one ANF volume, and connectivity between the volumes and the cluster.

Setting up an ANF Volume

To get started, follow this guide: Create an NFS volume for Azure NetApp Files. Note the vNet where you create the volume. Once the volume is available, proceed to the next step.

Setting up an AKS Cluster

Follow this guide: Quickstart: Deploy an Azure Kubernetes Service (AKS) cluster using Azure portal. Note the vNet where you create the cluster. Once you can run kubectl commands locally against the cluster, proceed to the next step.

Verifying Network Access to the ANF Volume

To verify network access between your AKS cluster and the ANF volume, follow these steps:

Deploy a test pod: Deploy a simple test pod in your AKS cluster with tools like curl or ping installed. Use the following YAML to create a test pod:

apiVersion: v1
kind: Pod
metadata:
  name: test-pod
spec:
  containers:
  - name: test-container
    image: busybox
    command: ['sh', '-c', 'sleep 3600']
    resources:
      limits:
        memory: "128Mi"
        cpu: "500m"

Apply the YAML file using kubectl apply -f test-pod.yaml.

Access the test pod: Once the pod is running, access it using:

kubectl exec -it test-pod -- sh

Test connectivity: Inside the pod, use ping to test connectivity to the ANF volume's IP address:

ping <ANF_VOLUME_IP>

If you receive responses, the network access between the AKS cluster and the ANF volume is properly configured.

AKS Deployment

Once your Azure resources are set up, deploy the toolkit using:

helm install genai-toolkit genai-toolkit-helmcharts --set cloudProvider="anf",nfs.volumes="1.2.3.4:/path1\,5.6.7.8:/path2"

Note the escaped comma in the nfs.volumes list. All commas have to be escaped (for now)

Replace nfs.volumes with the NFS paths of your ANF volumes. This information is available in the mount instructions for the volume.

After the toolkit starts up, get the public IP by running:

kubectl get svc genai-toolkit-nginx -o jsonpath='{.status.loadBalancer.ingress[0].ip}'

Use this IP to access the UI in your preferred browser or to make direct API calls.

GKE

GKE Quickstart

helm install genai-toolkit genai-toolkit-helmcharts --set cloudProvider="gcnv",nfs.volumes="1.2.3.4:/path1\,5.6.7.8:/path2"

Note the escaped comma in the nfs.volumes list. All commas have to be escaped (for now)

GKE Requirements

For GCNV, the toolkit requires a GKE cluster, at least one GCNV volume, and connectivity between the volumes and the cluster.

Setting up a GCNV Volume

To get started, follow this guide: Create an NFS volume for Google Cloud NetApp Volumes. Note the VPC where you create the volume. Once the volume is available, proceed to the next step.

Setting up a GKE Cluster

Follow this guide: Quickstart: Deploy a GKE cluster. Note the VPC where you create the cluster. Once you can run kubectl commands locally against the cluster, proceed to the next step.

Verifying Network Access to the GCNV Volume

For some reason, you are unable to ping or curl the volumes in GCNV from the cluster. But if they are on the same subnet, you should have access to the volume fromt he cluster.

GKE Deployment

Once your Google Cloud resources are set up, deploy the toolkit using:

helm install genai-toolkit genai-toolkit-helmcharts --set cloudProvider="gcnv",nfs.volumes="1.2.3.4:/path1\,5.6.7.8:/path2"

Note the escaped comma in the nfs.volumes list. All commas have to be escaped (for now)

Replace nfs.volumes with the NFS paths of your ANF volumes. This information is available in the mount instructions for the volume.

After the toolkit starts up, get the public IP by running:

kubectl get svc genai-toolkit-nginx -o jsonpath='{.status.loadBalancer.ingress[0].ip}'

Use this IP to access the UI in your preferred browser or to make direct API calls.

Local K8s

Local Deployment Quickstart

helm install genai-toolkit genai-toolkit-helmcharts --set cloudProvider="local",localDir="/path/to/your/dataset/directory"

Local Deployment Requirements

For local deployment, you need a local Kubernetes cluster running. You can use Docker Desktop to set up a local Kubernetes cluster. Follow this guide to enable Kubernetes in Docker Desktop: Docker Desktop - Enable Kubernetes.

We have also tested this on Minikube and Orbstack but theoretically, it should work on any local K8s orchestrator. Note you might have to mount the directory into Minikube first for this to work.

Local Deployment

Once your local Kubernetes cluster is set up, deploy the toolkit using:

helm install genai-toolkit genai-toolkit-helmcharts --set cloudProvider="local",localDir="/path/to/your/dataset/directory\,/path/to/your/second/dataset/directory"

Replace localDir is a list of absolute paths to your datasets on your local machine. This will mount your local directories into the container as "ONTAP" volumes.

After the toolkit starts up use localhost to access the UI in your preferred browser or to make direct API calls.

Helm Chart Parameters

Parameter	Description	Default Value	Available values
`nfs.volumes`	A list of NFS connection strings	`1.2.3.4:/path1,5.6.7.8:/path2`	None
`cloudProvider`	The cloud provider to use.	`anf`	`anf` / `gcnv` / `local`
`db.connectionString`	The database connection string	To K8s DB
`localDir`	The local directory to use as a dataset "volume"	None

Note: By not setting the db.connectionString the toolkit will default to use an in cluster database. This is not recommended for production use cases. For testing, it is fine.

There are other optional variables but these are only used for development of the toolkit and do not require any attention.

Screenshots (on Azure)

Changelog

v0.6.0:

Improved document handling
Deployment changed to Helm
Enhanced tooling support
Switching over to generated clients
Performance and optimization improvements
Added system status endpoint

v0.5.0:

Advanced RAG (BM25 and Query Expansion)
Support for SmartPrompt tools
Default tools added (SearXNG)
Deployment of toolkit changed to kubernetes

v0.4.0:

RAG config evaluations
Cross container authentication
Image model config optional in RAG config
Enable talking to a model without context (passthrough)
UI improvements and hardening

v0.3.0:

Azure Support with Azure NetApp Files
PGVector is now the default vector database (Can change for Instaclustr or Azure Flex Server)
RAG evaluations
SmartPrompts
Azure OpenAI and OpenAI models

v0.2.0:

GCP Support with Google Cloud NetApp Volumes
Prompt API
Chat UI
Search/Explore
Gemini and Claude3 models from VertexAI

Support

If you encounter any issues with getting the GenAI toolkit up and running or configuring the AI models, please submit an Issue in this repo using the templates (bugs or feedback). You can also visit our Discussions page.

Name		Name	Last commit message	Last commit date
Latest commit History 47 Commits
.github/ISSUE_TEMPLATE		.github/ISSUE_TEMPLATE
api		api
archived/terraform		archived/terraform
genai-toolkit-helmcharts		genai-toolkit-helmcharts
images		images
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

NetApp's GenAI Toolkit

Provides

Table of Contents

Deploying the Toolkit

AKS

AKS Quickstart

AKS Requirements

Setting up an ANF Volume

Setting up an AKS Cluster

Verifying Network Access to the ANF Volume

AKS Deployment

GKE

GKE Quickstart

GKE Requirements

Setting up a GCNV Volume

Setting up a GKE Cluster

Verifying Network Access to the GCNV Volume

GKE Deployment

Local K8s

Local Deployment Quickstart

Local Deployment Requirements

Local Deployment

Helm Chart Parameters

Screenshots (on Azure)

Changelog

Support

About

Contributors 3

Languages

License

NetAppLabs/genai-toolkit-deployments

Folders and files

Latest commit

History

Repository files navigation

NetApp's GenAI Toolkit

Provides

Table of Contents

Deploying the Toolkit

AKS

AKS Quickstart

AKS Requirements

Setting up an ANF Volume

Setting up an AKS Cluster

Verifying Network Access to the ANF Volume

AKS Deployment

GKE

GKE Quickstart

GKE Requirements

Setting up a GCNV Volume

Setting up a GKE Cluster

Verifying Network Access to the GCNV Volume

GKE Deployment

Local K8s

Local Deployment Quickstart

Local Deployment Requirements

Local Deployment

Helm Chart Parameters

Screenshots (on Azure)

Changelog

Support

About

Resources

License

Stars

Watchers

Forks

Contributors 3

Languages