From 20fa2c42996d90b6f5925f515cf421800afd09bf Mon Sep 17 00:00:00 2001
From: Yaron
Date: Wed, 27 Nov 2024 17:13:54 +0200
Subject: [PATCH] Merge pull request #1261 from run-ai/hardware-requirements

updated hardware requirements
---
 .../cluster-setup/cluster-prerequisites.md       | 11 +++++++++--
 .../runai-setup/self-hosted/k8s/prerequisites.md |  9 +++++++--
 .../runai-setup/self-hosted/ocp/prerequisites.md | 10 +++++++++-
 3 files changed, 25 insertions(+), 5 deletions(-)

diff --git a/docs/admin/runai-setup/cluster-setup/cluster-prerequisites.md b/docs/admin/runai-setup/cluster-setup/cluster-prerequisites.md
index 34789951cb..7a81e8e6cd 100644
--- a/docs/admin/runai-setup/cluster-setup/cluster-prerequisites.md
+++ b/docs/admin/runai-setup/cluster-setup/cluster-prerequisites.md
@@ -15,8 +15,8 @@ This configuration is the minimum requirement you need to install and use Run:ai
 
 | Component | Required Capacity |
 | :---- | :---- |
-| CPU | 8 cores |
-| Memory | 16GB |
+| CPU | 10 cores |
+| Memory | 20GB |
 | Disk space | 50GB |
 
 ### Run:ai Cluster - Worker nodes
@@ -24,6 +24,13 @@ The Run:ai Cluster supports both x86 CPUs and NVIDIA x86 GPUs.
 
 For the list of supported GPU models, see [Supported NVIDIA Data Center GPUs and Systems](https://docs.nvidia.com/datacenter/cloud-native/gpu-operator/latest/platform-support.html#supported-nvidia-data-center-gpus-and-systems ){target=_blank}.
 
+The following configuration represents the minimum hardware requirements for installing and operating the Run:ai cluster on worker nodes. Each node must meet these specifications:
+
+| Component | Required Capacity |
+| :---- | :---- |
+| CPU | 2 cores |
+| Memory | 4GB |
+
 ### Shared storage
 
 Run:ai workloads must be able to access data from any worker node in a uniform way, to access training data and code as well as save checkpoints, weights, and other machine-learning-related artifacts.
diff --git a/docs/admin/runai-setup/self-hosted/k8s/prerequisites.md b/docs/admin/runai-setup/self-hosted/k8s/prerequisites.md
index 37aa5d8b6c..416cc78f33 100644
--- a/docs/admin/runai-setup/self-hosted/k8s/prerequisites.md
+++ b/docs/admin/runai-setup/self-hosted/k8s/prerequisites.md
@@ -25,10 +25,15 @@ Run:ai requires [Helm](https://helm.sh/){target=_blank} 3.14 or later. To instal
 
 ## Cluster hardware requirements
 
-See Cluster prerequisites [hardware](../../cluster-setup/cluster-prerequisites.md#hardware-requirements) requirements.
+The Run:ai control plane services require the following resources:
 
-In addition, the control plane installation of Run:ai requires the configuration of Kubernetes Persistent Volumes of a total size of 110GB.
+| Component | Required Capacity |
+| :---- | :---- |
+| CPU | 10 cores |
+| Memory | 12GB |
+| Disk space | 110GB |
 
+If the Run:ai cluster will be installed on the same cluster as the Run:ai control plane, ensure that the control plane requirements are met in addition to the Run:ai cluster [hardware requirements](../../cluster-setup/cluster-prerequisites.md#hardware-requirements).
 
 ## Run:ai software requirements
 
diff --git a/docs/admin/runai-setup/self-hosted/ocp/prerequisites.md b/docs/admin/runai-setup/self-hosted/ocp/prerequisites.md
index 175268069d..4a4d8c667a 100644
--- a/docs/admin/runai-setup/self-hosted/ocp/prerequisites.md
+++ b/docs/admin/runai-setup/self-hosted/ocp/prerequisites.md
@@ -27,7 +27,15 @@ Run:ai requires [Helm](https://helm.sh/){target=_blank} 3.14 or later. To instal
 
 ## Cluster hardware requirements
 
-See Cluster prerequisites [hardware](../../cluster-setup/cluster-prerequisites.md#hardware-requirements) requirements.
+The Run:ai control plane services require the following resources:
+
+| Component | Required Capacity |
+| :---- | :---- |
+| CPU | 10 cores |
+| Memory | 12GB |
+| Disk space | 110GB |
+
+If the Run:ai cluster will be installed on the same cluster as the Run:ai control plane, ensure that the control plane requirements are met in addition to the Run:ai cluster [hardware requirements](../../cluster-setup/cluster-prerequisites.md#hardware-requirements).
 
 ## Run:ai software requirements
 
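
Review note: the co-location rule added to the two control plane files amounts to simple addition. The following is a minimal sketch in Python, assuming the requirements are summed component by component and that the first table in `cluster-prerequisites.md` (10 cores, 20GB, 50GB) is the Run:ai cluster figure being added to. The `Requirements` class and the constant names are hypothetical and are not part of Run:ai.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Requirements:
    """Minimum capacity for one Run:ai component (hypothetical helper type)."""

    cpu_cores: int
    memory_gb: int
    disk_gb: int

    def __add__(self, other: "Requirements") -> "Requirements":
        # Component-wise sum: an assumption about what "in addition to" means.
        return Requirements(
            self.cpu_cores + other.cpu_cores,
            self.memory_gb + other.memory_gb,
            self.disk_gb + other.disk_gb,
        )


# Figures copied from the tables in this patch.
RUNAI_CLUSTER = Requirements(cpu_cores=10, memory_gb=20, disk_gb=50)
CONTROL_PLANE = Requirements(cpu_cores=10, memory_gb=12, disk_gb=110)

# Capacity needed when both are installed on the same cluster.
combined = RUNAI_CLUSTER + CONTROL_PLANE
print(combined)
```

Under those assumptions a shared cluster needs at least 20 cores, 32GB of memory, and 160GB of disk for the Run:ai services. The per-node worker minimum (2 cores, 4GB) applies separately to each worker node.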