Skip to content

Commit

Permalink
Merge pull request #1262 from run-ai/hardware-req-219
Browse files Browse the repository at this point in the history
Merge pull request #1261 from run-ai/hardware-requirements
  • Loading branch information
yarongol authored Nov 27, 2024
2 parents 39eec02 + 20fa2c4 commit bf43c68
Show file tree
Hide file tree
Showing 3 changed files with 25 additions and 5 deletions.
11 changes: 9 additions & 2 deletions docs/admin/runai-setup/cluster-setup/cluster-prerequisites.md
Original file line number Diff line number Diff line change
Expand Up @@ -15,15 +15,22 @@ This configuration is the minimum requirement you need to install and use Run:ai

| Component | Required Capacity |
| :---- | :---- |
| CPU | 8 cores |
| Memory | 16GB |
| CPU | 10 cores |
| Memory | 20GB |
| Disk space | 50GB |

### Run:ai Cluster - Worker nodes
The Run:ai Cluster supports both x86 CPUs and NVIDIA x86 GPUs.
For the list of supported GPU models, see [Supported NVIDIA Data Center GPUs and Systems](https://docs.nvidia.com/datacenter/cloud-native/gpu-operator/latest/platform-support.html#supported-nvidia-data-center-gpus-and-systems
){target=_blank}.

The following configuration represents the minimum hardware requirements for installing and operating the Run:ai cluster on worker nodes. Each node must meet these specifications:

| Component | Required Capacity |
| :---- | :---- |
| CPU | 2 cores |
| Memory | 4GB |

### Shared storage

Run:ai workloads must be able to access data from any worker node in a uniform way, to access training data and code as well as save checkpoints, weights, and other machine-learning-related artifacts.
Expand Down
9 changes: 7 additions & 2 deletions docs/admin/runai-setup/self-hosted/k8s/prerequisites.md
Original file line number Diff line number Diff line change
Expand Up @@ -25,10 +25,15 @@ Run:ai requires [Helm](https://helm.sh/){target=_blank} 3.14 or later. To instal

## Cluster hardware requirements

See Cluster prerequisites [hardware](../../cluster-setup/cluster-prerequisites.md#hardware-requirements) requirements.
The Run:ai control plane services require the following resources:

In addition, the control plane installation of Run:ai requires the configuration of Kubernetes Persistent Volumes of a total size of 110GB.
| Component | Required Capacity |
| :---- | :---- |
| CPU | 10 cores |
| Memory | 12GB |
| Disk space | 110GB |

If Run:ai cluster is planned to be installed on the same cluster as the Run:ai control plane: Ensure the control plane requirements are in addition to the Run:ai cluster [hardware requirements](../../cluster-setup/cluster-prerequisites.md#hardware-requirements).

## Run:ai software requirements

Expand Down
10 changes: 9 additions & 1 deletion docs/admin/runai-setup/self-hosted/ocp/prerequisites.md
Original file line number Diff line number Diff line change
Expand Up @@ -27,7 +27,15 @@ Run:ai requires [Helm](https://helm.sh/){target=_blank} 3.14 or later. To instal

## Cluster hardware requirements

See Cluster prerequisites [hardware](../../cluster-setup/cluster-prerequisites.md#hardware-requirements) requirements.
The Run:ai control plane services require the following resources:

| Component | Required Capacity |
| :---- | :---- |
| CPU | 10 cores |
| Memory | 12GB |
| Disk space | 110GB |

If Run:ai cluster is planned to be installed on the same cluster as the Run:ai control plane: Ensure the control plane requirements are in addition to the Run:ai cluster [hardware requirements](../../cluster-setup/cluster-prerequisites.md#hardware-requirements).

## Run:ai software requirements

Expand Down

0 comments on commit bf43c68

Please sign in to comment.