AI-Hypercomputer
diff --git a/‎README.md‎
Lines changed: 5 additions & 5 deletions b/‎README.md‎
Lines changed: 5 additions & 5 deletions
diff --git a/‎docs/configuring-environment-gke-a4-high.md‎ renamed to ‎docs/configuring-environment-gke-a4.md‎
Lines changed: 11 additions & 11 deletions b/‎docs/configuring-environment-gke-a4-high.md‎ renamed to ‎docs/configuring-environment-gke-a4.md‎
Lines changed: 11 additions & 11 deletions
diff --git a/‎src/frameworks/a4high/maxtext-configs/llama3-1-405b-256gpus-a4h-fp8.yaml‎ renamed to ‎src/frameworks/a4/maxtext-configs/llama3-1-405b-256gpus-a4-fp8.yaml‎ b/‎src/frameworks/a4high/maxtext-configs/llama3-1-405b-256gpus-a4h-fp8.yaml‎ renamed to ‎src/frameworks/a4/maxtext-configs/llama3-1-405b-256gpus-a4-fp8.yaml‎
diff --git a/‎src/frameworks/a4high/nemo-configs/llama3-1-405b-224gpus-a4high-bf16.yaml‎ renamed to ‎src/frameworks/a4/nemo-configs/llama3-1-405b-224gpus-a4-bf16.yaml‎ b/‎src/frameworks/a4high/nemo-configs/llama3-1-405b-224gpus-a4high-bf16.yaml‎ renamed to ‎src/frameworks/a4/nemo-configs/llama3-1-405b-224gpus-a4-bf16.yaml‎
diff --git a/‎src/frameworks/a4high/nemo-configs/llama3-1-405b-224gpus-a4high-fp8.yaml‎ renamed to ‎src/frameworks/a4/nemo-configs/llama3-1-405b-224gpus-a4-fp8.yaml‎ b/‎src/frameworks/a4high/nemo-configs/llama3-1-405b-224gpus-a4high-fp8.yaml‎ renamed to ‎src/frameworks/a4/nemo-configs/llama3-1-405b-224gpus-a4-fp8.yaml‎
diff --git a/‎src/helm-charts/a4high/maxtext-training/Chart.yaml‎ renamed to ‎src/helm-charts/a4/maxtext-training/Chart.yaml‎ b/‎src/helm-charts/a4high/maxtext-training/Chart.yaml‎ renamed to ‎src/helm-charts/a4/maxtext-training/Chart.yaml‎
diff --git a/‎src/helm-charts/a4high/maxtext-training/templates/maxtext-configmap.yaml‎ renamed to ‎src/helm-charts/a4/maxtext-training/templates/maxtext-configmap.yaml‎ b/‎src/helm-charts/a4high/maxtext-training/templates/maxtext-configmap.yaml‎ renamed to ‎src/helm-charts/a4/maxtext-training/templates/maxtext-configmap.yaml‎
diff --git a/‎src/helm-charts/a4high/maxtext-training/templates/maxtext-launcher-job.yaml‎ renamed to ‎src/helm-charts/a4/maxtext-training/templates/maxtext-launcher-job.yaml‎ b/‎src/helm-charts/a4high/maxtext-training/templates/maxtext-launcher-job.yaml‎ renamed to ‎src/helm-charts/a4/maxtext-training/templates/maxtext-launcher-job.yaml‎
diff --git a/‎src/helm-charts/a4high/maxtext-training/templates/maxtext-launcher-svc.yaml‎ renamed to ‎src/helm-charts/a4/maxtext-training/templates/maxtext-launcher-svc.yaml‎ b/‎src/helm-charts/a4high/maxtext-training/templates/maxtext-launcher-svc.yaml‎ renamed to ‎src/helm-charts/a4/maxtext-training/templates/maxtext-launcher-svc.yaml‎
diff --git a/‎src/helm-charts/a4high/nemo-training/Chart.yaml‎ renamed to ‎src/helm-charts/a4/nemo-training/Chart.yaml‎ b/‎src/helm-charts/a4high/nemo-training/Chart.yaml‎ renamed to ‎src/helm-charts/a4/nemo-training/Chart.yaml‎
@@ -35,12 +35,12 @@ Models             | GPU Machine Type
 **Llama-3.1-405B** | [A3 Ultra (NVIDIA H200)](https://cloud.google.com/compute/docs/accelerator-optimized-machines#a3-ultra-vms) | NeMo.     | Pre-training  | GKE          | [Link](./training/a3ultra/llama3-1-405b/nemo-pretraining-gke/README.md)
 **Mixtral-8-7B**   | [A3 Ultra (NVIDIA H200)](https://cloud.google.com/compute/docs/accelerator-optimized-machines#a3-ultra-vms) | NeMo      | Pre-training  | GKE          | [Link](./training/a3ultra/mixtral-8x7b/nemo-pretraining-gke/README.md)
 
-### Training benchmarks A4 High
+### Training benchmarks A4
 
-Models             | GPU Machine Type                                                                                            | Framework | Workload Type | Orchestrator | Link to the recipe
------------------- | ----------------------------------------------------------------------------------------------------------- | --------- | ------------- | ------------ | ------------------
-**Llama-3.1-405B** | [A4 High (NVIDIA B200)](https://cloud.google.com/compute/docs/accelerator-optimized-machines#a4-high-vms)   | MaxText   | Pre-training  | GKE          | [Link](./training/a4high/llama3-1-405b/maxtext-pretraining-gke/README.md)
-**Llama-3.1-405B** | [A4 High (NVIDIA B200)](https://cloud.google.com/compute/docs/accelerator-optimized-machines#a4-high-vms)   | NeMo      | Pre-training  | GKE          | [Link](./training/a4high/llama3-1-405b/nemo-pretraining-gke/README.md)
+Models             | GPU Machine Type                                                                                     | Framework | Workload Type | Orchestrator | Link to the recipe
+------------------ | ---------------------------------------------------------------------------------------------------- | --------- | ------------- | ------------ | ------------------
+**Llama-3.1-405B** | [A4 (NVIDIA B200)](https://cloud.google.com/compute/docs/accelerator-optimized-machines#a4-vms) | MaxText   | Pre-training  | GKE          | [Link](./training/a4/llama3-1-405b/maxtext-pretraining-gke/README.md)
+**Llama-3.1-405B** | [A4 (NVIDIA B200)](https://cloud.google.com/compute/docs/accelerator-optimized-machines#a4-vms) | NeMo      | Pre-training  | GKE          | [Link](./training/a4/llama3-1-405b/nemo-pretraining-gke/README.md)
 
 ### Inference benchmarks A3 Mega
 
 
@@ -1,6 +1,6 @@
-# Configuring the environment for running benchmark recipes on a GKE Cluster with A4 High Node Pools
+# Configuring the environment for running benchmark recipes on a GKE Cluster with A4 Node Pools
 
-This [guide](https://cloud.google.com/ai-hypercomputer/docs/create/gke-ai-hypercompute) outlines the steps to configure the environment required to run benchmark recipes on a [Google Kubernetes Engine (GKE) cluster](https://cloud.google.com/kubernetes-engine/docs/concepts/kubernetes-engine-overview) with [A4 High](https://cloud.google.com/compute/docs/accelerator-optimized-machines#a4-vms) node pools.
+This [guide](https://cloud.google.com/ai-hypercomputer/docs/create/gke-ai-hypercompute) outlines the steps to configure the environment required to run benchmark recipes on a [Google Kubernetes Engine (GKE) cluster](https://cloud.google.com/kubernetes-engine/docs/concepts/kubernetes-engine-overview) with [A4](https://cloud.google.com/compute/docs/accelerator-optimized-machines#a4-vms) node pools.
 
 ## Prerequisites
 
@@ -26,7 +26,7 @@ Before you begin, ensure you have completed the following:
 
 ## Reserve capacity
 
-To ensure that your workloads have the A4 High GPU resources required for these
+To ensure that your workloads have the A4 GPU resources required for these
 instructions, you can create a [future reservation request](https://cloud.google.com/compute/docs/instances/future-reservations-overview).
 With this request, you can reserve blocks of capacity for a defined duration in the
 future. At that date and time in the future, Compute Engine automatically
@@ -77,7 +77,7 @@ The environment comprises of the following components:
 - [Artifact Registry](https://cloud.google.com/artifact-registry/docs/overview): serves as a
   private container registry for storing and managing Docker images used in the deployment.
 - [Google Kubernetes Engine (GKE)](https://cloud.google.com/kubernetes-engine/docs/concepts/kubernetes-engine-overview)
-  Cluster with A4 High Node Pools: provides a managed Kubernetes environment to run benchmark
+  Cluster with A4 Node Pools: provides a managed Kubernetes environment to run benchmark
   recipes.
 
 ## Set up the client workstation
@@ -150,16 +150,16 @@ Replace the following:
      repository descriptions are not encrypted.
 
 
-## Create a GKE Cluster with A4 High Node Pools
+## Create a GKE Cluster with A4 Node Pools
 
 Follow [this guide]() for
-detailed instructions to create a GKE cluster with A4 High node pools and required GPU driver versions.
+detailed instructions to create a GKE cluster with A4 node pools and required GPU driver versions.
 
 The documentation uses [ Cluster Toolkit](https://cloud.google.com/cluster-toolkit/docs/overview) to create your GKE cluster quickly while incorporating best practices:
 
 - Creation of the necessary VPC networks and subnets.
 - Creation of a GKE cluster with multi-networking enabled.
-- Creation of an A4 High node pool with NVIDIA B200 GPUs.
+- Creation of an A4 node pool with NVIDIA B200 GPUs.
 - Installation of the required components for GPUDirect-RDMA and NCCL plugin.
 
 1.  [Launch Cloud Shell](https://cloud.google.com/shell/docs/launching-cloud-shell). You can use a
@@ -205,13 +205,13 @@ The documentation uses [ Cluster Toolkit](https://cloud.google.com/cluster-toolk
       previous step to store the state of Terraform deployment.
    * `PROJECT_ID`: your Google Cloud project ID.
    * `COMPUTE_REGION`: the compute region for the cluster.
-   * `COMPUTE_ZONE`: the compute zone for the node pool of A4 High machines.
+   * `COMPUTE_ZONE`: the compute zone for the node pool of A4 machines.
    * `IP_ADDRESS/SUFFIX`: The IP address range that you want to allow to
       connect with the cluster. This CIDR block must include the IP address of
       the machine to call Terraform.
    * `RESERVATION_NAME`: the name of your reservation.
    * `BLOCK_NAME`: the name of a specific block within the reservation.
-   * `NODE_COUNT`: the number of A4 High nodes in your cluster.
+   * `NODE_COUNT`: the number of A4 nodes in your cluster.
 
   To modify advanced settings, edit
   `examples/gke-a4-highgpu/gke-a4-highgpu.yaml`.
@@ -220,7 +220,7 @@ The documentation uses [ Cluster Toolkit](https://cloud.google.com/cluster-toolk
    to provide access to Terraform.
 
 1.  Deploy the blueprint to provision the GKE infrastructure
-    using A4 High machine types:
+    using A4 machine types:
 
    ```sh
    cd ~/cluster-toolkit
@@ -242,7 +242,7 @@ VPC networks and GKE cluster:
 
 ## What's next
 
-Once you have set up your GKE cluster with A4 High node pools, you can proceed to deploy and
+Once you have set up your GKE cluster with A4 node pools, you can proceed to deploy and
 run your [benchmark recipes](../README.md#benchmarks-support-matrix).
 
 ## Get Help