@@ -21,26 +21,26 @@ Welcome to the reproducible benchmark recipes repository for GPUs! This reposito
 Models | GPU Machine Type | Framework | Workload Type | Orchestrator | Link to the recipe
 ----------------- | --------------------------------------------------------------------------------------------------------- | --------- | ------------- | ------------ | ------------------
 **GPT3-175B** | [A3 Mega (NVIDIA H100)](https://cloud.google.com/compute/docs/accelerator-optimized-machines#a3-mega-vms) | NeMo | Pre-training | GKE | [Link](./training/a3mega/gpt3-175b/nemo-pretraining-gke/README.md)
-**Llama-3-70B** | [A3 Mega (NVIDIA H100)](https://cloud.google.com/compute/docs/accelerator-optimized-machines#a3-mega-vms) | NeMo | Pre-training | GKE | [Link](./training/a3mega/llama-3-70b/nemo-pretraining-gke/README.md)
-**Llama-3.1-70B** | [A3 Mega (NVIDIA H100)](https://cloud.google.com/compute/docs/accelerator-optimized-machines#a3-mega-vms) | NeMo | Pre-training | GKE | [Link](./training/a3mega/llama-3.1-70b/nemo-pretraining-gke/README.md)
+**Llama-3-70B** | [A3 Mega (NVIDIA H100)](https://cloud.google.com/compute/docs/accelerator-optimized-machines#a3-mega-vms) | NeMo | Pre-training | GKE | [Link](./training/a3mega/llama3-70b/nemo-pretraining-gke/README.md)
+**Llama-3.1-70B** | [A3 Mega (NVIDIA H100)](https://cloud.google.com/compute/docs/accelerator-optimized-machines#a3-mega-vms) | NeMo | Pre-training | GKE | [Link](./training/a3mega/llama3-1-70b/nemo-pretraining-gke/README.md)
 **Mixtral-8-7B** | [A3 Mega (NVIDIA H100)](https://cloud.google.com/compute/docs/accelerator-optimized-machines#a3-mega-vms) | NeMo | Pre-training | GKE | [Link](./training/a3mega/mixtral-8x7b/nemo-pretraining-gke/README.md)

 ### Training benchmarks A3 Ultra

 Models | GPU Machine Type | Framework | Workload Type | Orchestrator | Link to the recipe
 ------------------ | ----------------------------------------------------------------------------------------------------------- | --------- | ------------- | ------------ | ------------------
-**Llama-3.1-70B** | [A3 Ultra (NVIDIA H200)](https://cloud.google.com/compute/docs/accelerator-optimized-machines#a3-ultra-vms) | MaxText | Pre-training | GKE | [Link](./training/a3ultra/llama-3.1-70b/maxtext-pretraining-gke/README.md)
-**Llama-3.1-70B** | [A3 Ultra (NVIDIA H200)](https://cloud.google.com/compute/docs/accelerator-optimized-machines#a3-ultra-vms) | NeMo | Pre-training | GKE | [Link](./training/a3ultra/llama-3.1-70b/nemo-pretraining-gke/README.md)
-**Llama-3.1-405B** | [A3 Ultra (NVIDIA H200)](https://cloud.google.com/compute/docs/accelerator-optimized-machines#a3-ultra-vms) | MaxText | Pre-training | GKE | [Link](./training/a3ultra/llama-3.1-405b/maxtext-pretraining-gke/README.md)
-**Llama-3.1-405B** | [A3 Ultra (NVIDIA H200)](https://cloud.google.com/compute/docs/accelerator-optimized-machines#a3-ultra-vms) | NeMo. | Pre-training | GKE | [Link](./training/a3ultra/llama-3.1-405b/nemo-pretraining-gke/README.md)
+**Llama-3.1-70B** | [A3 Ultra (NVIDIA H200)](https://cloud.google.com/compute/docs/accelerator-optimized-machines#a3-ultra-vms) | MaxText | Pre-training | GKE | [Link](./training/a3ultra/llama3-1-70b/maxtext-pretraining-gke/README.md)
+**Llama-3.1-70B** | [A3 Ultra (NVIDIA H200)](https://cloud.google.com/compute/docs/accelerator-optimized-machines#a3-ultra-vms) | NeMo | Pre-training | GKE | [Link](./training/a3ultra/llama3-1-70b/nemo-pretraining-gke/README.md)
+**Llama-3.1-405B** | [A3 Ultra (NVIDIA H200)](https://cloud.google.com/compute/docs/accelerator-optimized-machines#a3-ultra-vms) | MaxText | Pre-training | GKE | [Link](./training/a3ultra/llama3-1-405b/maxtext-pretraining-gke/README.md)
+**Llama-3.1-405B** | [A3 Ultra (NVIDIA H200)](https://cloud.google.com/compute/docs/accelerator-optimized-machines#a3-ultra-vms) | NeMo. | Pre-training | GKE | [Link](./training/a3ultra/llama3-1-405b/nemo-pretraining-gke/README.md)
 **Mixtral-8-7B** | [A3 Ultra (NVIDIA H200)](https://cloud.google.com/compute/docs/accelerator-optimized-machines#a3-ultra-vms) | NeMo | Pre-training | GKE | [Link](./training/a3ultra/mixtral-8x7b/nemo-pretraining-gke/README.md)

 ### Training benchmarks A4 High

 Models | GPU Machine Type | Framework | Workload Type | Orchestrator | Link to the recipe
 ------------------ | ----------------------------------------------------------------------------------------------------------- | --------- | ------------- | ------------ | ------------------
-**Llama-3.1-405B** | [A4 High (NVIDIA B200)](https://cloud.google.com/compute/docs/accelerator-optimized-machines#a4-high-vms) | MaxText | Pre-training | GKE | [Link](./training/a4high/llama-3.1-405b/maxtext-pretraining-gke/README.md)
-**Llama-3.1-405B** | [A4 High (NVIDIA B200)](https://cloud.google.com/compute/docs/accelerator-optimized-machines#a4-high-vms) | NeMo | Pre-training | GKE | [Link](./training/a4high/llama-3.1-405b/nemo-pretraining-gke/README.md)
+**Llama-3.1-405B** | [A4 High (NVIDIA B200)](https://cloud.google.com/compute/docs/accelerator-optimized-machines#a4-high-vms) | MaxText | Pre-training | GKE | [Link](./training/a4high/llama3-1-405b/maxtext-pretraining-gke/README.md)
+**Llama-3.1-405B** | [A4 High (NVIDIA B200)](https://cloud.google.com/compute/docs/accelerator-optimized-machines#a4-high-vms) | NeMo | Pre-training | GKE | [Link](./training/a4high/llama3-1-405b/nemo-pretraining-gke/README.md)

 ### Inference benchmarks A3 Mega
