Skip to content

Commit c19dde2

Browse files
Merge pull request #332 from Exabyte-io/chore/SOF-7666
chore/SOF 7666 - Migration to New Platform, New Infrastructure Specs
2 parents 841b1e3 + ec56a31 commit c19dde2

File tree

13 files changed

+429
-152
lines changed

13 files changed

+429
-152
lines changed

lang/en/docs/infrastructure/clusters/aws.md

Lines changed: 36 additions & 50 deletions
Original file line numberDiff line numberDiff line change
@@ -4,69 +4,55 @@ This page contains information about clusters hosted on Amazon Web Services[^1]
44

55
## Clusters
66

7-
The following table provides information about available clusters on Amazon Web Services (AWS) cloud computing platform. The latest cluster status can be found on <a href="https://platform.mat3ra.com/clusters" target="_blank">Clusters</a> page in web application.
7+
The following table provides information about available clusters on Amazon Web Services (AWS) cloud computing platform.
8+
The latest cluster status can be found on <a href="https://platform.mat3ra.com/clusters" target="_blank">Clusters</a>
9+
page in web application.
810

9-
| Name | Master Hostname | Location |
10-
| :---: | :---: | :---: |
11-
| cluster-001 | master-production-20160630-cluster-001.exabyte.io | West US |
11+
| Name | Master Hostname | Location |
12+
|:-------------:|:---------------------------------------------------:|:--------:|
13+
| `cluster-002` | `master-production-20250821-cluster-001.mat3ra.com` | West US |
1214

1315
## Queues
1416

15-
The list of currently enabled queues is given below. Price per core hour is shown in relation to the [relative unit price](../../pricing/service-levels.md#comparison-table) and is subject to change at any time. Total number of nodes can be increased upon [request](../../ui/support.md).
16-
17-
| Name | Category[^2] | Mode[^3] | Charge Policy[^4] | Price | Max Nodes per Job<sup>+</sup> | Max Nodes Total |
18-
| :---: | :---: | :---: | :---: | :---: | :---: | :---: |
19-
| D | debug | debug | core-seconds | 2.251 | 1 | 10 |
20-
| OR | ordinary | regular | core-seconds | 1.000 | 1 | 10 |
21-
| OR4 | ordinary | regular | core-seconds | 1.126 | 1 | 20 |
22-
| OR8 | ordinary | regular | core-seconds | 1.126 | 1 | 20 |
23-
| OR16 | ordinary | regular | core-seconds | 1.126 | 1 | 20 |
24-
| OF | ordinary | fast | core-hours | 1.000 | &le;5 | 100 |
25-
| OFplus| ordinary | fast | core-hours | 0.962 | &le;5 | 10 |
26-
| SR | saving | regular | core-seconds | 0.200 | 1 | 10 |
27-
| SR4 | saving | regular | core-seconds | 0.225 | 1 | 20 |
28-
| SR8 | saving | regular | core-seconds | 0.225 | 1 | 20 |
29-
| SR16 | saving | regular | core-seconds | 0.225 | 1 | 20 |
30-
| SF | saving | fast | core-hours | 0.200 | &le;5 | 100 |
31-
| SFplus| saving | fast | core-hours | 0.379 | &le;5 | 10 |
32-
| GOF | ordinary | fast | core-hours | 8.655 | &le;5 | 10 |
33-
| G4OF | ordinary | fast | core-hours | 8.655 | &le;5 | 10 |
34-
| G8OF | ordinary | fast | core-hours | 8.655 | &le;5 | 10 |
35-
| GSF | saving | fast | core-hours | 3.370 | &le;5 | 10 |
36-
| G4SF | saving | fast | core-hours | 4.158 | &le;5 | 10 |
37-
| G8SF | saving | fast | core-hours | 4.335 | &le;5 | 10 |
17+
The list of currently enabled queues is given below. Price per core hour is shown in relation to
18+
the [relative unit price](../../pricing/service-levels.md#comparison-table) and is subject to change at any time. Total
19+
number of nodes can be increased upon [request](../../ui/support.md).
20+
21+
| Name | Category[^2] | Mode[^3] | Charge Policy[^4] | Price | Max Nodes per Job<sup>+</sup> | Max Nodes Total |
22+
|:------:|:------------:|:--------:|:-----------------:|:-----:|:-----------------------------:|:---------------:|
23+
| D | debug | debug | core-seconds | 2.251 | 1 | 10 |
24+
| OR | ordinary | regular | core-seconds | 1.000 | 1 | 10 |
25+
| OF | ordinary | fast | core-hours | 1.000 | 10 | 100 |
26+
| OFplus | ordinary | fast | core-hours | 0.962 | 5 | 10 |
27+
| SR | saving | regular | core-seconds | 0.200 | 1 | 10 |
28+
| SF | saving | fast | core-hours | 0.200 | 10 | 100 |
29+
| SFplus | saving | fast | core-hours | 0.379 | 5 | 10 |
30+
| GOF | ordinary | fast | core-hours | 8.655 | 5 | 10 |
31+
| GSF | saving | fast | core-hours | 1.731 | 5 | 10 |
32+
| G4OF | ordinary | fast | core-hours | 8.655 | 5 | 10 |
3833

3934
<sup>+</sup> please contact support to inquire about attempting a larger node count per job
4035

4136
## Hardware Specifications
4237

4338
The following table contains hardware specifications for the above queues.
4439

45-
| Name | CPU[^5] | Cores per Node | GPU[^6] | GPU per Node | Memory (GB) | Bandwidth (Gbps) |
46-
| :---: | :---: | :---: | :---: | :---: | :---: | :---: |
47-
| D | c-3 | 8 | - | - | 15 | &le;10 |
48-
| OR | c-3 | 36 | - | - | 60 | &le;10 |
49-
| OR4 | c-3 | 4 | - | - | 7.5 | &le;10 |
50-
| OR8 | c-3 | 8 | - | - | 15 | &le;10 |
51-
| OR16 | c-3 | 16 | - | - | 30 | &le;10 |
52-
| OF | c-3 | 36 | - | - | 60 | 10 |
53-
| OFplus| c-5 | 72 | - | - | 144 | 25 |
54-
| SR | c-3 | 36 | - | - | 60 | 10 |
55-
| SR4 | c-3 | 4 | - | - | 7.5 | &le;10 |
56-
| SR8 | c-3 | 8 | - | - | 15 | &le;10 |
57-
| SR16 | c-3 | 16 | - | - | 30 | &le;10 |
58-
| SF | c-3 | 36 | - | - | 60 | 10 |
59-
| SFplus| c-5 | 72 | - | - | 144 | 25 |
60-
| GOF | c-4 | 8 | g-1 | 1 | 61 | 10 |
61-
| G4OF | c-4 | 32 | g-1 | 4 | 244 | 10 |
62-
| G8OF | c-4 | 64 | g-1 | 8 | 488 | 25 |
63-
| GSF | c-4 | 8 | g-1 | 1 | 61 | 10 |
64-
| G4SF | c-4 | 32 | g-1 | 4 | 244 | 10 |
65-
| G8SF | c-4 | 64 | g-1 | 8 | 488 | 25 |
66-
40+
| Name | CPU[^5] | Cores per Node | GPU[^6] | GPU per Node | Memory (GB) | Bandwidth (Gbps) | Instance Type |
41+
|:------:|:-------:|:--------------:|:-------:|:------------:|:-----------:|:----------------:|:-------------------:|
42+
| D | c-3 | 4 | - | - | 15 | &le;10 | c4.2xlarge |
43+
| OR | c-3 | 36 | - | - | 60 | &le;10 | c4.8xlarge |
44+
| OF | c-3 | 36 | - | - | 60 | 10 | c4.8xlarge |
45+
| OFplus | c-5 | 72 | - | - | 192 | 100 | c5n.18xlarge |
46+
| SR | c-3 | 36 | - | - | 60 | 10 | c4.8xlarge |
47+
| SF | c-3 | 36 | - | - | 60 | 10 | c4.8xlarge |
48+
| SFplus | c-5 | 72 | - | - | 192 | 100 | c5n.18xlarge |
49+
| GOF | c-8 | 8 | g-3 | 8 | 1152 | 400 | p4d.24xlarge |
50+
| GSF | c-8 | 8 | g-3 | 8 | 1152 | 400 | p4d.24xlarge |
51+
| G4OF | c-4 | 32 | g-4 | 1 | 256 | 10 | p5.4xlarge |
6752

6853
!!! note "Hyper-threading"
69-
Hyper-threading[^7] is enabled on all AWS compute nodes by default. It is recommended to use half of available cores on each compute node (e.g 18 cores on OF queue) if the application does not benefit from the extra virtual cores.
54+
Hyper-threading[^7] is enabled on all AWS compute nodes by default. It is recommended to use half of available cores on
55+
each compute node (e.g. 18 cores on OF queue) if the application does not benefit from the extra virtual cores.
7056

7157
## Links
7258

lang/en/docs/infrastructure/clusters/azure.md

Lines changed: 35 additions & 46 deletions
Original file line numberDiff line numberDiff line change
@@ -4,70 +4,59 @@ This page contains information about clusters hosted on Microsoft Azure[^1] and
44

55
## Clusters
66

7-
The following table provides information about available clusters on Microsoft Azure cloud computing platform. The latest cluster status can be found on <a href="https://platform.mat3ra.com/clusters" target="_blank">Clusters</a> page in web application.
7+
The following table provides information about available clusters on Microsoft Azure cloud computing platform. The
8+
latest cluster status can be found on <a href="https://platform.mat3ra.com/clusters" target="_blank">Clusters</a> page
9+
in web application.
810

9-
| Name | Hostname | Location |
10-
| :---: | :---: | :---: |
11-
| cluster-007 | master-production-20160630-cluster-007.exabyte.io | East US |
11+
| Name | Hostname | Location |
12+
|:-----------:|:-------------------------------------------------:|:--------:|
13+
| cluster-003 | master-production-20250821-cluster-003.mat3ra.com | East US |
1214

1315
## Queues
1416

15-
The list of currently enabled queues is given below. Price per core hour is shown in relation to the [relative unit price](../../pricing/service-levels.md#comparison-table) and is subject to change at any time. Total number of nodes can be increased upon [request](../../ui/support.md).
16-
17-
| Name | Category[^2] | Mode[^3] | Charge Policy[^4] | Price | Max Nodes per Job<sup>+</sup> | Max Nodes Total |
18-
| :---: | :---: | :---: | :---: | :---: | :---: | :---: |
19-
| D | debug | debug | core-seconds | 4.002 | 1 | 10 |
20-
| OR | ordinary | regular | core-seconds | 1.275 | 1 | 10 |
21-
| OF | ordinary | fast | core-hours | 1.275 | &le;5 | 100 |
22-
| OFplus| ordinary | fast | core-hours | 1.275 | 5 | 10 |
23-
| SR | saving | regular | core-seconds | 0.379 | 1 | 10 |
24-
| SF | saving | fast | core-hours | 0.379 | 1<sup>*</sup> | 100 |
25-
| SFplus | saving | fast | core-hours | 0.379 | 5 | 10 |
26-
| GPOF | ordinary | fast | core-hours | 6.110 | &le;5 | 10 |
27-
| GP2OF | ordinary | fast | core-hours | 6.110 | &le;5 | 10 |
28-
| GP4OF | ordinary | fast | core-hours | 6.110 | &le;5 | 10 |
29-
| GPSF | saving | fast | core-hours | 1.222 | &le;5 | 10 |
30-
| GP2SF | saving | fast | core-hours | 1.222 | &le;5 | 10 |
31-
| GP4SF | saving | fast | core-hours | 1.222 | &le;5 | 10 |
17+
The list of currently enabled queues is given below. Price per core hour is shown in relation to
18+
the [relative unit price](../../pricing/service-levels.md#comparison-table) and is subject to change at any time. Total
19+
number of nodes can be increased upon [request](../../ui/support.md).
3220

33-
<sup>+</sup> please contact support to inquire about attempting a larger node count per job
34-
35-
<sup>*</sup> presently the infrastructure limitations are not allowing for the multi-node communication in SF queue, so only single-node jobs should be attempted (as of Oct 2022)
21+
| Name | Category[^2] | Mode[^3] | Charge Policy[^4] | Price | Max Nodes per Job<sup>+</sup> | Max Nodes Total |
22+
|:------:|:------------:|:--------:|:-----------------:|:-----:|:-----------------------------:|:---------------:|
23+
| D | debug | ordinary | core-seconds | 4.002 | 1 | 10 |
24+
| OR | regular | ordinary | core-seconds | 1.275 | 1 | 10 |
25+
| SR | regular | saving | core-seconds | 0.379 | 1 | 10 |
26+
| OF | fast | ordinary | core-hours | 1.275 | 5 | 100 |
27+
| SF | fast | saving | core-hours | 0.379 | 5 | 100 |
28+
| GPOF | fast | ordinary | core-hours | 6.110 | 5 | 10 |
29+
| GPSF | fast | saving | core-hours | 1.222 | 5 | 10 |
3630

31+
<sup>+</sup> please contact support to inquire about attempting a larger node count per job
3732

3833
## Hardware Specifications
3934

40-
The following table contains hardware specifications for the above queues.
41-
42-
| Name | CPU[^5] | Cores per Node | GPU[^6] | GPU per Node | Memory (GB) | Bandwidth (Gb/sec) |
43-
| :---: | :---: | :---: | :---: | :---: | :---: | :---: |
44-
| D | c-7 | 16 | - | - | 32 | &le;10 |
45-
| OR | c-6 | 44 | - | - | 352 | 100 |
46-
| OF | c-6 | 44 | - | - | 352 | 100 |
47-
| OFplus| c-6 | 44 | - | - | 352 | 100 |
48-
| SR | c-6 | 44 | - | - | 352 | 100 |
49-
| SF | c-6 | 44 | - | - | 352 | 100 |
50-
| SFPlus| c-6 | 44 | - | - | 352 | 100 |
51-
| GPOF | c-2 | 6 | g-2 | 1 | 112 | 10 |
52-
| GP2OF | c-2 | 12 | g-2 | 2 | 224 | 10 |
53-
| GP4OF | c-2 | 24 | g-2 | 4 | 448 | 10 |
54-
| GPSF | c-2 | 6 | g-2 | 1 | 112 | 10 |
55-
| GP2SF | c-2 | 12 | g-2 | 2 | 224 | 10 |
56-
| GP4SF | c-2 | 24 | g-2 | 4 | 448 | 10 |
35+
The following table contains hardware specifications for the above queues.
36+
37+
| Name | Cores per Node | GPU per Node | Memory (GB) | Bandwidth (Gb/sec) | VM Size |
38+
|:------:|:--------------:|:------------:|:-----------:|:------------------:|:------------------------:|
39+
| D | 8 | - | 2 | &le;10 | Standard_F8s_v2 |
40+
| OR | 44 | - | 352 | 100 | Standard_HC44rs |
41+
| OF | 44 | - | 352 | 100 | Standard_HC44rs |
42+
| SR | 44 | - | 352 | 100 | Standard_HC44rs |
43+
| SF | 44 | - | 352 | 100 | Standard_HC44rs |
44+
| GPOF | 40 | 1 | 320 | 40 | Standard_NC40ads_H100_v5 |
45+
| GPSF | 40 | 1 | 320 | 40 | Standard_NC40ads_H100_v5 |
5746

5847
## Links
5948

6049
[^1]: [Microsoft Azure, Website](https://azure.microsoft.com/en-us/)
6150

62-
[^2]: [Queue Cost Categories, Website](../resource/category.md#cost-categories)
51+
[^2]: [Queue Cost Categories, this documentation](../resource/category.md#cost-categories)
6352

64-
[^3]: [Queue Provision Modes, Website](../resource/category.md#provision-modes)
53+
[^3]: [Queue Provision Modes, this documentation](../resource/category.md#provision-modes)
6554

66-
[^4]: [Charge polices, Website](../resource/queues.md#charge-policies)
55+
[^4]: [Charge polices, this documentation](../resource/queues.md#charge-policies)
6756

68-
[^5]: [CPU types, Website](hardware.md#cpu-types)
57+
[^5]: [CPU types, this documentation](hardware.md#cpu-types)
6958

70-
[^6]: [GPU types, Website](hardware.md#gpu-types)
59+
[^6]: [GPU types, this documentation](hardware.md#gpu-types)
7160

7261
[^7]: [Azure high performance compute virtual machines, Website](https://docs.microsoft.com/en-us/azure/virtual-machines/linux/sizes-hpc)
7362

0 commit comments

Comments
 (0)