The following systems with NVIDIA Blackwell GPUs were used in the benchmarking effort.
|
Supermicro System |
|
# nodes |
CPU SKU |
# CPU |
GPU SKU |
|
SRS-GB300-NVL72-M1 (18x ARS-121GL-NB3) |
Liquid Cooling |
18 |
Neoverse-V2 (Grace) |
36 |
NVIDIA Blackwell Ultra GPU (GB300) |
|
AS-8126GS-NB3RT |
Air Cooling |
2 |
AMD EPYC 9575F |
4 |
NVIDIA Blackwell Ultra GPU (B300-SXM-270GB) |
|
AS-4126GS-NB3RT-LCC |
Liquid Cooling |
1 |
AMD EPYC 9575F |
2 |
NVIDIA Blackwell Ultra GPU (B300-SXM-270GB) |
|
SYS-222GS-NB3OT-ALC |
Advanced Liquid Cooling |
1 |
Intel(R) Xeon(R) 6776P |
2 |
NVIDIA Blackwell Ultra GPU (B300-SXM-270GB) |
|
SYS-422GS-NB3RT-LCC |
Liquid Cooling |
1 |
Intel(R) Xeon(R) 6776P |
2 |
NVIDIA Blackwell Ultra GPU (B300-SXM-270GB) |
|
AS-4126GS-NBR-LCC |
Liquid Cooling |
1 |
AMD EPYC 9965 |
2 |
NVIDIA Blackwell GPU (B200-SXM-180GB) |
|
SYS-A22GA-NBRT |
Air Cooling |
1 |
Intel(R) Xeon(R) 6979P |
2 |
NVIDIA Blackwell GPU (B200-SXM-180GB) |
In the LLAMA 3.1 8b benchmark, 2 systems of HGX-B300 showed even better performance than the linear scaling for HGX-B300 to NVL72, due to the optimal use of networking for this small model. In the bigger LLAMA 3.1 70b model LORA, the 2 systems of HGX-B300 follow the linear scaling of the HGX-B300 to NVL72.
Overall, Supermicro demonstrates the linear scaling capability of the NVIDIA Blackwell GPU architecture that uses NVLINK with performance that scales linearly from 8 GPU B300 (HGX-B300) to 72 GPU B300 (NVL72).