Supermicro NVIDIA Blackwell Systems Demonstrate Linear Scalability for MLPerf Training v6.0

In the latest MLPerf Training v6.0 benchmarking round, Supermicro demonstrated linear scaling of training performance on three different models, scaling from 8 to 72 B300 GPUs. Additional runs on HGX B200 systems showed that B300 delivers higher training performance than B200.

The following systems with NVIDIA Blackwell GPUs were used in the benchmarking effort.

| Supermicro System | Cooling | # nodes | CPU SKU | # CPU | GPU SKU |
| --- | --- | --- | --- | --- | --- |
| SRS-GB300-NVL72-M1 (18x ARS-121GL-NB3) | Liquid Cooling | 18 | Neoverse-V2 (Grace) | 36 | NVIDIA Blackwell Ultra GPU (GB300) |
| AS-8126GS-NB3RT | Air Cooling | 2 | AMD EPYC 9575F | 4 | NVIDIA Blackwell Ultra GPU (B300-SXM-270GB) |
| AS-4126GS-NB3RT-LCC | Liquid Cooling | 1 | AMD EPYC 9575F | 2 | NVIDIA Blackwell Ultra GPU (B300-SXM-270GB) |
| SYS-222GS-NB3OT-ALC | Advanced Liquid Cooling | 1 | Intel(R) Xeon(R) 6776P | 2 | NVIDIA Blackwell Ultra GPU (B300-SXM-270GB) |
| SYS-422GS-NB3RT-LCC | Liquid Cooling | 1 | Intel(R) Xeon(R) 6776P | 2 | NVIDIA Blackwell Ultra GPU (B300-SXM-270GB) |
| AS-4126GS-NBR-LCC | Liquid Cooling | 1 | AMD EPYC 9965 | 2 | NVIDIA Blackwell GPU (B200-SXM-180GB) |
| SYS-A22GA-NBRT | Air Cooling | 1 | Intel(R) Xeon(R) 6979P | 2 | NVIDIA Blackwell GPU (B200-SXM-180GB) |

In the Llama 3.1 8B benchmark, two HGX B300 systems performed even better than the linear scaling line from HGX B300 to NVL72, due to the optimal use of networking for this small model. In the larger Llama 3.1 70B LoRA benchmark, the two HGX B300 systems follow the linear scaling from HGX B300 to NVL72.

Overall, Supermicro demonstrates the linear scaling capability of the NVLink-based NVIDIA Blackwell GPU architecture, with performance that scales linearly from 8 B300 GPUs (HGX B300) to 72 B300 GPUs (NVL72).
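For readers who want to check a scaling claim like this against published time-to-train numbers, the usual comparison is measured speedup versus ideal speedup. The sketch below is only an illustration: the function name `scaling_efficiency` and the example times are hypothetical placeholders, not Supermicro's MLPerf results. An efficiency of 1.0 corresponds to linear scaling, and a value above 1.0 to better-than-linear scaling.

```python
def scaling_efficiency(base_gpus, base_minutes, scaled_gpus, scaled_minutes):
    """Return (speedup, ideal_speedup, efficiency) for two time-to-train runs.

    speedup       = base time / scaled time
    ideal_speedup = scaled GPU count / base GPU count
    efficiency    = speedup / ideal_speedup (1.0 means perfectly linear)
    """
    if min(base_gpus, scaled_gpus) <= 0 or min(base_minutes, scaled_minutes) <= 0:
        raise ValueError("GPU counts and times must be positive")
    speedup = base_minutes / scaled_minutes
    ideal_speedup = scaled_gpus / base_gpus
    return speedup, ideal_speedup, speedup / ideal_speedup


# Hypothetical placeholder times in minutes, NOT measured MLPerf results.
speedup, ideal, efficiency = scaling_efficiency(
    base_gpus=8, base_minutes=90.0, scaled_gpus=72, scaled_minutes=10.0
)
print(f"speedup {speedup:.2f}x, ideal {ideal:.2f}x, efficiency {efficiency:.0%}")
```

To apply it, substitute the time-to-train values from the official MLPerf results for the 8-GPU and 72-GPU submissions of the same benchmark.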
