Supermicro NVIDIA Blackwell Systems Demonstrate Linear Scalability for MLPerf Training v6.0


In the latest MLPerf Training v6.0 benchmarking effort, Supermicro demonstrated linear scaling in training performance for three different models, scaling from 8 to 72 B300 GPUs. Additional runs on HGX B200 systems showed that B300 outperforms B200 in training performance.

The following systems with NVIDIA Blackwell GPUs were used in the benchmarking effort.

Supermicro System	Cooling	# Nodes	CPU SKU	# CPUs	GPU SKU
SRS-GB300-NVL72-M1 (18x ARS-121GL-NB3)	Liquid Cooling	18	Neoverse-V2 (Grace)	36	NVIDIA Blackwell Ultra GPU (GB300)
AS-8126GS-NB3RT	Air Cooling	2	AMD EPYC 9575F	4	NVIDIA Blackwell Ultra GPU (B300-SXM-270GB)
AS-4126GS-NB3RT-LCC	Liquid Cooling	1	AMD EPYC 9575F	2	NVIDIA Blackwell Ultra GPU (B300-SXM-270GB)
SYS-222GS-NB3OT-ALC	Advanced Liquid Cooling	1	Intel(R) Xeon(R) 6776P	2	NVIDIA Blackwell Ultra GPU (B300-SXM-270GB)
SYS-422GS-NB3RT-LCC	Liquid Cooling	1	Intel(R) Xeon(R) 6776P	2	NVIDIA Blackwell Ultra GPU (B300-SXM-270GB)
AS-4126GS-NBR-LCC	Liquid Cooling	1	AMD EPYC 9965	2	NVIDIA Blackwell GPU (B200-SXM-180GB)
SYS-A22GA-NBRT	Air Cooling	1	Intel(R) Xeon(R) 6979P	2	NVIDIA Blackwell GPU (B200-SXM-180GB)

In the Llama 3.1 8B benchmark, two HGX-B300 systems performed even better than the linear scaling trend from HGX-B300 to NVL72 would predict, due to the optimal use of networking for this small model. In the larger Llama 3.1 70B LoRA benchmark, the two HGX-B300 systems follow the linear scaling from HGX-B300 to NVL72.

Overall, Supermicro's results demonstrate the scaling capability of the NVLink-based NVIDIA Blackwell GPU architecture: performance scales linearly from 8 B300 GPUs (HGX-B300) to 72 B300 GPUs (NVL72).
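One common way to quantify "linear scaling" is to compare the measured speedup in time-to-train against the ideal speedup implied by the increase in GPU count. The following is a minimal sketch under that assumption, not the method used for the submission. The function name `scaling_efficiency` and the times in the usage lines are hypothetical placeholders, not MLPerf results.

```python
def scaling_efficiency(base_gpus, base_minutes, scaled_gpus, scaled_minutes):
    """Return measured speedup divided by ideal (linear) speedup.

    1.0 means perfectly linear scaling, above 1.0 means better than linear,
    below 1.0 means worse than linear.
    """
    if min(base_gpus, scaled_gpus) <= 0 or min(base_minutes, scaled_minutes) <= 0:
        raise ValueError("GPU counts and times must be positive")
    ideal_speedup = scaled_gpus / base_gpus          # e.g. 72 / 8 = 9x
    measured_speedup = base_minutes / scaled_minutes  # ratio of time-to-train
    return measured_speedup / ideal_speedup


# Hypothetical placeholder times in minutes (NOT measured MLPerf results).
eight_gpu_minutes = 90.0
nvl72_minutes = 10.0

efficiency = scaling_efficiency(8, eight_gpu_minutes, 72, nvl72_minutes)
print(f"Scaling efficiency from 8 to 72 GPUs: {efficiency:.2f}")
```

With this definition, a 16-GPU result that lands above 1.0 relative to the 8-GPU baseline would correspond to the "better than linear" behavior described for the small model.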

