Supermicro, in collaboration with NVIDIA, has achieved breakthrough performance results in STAC-ML™ Markets (Inference) benchmarks, demonstrating the power of combining Supermicro's advanced server architecture with NVIDIA's cutting-edge AI platform.
Compared to the previous FPGA-based record, the NVIDIA GH200 Grace Hopper Superchip in a Supermicro ARS-111GL-NHR server delivered:
These results establish a new benchmark for AI inference performance in financial markets, demonstrating exceptional speed, accuracy, and efficiency in a single, integrated solution.
STAC recently completed an audited benchmark of the Supermicro ARS-111GL-NHR server powered by the NVIDIA GH200 Grace Hopper™ Superchip. This collaboration between Supermicro and NVIDIA delivered exceptional results across all model sizes tested, setting new standards for AI inference performance in financial markets.
The joint solution demonstrated remarkable improvements compared to previous FPGA-based systems:
This achievement showcases the powerful performance of Supermicro's optimized server design and the NVIDIA GH200 Grace Hopper Superchip architecture. The NVIDIA GH200 Grace Hopper™ Superchip combines the NVIDIA Grace™ CPU and NVIDIA Hopper GPU in a single superchip, delivering unprecedented memory bandwidth and energy efficiency for AI Inference workloads.
The combination of the Supermicro ARS-111GL-NHR platform and the NVIDIA GH200 Grace Hopper Superchip represents a significant leap forward for financial institutions deploying AI inference solutions, demonstrating both companies' commitment to delivering the highest-performance solutions for mission-critical applications.
STAC-ML Markets (Inference) is the technology benchmark standard for solutions used to run inference on real-time market data. Designed by quants and technologists from leading financial firms, the benchmarks evaluate latency, throughput, energy efficiency, space efficiency, and algorithm quality across various model configurations.
The Supermicro ARS-111GL-NHR with NVIDIA GH200 Grace Hopper Superchip excels in all these dimensions, making it an ideal platform for:
The Supermicro ARS-111GL-NHR with NVIDIA GH200 Grace Hopper Superchip is available now. For detailed configuration information and to learn more about how this solution can accelerate your AI inference workloads, contact your Supermicro sales representative or visit supermicro.com.
Full STAC benchmark reports are available to STAC Observer members at docs.stacresearch.com/SMC250910.