House of Zen promises 3.5x improvement in inference and 3x uplift in training perf over last-gen software
theregister.co.ukAMD closed the performance gap with Nvidia's Blackwell accelerators with the launch of the MI355X this spring. Now the company just needs to overcome Nvidia's CUDA software advantage and make that perf more accessible to developers.
The release of AMD's ROCm 7.0 software platform this week is a step in that direction, promising major improvements in inference and training performance that benefit not only its latest chips but its older MI300-series parts as well. The so-called CUDA moat could be getting narrower.
ROCm, if you're not familiar, is a suite of software libraries and development tools, including HIP frameworks, that provides developers a low-level programming interface for running high-performance computing (HPC) and AI workloads on GPUs. The software stack is reminiscent in many ways of the CUDA runtime, but for AMD GPUs rather than Nvidia.
Since the launch of the MI300X, its first truly AI-optimized ...
Copyright of this story solely belongs to theregister.co.uk . To see the full text click HERE