Nvidia has launched the Rubin CPX GPU, a specialised accelerator built for massive-context AI models. The chip delivers 30 petaFLOPS of NVFP4 compute on a monolithic die …
The post Nvidia launches GPU for disaggregated inference appeared first on Electronics Weekly.