This expertise fixes the most important downside with fashionable GPUs

GPU storage expansion

In an attention-grabbing growth for the GPU business, PCIe-attached reminiscence is ready to alter how we take into consideration GPU reminiscence capability and efficiency. Panmnesia, an organization backed by South Korea’s KAIST analysis institute, is engaged on a expertise referred to as Compute Specific Hyperlink, or CXL, that enables GPUs to make the most of exterior reminiscence assets through the PCIe interface.

Historically, GPUs just like the RTX 4060 are restricted by their onboard VRAM, which might bottleneck efficiency in memory-intensive duties equivalent to AI coaching, information analytics, and high-resolution gaming. CXL leverages the high-speed PCIe connection to connect exterior reminiscence modules on to the GPU.

This methodology offers a low-latency reminiscence growth choice, with efficiency metrics exhibiting important enhancements over conventional strategies. In accordance with reviews, the brand new expertise manages to attain double-digit nanosecond latency, which is a considerable discount in comparison with customary SSD-based options.


Furthermore, this expertise isn’t restricted to only conventional RAM. SSDs may also be used to increase GPU reminiscence, providing a flexible and scalable answer. This functionality permits for the creation of hybrid reminiscence programs that mix the pace of RAM with the capability of SSDs, additional enhancing efficiency and effectivity.

Get your weekly teardown of the tech behind PC gaming

Whereas CXL operates on a PCIe hyperlink, integrating this expertise with GPUs isn’t easy. GPUs lack the mandatory CXL logic material and subsystems to help DRAM or SSD endpoints. Subsequently, merely including a CXL controller is just not possible.

GPU cache and reminiscence programs solely acknowledge expansions by way of Unified Digital Reminiscence (UVM). Nonetheless, assessments performed by Panmnesia revealed that UVM had the poorest efficiency amongst examined GPU kernels resulting from overhead from host runtime intervention throughout web page faults and inefficient information transfers on the web page stage.

To deal with the problem, Panmnesia developed a sequence of {hardware} layers that help all key CXL protocols, consolidated right into a unified controller. This CXL 3.1-compliant root advanced contains a number of root ports for exterior reminiscence over PCIe and a bunch bridge with a host-managed machine reminiscence decoder. This decoder connects to the GPU’s system bus and manages the system reminiscence, offering direct entry to expanded storage through load/retailer directions, successfully eliminating UVM’s points.

The implications of this expertise are far-reaching. For AI and machine studying, the power so as to add extra reminiscence means dealing with bigger datasets extra effectively, accelerating coaching occasions, and bettering mannequin accuracy. In gaming, builders can push the boundaries of graphical constancy and complexity with out being constrained by VRAM limitations.

For information facilities and cloud computing environments, Panmnesia’s CXL expertise offers an economical approach to improve present infrastructure. By attaching extra reminiscence by way of PCIe, information facilities can improve their computational energy with out requiring intensive {hardware} overhauls.

Regardless of its potential, Panmnesia faces a giant problem in gaining industrywide adoption. The most effective graphics playing cards from AMD and Nvidia don’t help CLX, and so they could by no means help it. There’s additionally a excessive chance that business gamers may develop their very own PCIe-attached reminiscence applied sciences for GPUs. Nonetheless, Panmnesia’s innovation represents a step ahead in addressing GPU reminiscence bottlenecks, with the potential to affect high-performance computing and gaming considerably.

Leave a Comment