• [$] A parallel path for GPU restore in CRIU

    From LWN.net@86:200/23 to All on Wed Jun 18 06:40:07 2025

    The fundamental concept of checkpoint/restore is elegant: capture a
    process's state and resurrect it later, perhaps elsewhere. Checkpointing
    meticulously records a process's memory, open files, CPU state, and more
    into a snapshot. Restoration then reconstructs the process from this
    state. This established technique faces new challenges with
    GPU-accelerated applications, where low-latency restoration is crucial
    for fault tolerance, live migration, and fast startups. Recently, the
    restore process for AMD GPUs has been redesigned to eliminate
    substantial bottlenecks.

    https://lwn.net/Articles/1024747/
    --- SBBSecho 3.28-Linux
    * Origin: Palantir * palantirbbs.ddns.net * Pensacola, FL * (86:200/23)