3.10-b Mar 10, 2026 Read through ktransformers. Three things stand out: 1. AMX 2. CUDA Graph 3. expert defer. All three are aimed at maximizing the utilization of the CPU and GPU.