Skip to content

feat(turbomind): integrate cublasGemmGroupedBatchedEx for Qwen3.5 MoE…

01c1958
Select commit
Loading
Failed to load commit list.
Sign in for the full log view
Open

feat(turbomind): integrate cublasGemmGroupedBatchedEx for Qwen3.5 MoE inference on Blackwell GPUs with memory copy optimizations #4490

feat(turbomind): integrate cublasGemmGroupedBatchedEx for Qwen3.5 MoE…
01c1958
Select commit
Loading
Failed to load commit list.

Annotations

1 warning
cuda-12.4
succeeded Apr 9, 2026 in 25m 57s