You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
fix: build vLLM from source for ARM64 CUDA 13 (NVIDIA DGX)
The prebuilt vLLM ARM64 wheels have ABI incompatibility with PyTorch CUDA 13 nightly builds. For ARM64 with CUDA 13 (e.g., NVIDIA DGX GB300 Blackwell, DGX GB200):
- Install CUDA toolkit 13.0 for compilation
- Use PyTorch nightly with cu130 support
- Build vLLM from source to ensure ABI compatibility
Add VLLM_ARM64_BUILD_FROM_SOURCE build arg (default: true) to allow opting out of source builds for faster build times on non-CUDA 13 systems.
Also:
- Update AMD64 wheel path to manylinux_2_35 (required for cu130)
- Bump vLLM to 0.15.1
Signed-off-by: Dorin Geman <dorin.geman@docker.com>
0 commit comments