Skip to content
Closed
Changes from 1 commit
Commits
Show all changes
23 commits
Select commit Hold shift + click to select a range
a8dd326
chore: fix install (#2191)
ishandhanani Jul 30, 2025
79e6711
chore: fix QA bugs in documentation/readmes (#2199)
athreesh Jul 30, 2025
9320d68
fix(sglang): disagg yaml worker change and agg kv router fix (#2205)
ishandhanani Jul 31, 2025
1c9c7d3
chore: cleanup dead links (#2208)
nealvaidya Jul 31, 2025
a6d48bd
chore: Remove multimodal readme. (#2212) (#2234)
krishung5 Jul 31, 2025
44cbf88
fix: drop cuda graph bs (batch size) on dsr1 h100 sgl (#2235)
ishandhanani Aug 1, 2025
a57bade
fix: Locked triton==3.3.1 since triton 3.4.0 breaks tensorrt-llm 1.0.…
dmitry-tokarev-nv Aug 1, 2025
95c8b58
fix: sgl instructions point to new frontend (#2245)
ishandhanani Aug 1, 2025
bfe2808
fix: readme instruction (#2265)
ishandhanani Aug 4, 2025
e2552ed
docs: Backport: Dyn 591 (#2247) to 0.4.0 (#2251)
atchernych Aug 4, 2025
9af0a01
fix: trtllm container - ENV var used before declaration (#2277)
dmitry-tokarev-nv Aug 5, 2025
d60af96
docs: add instruction to deploy model with inference gateway #2257 (#…
biswapanda Aug 5, 2025
c948f1d
fix: fix broken doc links (#2308)
biswapanda Aug 5, 2025
add5fa8
fix: Copy cuda libraries from devel to runtime stage (#2298)
nv-tusharma Aug 5, 2025
f8b95fd
docs: update deploy readme (#2306)
atchernych Aug 5, 2025
3f7c7a7
fix: Add common and test dependencies to sglang runtime build (#2279)…
nv-tusharma Aug 5, 2025
741496e
fix: Backport/anish index rst into 0.4.0 - fix links in docs and more…
athreesh Aug 6, 2025
b4be3c2
docs: Final fixes to links reported by QA (#2334)
athreesh Aug 6, 2025
2ed36b8
docs: address sphinx build errors for docs.nvidia.com (#2346)
athreesh Aug 7, 2025
2846f9e
docs: Address vincent issue with trtllm symlink (#2351)
athreesh Aug 7, 2025
b4a3cb3
Pinned PyTorch version
krishung5 Aug 7, 2025
59a2005
Add model label to Component
tzulingk Aug 8, 2025
6c95b2b
Use ModelDeploymentCard.slug() for model name. ModelDeploymentCard.se…
tzulingk Aug 9, 2025
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
Prev Previous commit
Next Next commit
Pinned PyTorch version
  • Loading branch information
krishung5 authored and tzulingk committed Aug 8, 2025
commit b4a3cb35af6c91484a9751c4346c44387ab83d5d
7 changes: 7 additions & 0 deletions container/deps/vllm/install_vllm.sh
Original file line number Diff line number Diff line change
Expand Up @@ -177,4 +177,11 @@ else
python -m pip install -v .
fi

if [ "$ARCH" = "amd64" ]; then
# NOTE: PyTorch 2.8.0 compatibility issue
# PyTorch 2.8.0 causes "RuntimeError: operator torchvision::nms does not exist" error.
# Temporarily pinning to PyTorch 2.7.1 until this compatibility issue is resolved.
uv pip install torch==2.7.1 --index-url https://download.pytorch.org/whl/cu128
fi

echo "vllm installation completed successfully"