
Conversation

@tanmayv25 (Contributor) commented Aug 21, 2025

Overview:

There are known issues with DeepGEMM on SBSA when VSWA is enabled; upgrading the pinned TensorRT-LLM version should resolve them. Relevant upstream fix:

NVIDIA/TensorRT-LLM@0ff8df9

Summary by CodeRabbit

  • Documentation
    • Removed outdated Multi-Token Prediction guidance and build flags from deployment notes, including DeepSeek R1 and Gemma 3 specifics. Latency and MTP caveats remain.
  • Chores
    • Updated defaults to TensorRT-LLM 1.0.0rc6 for builds and optional dependencies.
    • Refreshed experimental baseline used when building without an explicit wheel/commit, so unattended builds use the newer default.

coderabbitai bot (Contributor) commented Aug 21, 2025

Walkthrough

Documentation references to experimental TensorRT-LLM build requirements for MTP/VSWA were removed across trtllm docs. The container build script updated the default experimental TensorRT-LLM commit and default wheel to 1.0.0rc6. The Python optional dependency for trtllm was bumped to tensorrt-llm==1.0.0rc6.

Changes

  • Docs cleanup: remove MTP/VSWA build notes
    Files: components/backends/trtllm/README.md, components/backends/trtllm/deploy/README.md, components/backends/trtllm/gemma3_sliding_window_attention.md
    Deleted references to the experimental TensorRT-LLM commit, associated build flags/commands, and VSWA compatibility notes. Content about aggregation/serving is otherwise unchanged.
  • Build defaults update
    Files: container/build.sh
    Updated DEFAULT_EXPERIMENTAL_TRTLLM_COMMIT to a16ba64… and DEFAULT_TENSORRTLLM_PIP_WHEEL to tensorrt-llm==1.0.0rc6; affects default selection when no wheel/commit is provided.
  • Dependency bump
    Files: pyproject.toml
    Updated the optional dependency trtllm: tensorrt-llm from 1.0.0rc4 to 1.0.0rc6; no other changes in that block.
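The default-selection behavior described above can be sketched as follows. This is a minimal illustration, not the actual contents of container/build.sh: the variable names for the pinned defaults mirror the script, but the caller-override flow and the TRTLLM_COMMIT name are assumptions.

```shell
# Pinned defaults after this PR (values taken from the change summary).
DEFAULT_EXPERIMENTAL_TRTLLM_COMMIT="a16ba6445c61ed70e7aadfe787d6f316bb422652"
DEFAULT_TENSORRTLLM_PIP_WHEEL="tensorrt-llm==1.0.0rc6"

# Caller may pre-set these; default to empty if unset.
TENSORRTLLM_PIP_WHEEL="${TENSORRTLLM_PIP_WHEEL:-}"
TRTLLM_COMMIT="${TRTLLM_COMMIT:-}"

# When neither a wheel nor a commit is provided, fall back to the pinned
# defaults so unattended builds pick up the newer baseline.
if [ -z "$TENSORRTLLM_PIP_WHEEL" ]; then
    TENSORRTLLM_PIP_WHEEL="$DEFAULT_TENSORRTLLM_PIP_WHEEL"
fi
if [ -z "$TRTLLM_COMMIT" ]; then
    TRTLLM_COMMIT="$DEFAULT_EXPERIMENTAL_TRTLLM_COMMIT"
fi

echo "wheel=$TENSORRTLLM_PIP_WHEEL"
echo "commit=$TRTLLM_COMMIT"
```

Running this with no overrides prints the rc6 wheel and the a16ba64… commit, which is the "unattended build" case the summary refers to.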

Estimated code review effort

🎯 2 (Simple) | ⏱️ ~10 minutes

Poem

A hare with a wrench and a version tag,
Hops past rc4 with a jaunty wag.
Docs trimmed neat, no flags to keep,
Build wheels roll to rc6 deep.
Commit carrots fresh—a crunchy heap! 🥕🐇




coderabbitai bot (Contributor) left a comment


Actionable comments posted: 2

🧹 Nitpick comments (1)
container/build.sh (1)

101-103: Defaulting to a pip wheel conflicts with the earlier "build from source" guidance and the ABI note.

Comments above state the default should be option 1 (local wheel/build from source) due to ABI incompatibility between upstream wheels and NGC PyTorch, yet the logic defaults to installing a versioned pip wheel when neither a wheel nor commit is provided. If TENSORRTLLM_INDEX_URL remains the public PyPI (line 98), this default can be brittle.

Two options:

  • Keep default = build from source (safer with NGC PyTorch):

      -DEFAULT_TENSORRTLLM_PIP_WHEEL="tensorrt-llm==1.0.0rc6"
      +DEFAULT_TENSORRTLLM_PIP_WHEEL=""
      +# When empty, the logic falls back to a commit-based local wheel build.

  • Or, if defaulting to a wheel is now intended, update the surrounding comments to state explicitly that the default is option 2 (wheel), and consider switching TENSORRTLLM_INDEX_URL to a vetted internal index.

Also consider following through on the TODO (lines 99–101) to install ai-dynamo[trtllm] in Dockerfile.trtllm so one source of truth drives the version. I can help wire this up in a follow-up.

📜 Review details

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL


📥 Commits

Reviewing files that changed from the base of the PR and between 7d27033 and cdc657d.

📒 Files selected for processing (5)
  • components/backends/trtllm/README.md (0 hunks)
  • components/backends/trtllm/deploy/README.md (0 hunks)
  • components/backends/trtllm/gemma3_sliding_window_attention.md (0 hunks)
  • container/build.sh (2 hunks)
  • pyproject.toml (1 hunks)
💤 Files with no reviewable changes (3)
  • components/backends/trtllm/deploy/README.md
  • components/backends/trtllm/README.md
  • components/backends/trtllm/gemma3_sliding_window_attention.md
⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (4)
  • GitHub Check: Build and Test - dynamo
  • GitHub Check: pre-merge-rust (lib/runtime/examples)
  • GitHub Check: pre-merge-rust (lib/bindings/python)
  • GitHub Check: pre-merge-rust (.)
🔇 Additional comments (2)
pyproject.toml (1)

52-52: Bump to tensorrt-llm 1.0.0rc6 — looks good.

Aligns with the PR objective to move to rc6. No functional concerns here.
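For reference, the changed block in pyproject.toml presumably has this shape. The "trtllm" extra name and the pinned version come from this PR; the surrounding table layout is an assumption about how the project declares optional dependencies.

```toml
# Hypothetical shape of the optional-dependency block; only the "trtllm"
# extra name and the tensorrt-llm pin are taken from the diff.
[project.optional-dependencies]
trtllm = [
    "tensorrt-llm==1.0.0rc6",
]
```

With this layout, `pip install "ai-dynamo[trtllm]"` (or whatever the project name resolves to) would pull in the pinned rc6 wheel, which is why the review suggests making this block the single source of truth for the version.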

container/build.sh (1)

92-96: Verify DEFAULT_EXPERIMENTAL_TRTLLM_COMMIT — it doesn’t match the DeepGEMM SBSA fix commit cited in the PR.

PR description references upstream fix commit 0ff8df95b7ccf0412b32be7befddbec3503115b6 (“[fix] DeepGEMM installation on SBSA”), but DEFAULT_EXPERIMENTAL_TRTLLM_COMMIT is set to a16ba6445c61ed70e7aadfe787d6f316bb422652 (a docs-only change). If users pass --use-default-experimental-tensorrtllm-commit, they won’t pick up the DeepGEMM fix.

If the intent is to default to the DeepGEMM fix, change to the cited commit:

-DEFAULT_EXPERIMENTAL_TRTLLM_COMMIT="a16ba6445c61ed70e7aadfe787d6f316bb422652"
+DEFAULT_EXPERIMENTAL_TRTLLM_COMMIT="0ff8df95b7ccf0412b32be7befddbec3503115b6"

Please confirm the desired commit/tag for experimental builds by comparing the DeepGEMM SBSA fix commit against the currently configured one on GitHub.

@nv-kmcgill53 (Contributor) left a comment


why do we need to remove the MTP sections from the docs? Should I trust coderabbit when it says they are outdated?

@indrajit96 (Contributor) left a comment


LGTM on the Multimodal-side instructions.

@tanmayv25 (Contributor, Author) commented

why do we need to remove the MTP sections from the docs? Should I trust coderabbit when it says they are outdated?

These docs are outdated; MTP support should be available in the trtllm version we are using.

@tanmayv25 tanmayv25 merged commit 9ab37d9 into main Aug 22, 2025
14 of 16 checks passed
@tanmayv25 tanmayv25 deleted the tanmayv-update branch August 22, 2025 00:59