Skip to content

Conversation

@nv-anants
Copy link
Contributor

@nv-anants nv-anants commented Aug 22, 2025

Overview:

Update the trtllm al2023 known issue to be more precise

ref: OPS-754

Summary by CodeRabbit

  • Documentation
    • Updated the support matrix to clarify a networking issue when running the AL2023 container locally with host networking, referencing a known mpi4py-related problem.
    • Added guidance to avoid host networking and use explicit port mappings instead, with examples (e.g., 4222 for NATS, 2379/2380 for etcd, 8080 for the frontend).

@coderabbitai
Copy link
Contributor

coderabbitai bot commented Aug 22, 2025

Walkthrough

Updated docs/support_matrix.md to replace the AWS caution: removed the TensorRT‑LLM Python wheels note, added a caution about running AL2023 locally with docker run --network host due to an mpi4py issue, and provided explicit port-mapping guidance (e.g., 4222 for NATS, 2379/2380 for etcd, 8080 for frontend).

Changes

Cohort / File(s) Summary of Changes
Docs — Support Matrix
docs/support_matrix.md
Replaced prior AWS caution (TensorRT‑LLM via Python wheels) with a note on AL2023 local runs using docker --network host referencing an mpi4py bug; added explicit port mapping examples for NATS (4222), etcd (2379/2380), and frontend (8080).

Estimated code review effort

🎯 1 (Trivial) | ⏱️ ~2 minutes

Possibly related PRs

Poem

I thump my paw: a doc refined,
Swap cautions clear, with ports aligned.
4222—NATS is set,
etcd twinned, no misstep yet.
Docker drums, mpi4py’s note—
I hop along, commit the quote.

Tip

🔌 Remote MCP (Model Context Protocol) integration is now available!

Pro plan users can now connect to remote MCP servers from the Integrations page. Connect with popular remote MCPs such as Notion and Linear to add more context to your reviews and chats.


Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share
🪧 Tips

Chat

There are 3 ways to chat with CodeRabbit:

  • Review comments: Directly reply to a review comment made by CodeRabbit. Example:
    • I pushed a fix in commit <commit_id>, please review it.
    • Open a follow-up GitHub issue for this discussion.
  • Files and specific lines of code (under the "Files changed" tab): Tag @coderabbitai in a new review comment at the desired location with your query.
  • PR comments: Tag @coderabbitai in a new PR comment to ask questions about the PR branch. For the best results, please provide a very specific query, as very limited context is provided in this mode. Examples:
    • @coderabbitai gather interesting stats about this repository and render them as a table. Additionally, render a pie chart showing the language distribution in the codebase.
    • @coderabbitai read the files in the src/scheduler package and generate a class diagram using mermaid and a README in the markdown format.

Support

Need help? Create a ticket on our support page for assistance with any issues or questions.

CodeRabbit Commands (Invoked using PR/Issue comments)

Type @coderabbitai help to get the list of available commands.

Other keywords and placeholders

  • Add @coderabbitai ignore anywhere in the PR description to prevent this PR from being reviewed.
  • Add @coderabbitai summary to generate the high-level summary at a specific location in the PR description.
  • Add @coderabbitai anywhere in the PR title to generate the title automatically.

Status, Documentation and Community

  • Visit our Status Page to check the current availability of CodeRabbit.
  • Visit our Documentation for detailed information on how to use CodeRabbit.
  • Join our Discord Community to get help, request features, and share feedback.
  • Follow us on X/Twitter for updates and announcements.

Copy link
Contributor

@coderabbitai coderabbitai bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Actionable comments posted: 0

🧹 Nitpick comments (4)
docs/support_matrix.md (4)

86-86: Clarify Linux-only semantics of --network host and scope of impact

Host networking is only available on Linux Docker. Suggest making that explicit to avoid confusing macOS/Windows users, and clarify that the issue is specifically observed when the AL2023 container is launched with host networking on Linux.

Apply this doc tweak:

-> ¹ There is a known issue with the TensorRT-LLM framework when running the AL2023 container locally with `docker run --network host ...` due to a [bug](https://github.com/mpi4py/mpi4py/discussions/491#discussioncomment-12660609) in mpi4py. To avoid this issue, replace the `--network host` flag with more precise networking configuration by mapping only the necessary ports (e.g., 4222 for nats, 2379/2380 for etcd, 8080 for frontend).
+> ¹ There is a known issue with the TensorRT‑LLM framework when running the Amazon Linux 2023 (AL2023) container on a Linux host with `docker run --network host` due to a known [mpi4py bug](https://github.com/mpi4py/mpi4py/discussions/491#discussioncomment-12660609). Avoid `--network host` and map only the required ports instead (e.g., 4222 for NATS, 2379/2380 for etcd, 8080 for frontend). On Docker Desktop (macOS/Windows), host networking is not supported; use `-p` port mappings.

86-86: Provide a concrete docker run example with port mappings

A short example helps users copy/paste and reduces guesswork.

Add this example under the caution:

 > ¹ There is a known issue with the TensorRT‑LLM framework when running the Amazon Linux 2023 (AL2023) container on a Linux host with `docker run --network host` due to a known [mpi4py bug](https://github.com/mpi4py/mpi4py/discussions/491#discussioncomment-12660609). Avoid `--network host` and map only the required ports instead (e.g., 4222 for NATS, 2379/2380 for etcd, 8080 for frontend). On Docker Desktop (macOS/Windows), host networking is not supported; use `-p` port mappings.
+>
+> Example:
+>
+> ```bash
+> docker run --rm \
+>   -p 4222:4222 \   # NATS
+>   -p 2379:2379 \   # etcd client
+>   -p 2380:2380 \   # etcd peer
+>   -p 8080:8080 \   # frontend
+>   <your-trtllm-runtime-image>:<tag> ...
+> ```

86-86: Service name capitalization and consistency

Suggest capitalizing NATS (proper name), keeping “etcd” lowercase, and using “frontend” consistently across docs.

Apply minor copy edits (already reflected in the suggested diffs above) to use “NATS”, “etcd”, and “frontend”.


86-86: Update version constraints & confirm port mappings in docs/support_matrix.md

  • The ports listed in the note—4222 for NATS, 2379/2380 for etcd, and 8080 for the frontend—match our defaults:
    • NATS default port is 4222 (default_server() in lib/runtime/src/transports/nats.rs returns "nats://localhost:4222")
    • The frontend’s default HTTP port is 8080 (as shown in README.md example: python -m dynamo.frontend --http-port 8080)
    • etcd’s standard client/peer ports are 2379 and 2380, corresponding to the flags you’d map in a Docker run or Compose setup.
    No changes needed to the port list itself.

  • Optional: Add version constraints to future-proof this caution. For example, specify the affected TensorRT-LLM release (e.g. “TensorRT-LLM v0.1.3”) and the mpi4py versions impacted by the bug (e.g. “mpi4py ≥3.1.4”) so users can immediately tell if they’re vulnerable.

📜 Review details

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Pro

💡 Knowledge Base configuration:

  • MCP integration is disabled by default for public repositories
  • Jira integration is disabled by default for public repositories
  • Linear integration is disabled by default for public repositories

You can enable these sources in your CodeRabbit configuration.

📥 Commits

Reviewing files that changed from the base of the PR and between 02e59bb and 5ef35db.

📒 Files selected for processing (1)
  • docs/support_matrix.md (1 hunks)
⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (2)
  • GitHub Check: Mirror Repository to GitLab
  • GitHub Check: Build and Test - dynamo
🔇 Additional comments (1)
docs/support_matrix.md (1)

85-86: LGTM — clearer caution and actionable guidance

The new caution is more precise than the prior note about wheels and gives users a workable mitigation. Nice improvement.

@nv-anants nv-anants merged commit ea391f3 into main Aug 22, 2025
14 checks passed
@nv-anants nv-anants deleted the anants/update-trtllm-message branch August 22, 2025 17:08
hhzhang16 pushed a commit that referenced this pull request Aug 27, 2025
KrishnanPrash pushed a commit that referenced this pull request Sep 2, 2025
nnshah1 pushed a commit that referenced this pull request Sep 8, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants