fix the command and wording
richardhuo-nv committed Jun 6, 2025
commit 7f41cdb7d807c4aea1bca8feaa8e7ab72c6209f2
4 changes: 2 additions & 2 deletions examples/tensorrt_llm/README.md
@@ -121,12 +121,12 @@ dynamo serve graphs.disagg_router:Frontend -f ./configs/disagg_router.yaml
#### Aggregated serving with Multi-Token Prediction (MTP) and DeepSeek R1
```bash
cd /workspace/examples/tensorrt_llm
-dynamo serve graphs.disagg_router:Frontend -f configs/deepseek_r1/mtp/mtp_agg.yaml
+dynamo serve graphs.agg:Frontend -f configs/deepseek_r1/mtp/mtp_agg.yaml
```
Notes:
- There is a noticeable latency for the first two inference requests. Please send warm-up requests before starting the benchmark.
- Please keep the `cuda_graph_padding_enabled` setting as `false` in the model engine's configuration. There is a known bug, and the fix will be included in the next release of TensorRT-LLM.
-- Disaggregated support for MTP in Dynamo + TensorRT-LLM is coming soon.
+- MTP support for disaggregation in Dynamo + TensorRT-LLM is coming soon.
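The warm-up note above can be sketched as a small client that sends a couple of throwaway requests before benchmarking. This is a minimal sketch, not part of the diff: the port (`8000`), the OpenAI-style `/v1/chat/completions` endpoint, and the model name `deepseek-r1` are assumptions — substitute whatever your Dynamo frontend actually exposes.

```python
import json
import urllib.request


def build_warmup_request(base_url="http://localhost:8000",
                         model="deepseek-r1",
                         prompt="warmup"):
    # Hypothetical endpoint and model name; adjust to your deployment.
    payload = {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
        "max_tokens": 8,  # keep warm-up responses short
    }
    return urllib.request.Request(
        f"{base_url}/v1/chat/completions",
        data=json.dumps(payload).encode(),
        headers={"Content-Type": "application/json"},
    )


def send_warmup_requests(n=2, **kwargs):
    # The note above observes that the first two requests are slow,
    # so two warm-up requests is a reasonable default.
    for _ in range(n):
        with urllib.request.urlopen(build_warmup_request(**kwargs)) as resp:
            resp.read()
```

Call `send_warmup_requests()` once the server is up, then start the benchmark.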

#### Multi-Node Disaggregated Serving
