add more notes

ai-dynamo · richardhuo-nv · Jun 7, 2025 · Jun 6, 2025 · Jun 6, 2025 · Jun 6, 2025
commit 4f115f43563d67abbca7de4ccf41b6bccf05c416
@@ -127,6 +127,7 @@ Notes:
 - There is a noticeable latency for the first two inference requests. Please send warm-up requests before starting the benchmark.
 - Please keep the `cuda_graph_padding_enabled` setting as `false` in the model engine's configuration. There is a known bug, and the fix will be included in the next release of TensorRT-LLM.
 - MTP support for Disaggregation in Dynamo + TensorRT-LLM is coming soon.
+- MTP performance may vary depending on the acceptance rate of predicted tokens, which is dependent on the dataset or queries used while benchmarking
 
 #### Multi-Node Disaggregated Serving