Skip to content
Merged
Prev Previous commit
Undo extra changes
  • Loading branch information
KrishnanPrash committed Jul 31, 2025
commit 004a801c6c55f79280e9b6740bc5143d5cf94373
1 change: 1 addition & 0 deletions components/backends/trtllm/engine_configs/agg.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -16,6 +16,7 @@ tensor_parallel_size: 1
moe_expert_parallel_size: 1
enable_attention_dp: false
max_num_tokens: 8192
max_batch_size: 16
trust_remote_code: true
backend: pytorch
enable_chunked_prefill: true
Expand Down
2 changes: 2 additions & 0 deletions container/build.sh
Original file line number Diff line number Diff line change
Expand Up @@ -94,6 +94,8 @@ TRTLLM_USE_NIXL_KVCACHE_EXPERIMENTAL="0"

# TensorRT-LLM PyPI index URL
TENSORRTLLM_INDEX_URL="https://pypi.python.org/simple"
# TODO: Remove the version specification from here and use the ai-dynamo[trtllm] package.
# Need to update the Dockerfile.tensorrt_llm to use the ai-dynamo[trtllm] package.
DEFAULT_TENSORRTLLM_PIP_WHEEL="tensorrt-llm==1.0.0rc4"
TENSORRTLLM_PIP_WHEEL=""

Expand Down
Loading