Commit 3b6a760

chore: fix whitespace issues
Signed-off-by: Neal Vaidya <nealv@nvidia.com>
1 parent b169463 commit 3b6a760

1 file changed: +2 -2 lines

examples/basics/kubernetes/Distributed_Inference/README.md

Lines changed: 2 additions & 2 deletions
@@ -20,11 +20,11 @@ helm install dynamo-platform dynamo-platform-${RELEASE_VERSION}.tgz --namespace
 ```
 3. Model hosting with vLLM backend
 This `agg_router.yaml` is adpated from vLLM deployment [example](https://github.com/ai-dynamo/dynamo/blob/main/components/backends/vllm/deploy/agg_router.yaml). It has following customizations
-- Deployed `Qwen/Qwen2.5-1.5B-Instruct` model
+- Deployed `Qwen/Qwen2.5-1.5B-Instruct` model
 - Use KV cache based routing in frontend deployment `--router-mode kv`
 - Mounted a local cache folder `/YOUR/LOCAL/CACHE/FOLDER` for model artifacts reuse
 - Created 4 replicas for this model deployment by setting `replicas: 4`
-- Added `debug` flag environment variable for observability
+- Added `debug` flag environment variable for observability
 Create a K8S secret with your Huggingface token and then deploy the models
 ```sh
 export HF_TOKEN=YOUR_HF_TOKEN
