Commit e8b9331
committed
Squashed commit of the following:
commit e330d96
Author: Yan Ru Pei <[email protected]>
Date: Fri Jul 18 13:40:54 2025 -0700
feat: enable / disable chunked prefill for mockers (#2015)
Signed-off-by: Yan Ru Pei <[email protected]>
Co-authored-by: coderabbitai[bot] <136622811+coderabbitai[bot]@users.noreply.github.com>
commit 353146e
Author: GuanLuo <[email protected]>
Date: Fri Jul 18 13:33:36 2025 -0700
feat: add vLLM v1 multi-modal example. Add llama4 Maverick example (#1990)
Signed-off-by: GuanLuo <[email protected]>
Co-authored-by: krishung5 <[email protected]>
commit 1f07dab
Author: Jacky <[email protected]>
Date: Fri Jul 18 13:04:20 2025 -0700
feat: Add migration to LLM requests (#1930)
commit 5f17918
Author: Tanmay Verma <[email protected]>
Date: Fri Jul 18 12:59:34 2025 -0700
refactor: Migrate to new UX2 for python launch (#2003)
commit fc12436
Author: Graham King <[email protected]>
Date: Fri Jul 18 14:52:57 2025 -0400
feat(frontend): router-mode settings (#2001)
commit dc75cf1
Author: ptarasiewiczNV <[email protected]>
Date: Fri Jul 18 18:47:28 2025 +0200
chore: Move NIXL repo clone to Dockerfiles (#2009)
commit f6f392c
Author: Iman Tabrizian <[email protected]>
Date: Thu Jul 17 18:44:17 2025 -0700
Remove link to the fix for disagg + eagle3 for TRT-LLM example (#2006)
Signed-off-by: Iman Tabrizian <[email protected]>
commit cc90ca6
Author: atchernych <[email protected]>
Date: Thu Jul 17 18:34:40 2025 -0700
feat: Create a convenience script to uninstall Dynamo Deploy CRDs (#1933)
commit 267b422
Author: Greg Clark <[email protected]>
Date: Thu Jul 17 20:44:21 2025 -0400
chore: loosed python requirement versions (#1998)
Signed-off-by: Greg Clark <[email protected]>
commit b8474e5
Author: ishandhanani <[email protected]>
Date: Thu Jul 17 16:35:05 2025 -0700
chore: update cmake and gap installation and sgl in wideep container (#1991)
commit 157a3b0
Author: Biswa Panda <[email protected]>
Date: Thu Jul 17 15:38:12 2025 -0700
fix: incorrect helm upgrade command (#2000)
commit 0dfca2c
Author: Ryan McCormick <[email protected]>
Date: Thu Jul 17 15:33:33 2025 -0700
ci: Update trtllm gitlab triggers for new components directory and test script (#1992)
commit f3fb09e
Author: Kris Hung <[email protected]>
Date: Thu Jul 17 14:59:59 2025 -0700
fix: Fix syntax for tokio-console (#1997)
commit dacffb8
Author: Biswa Panda <[email protected]>
Date: Thu Jul 17 14:57:10 2025 -0700
fix: use non-dev golang image for operator (#1993)
commit 2b29a0a
Author: zaristei <[email protected]>
Date: Thu Jul 17 13:10:42 2025 -0700
fix: Working Arm Build Dockerfile for Vllm_v1 (#1844)
commit 2430d89
Author: Ryan McCormick <[email protected]>
Date: Thu Jul 17 12:57:46 2025 -0700
test: Add trtllm kv router tests (#1988)
commit 1eadc01
Author: Graham King <[email protected]>
Date: Thu Jul 17 15:07:41 2025 -0400
feat(runtime): Support tokio-console (#1986)
commit b62e633
Author: GuanLuo <[email protected]>
Date: Thu Jul 17 11:16:28 2025 -0700
feat: support separate chat_template.jinja file (#1853)
commit 8ae3719
Author: Hongkuan Zhou <[email protected]>
Date: Thu Jul 17 11:12:35 2025 -0700
chore: add some details to dynamo deploy quickstart and fix deploy.sh (#1978)
Signed-off-by: Hongkuan Zhou <[email protected]>
Co-authored-by: julienmancuso <[email protected]>
commit 08891ff
Author: Ryan McCormick <[email protected]>
Date: Thu Jul 17 10:57:42 2025 -0700
fix: Update trtllm tests to use new scripts instead of dynamo serve (#1979)
commit 49b7a0d
Author: Ryan Olson <[email protected]>
Date: Thu Jul 17 08:35:04 2025 -0600
feat: record + analyze logprobs (#1957)
commit 6d2be14
Author: Biswa Panda <[email protected]>
Date: Thu Jul 17 00:17:58 2025 -0700
refactor: replace vllm with vllm_v1 container (#1953)
Co-authored-by: alec-flowers <[email protected]>
commit 4d2a31a
Author: ishandhanani <[email protected]>
Date: Wed Jul 16 18:04:09 2025 -0700
chore: add port reservation to utils (#1980)
commit 1e3e4a0
Author: Alec <[email protected]>
Date: Wed Jul 16 15:54:04 2025 -0700
fix: port race condition through deterministic ports (#1937)
commit 4ad281f
Author: Tanmay Verma <[email protected]>
Date: Wed Jul 16 14:33:51 2025 -0700
refactor: Move TRTLLM example to the component/backends (#1976)
commit 57d24a1
Author: Misha Chornyi <[email protected]>
Date: Wed Jul 16 14:10:24 2025 -0700
build: Removing shell configuration violations. It's bad practice to hardcod… (#1973)
commit 182d3b5
Author: Graham King <[email protected]>
Date: Wed Jul 16 16:12:40 2025 -0400
chore(bindings): Remove mistralrs / llama.cpp (#1970)
commit def6eaa
Author: Harrison Saturley-Hall <[email protected]>
Date: Wed Jul 16 15:50:23 2025 -0400
feat: attributions for debian deps of sglang, trtllm, vllm runtime containers (#1971)
commit f31732a
Author: Yan Ru Pei <[email protected]>
Date: Wed Jul 16 11:22:15 2025 -0700
feat: integrate mocker with dynamo-run and python cli (#1927)
commit aba6099
Author: Graham King <[email protected]>
Date: Wed Jul 16 12:26:32 2025 -0400
perf(router): Remove lock from router hot path (#1963)
commit b212103
Author: Hongkuan Zhou <[email protected]>
Date: Wed Jul 16 08:55:33 2025 -0700
docs: add notes in docs to deprecate local connector (#1959)
commit 7b325ee
Author: Biswa Panda <[email protected]>
Date: Tue Jul 15 18:52:00 2025 -0700
fix: vllm router examples (#1942)
commit a50be1a
Author: hhzhang16 <[email protected]>
Date: Tue Jul 15 17:58:01 2025 -0700
feat: update CODEOWNERS (#1926)
commit e260fdf
Author: Harrison Saturley-Hall <[email protected]>
Date: Tue Jul 15 18:49:21 2025 -0400
feat: add bitnami helm chart attribution (#1943)
Signed-off-by: Harrison Saturley-Hall <[email protected]>
Co-authored-by: coderabbitai[bot] <136622811+coderabbitai[bot]@users.noreply.github.com>
commit 1c03404
Author: Biswa Panda <[email protected]>
Date: Tue Jul 15 14:26:24 2025 -0700
fix: update inference gateway deployment instructions (#1940)
commit 5ca570f
Author: Graham King <[email protected]>
Date: Tue Jul 15 16:54:03 2025 -0400
chore: Rename dynamo.ingress to dynamo.frontend (#1944)
commit 7b9182f
Author: Graham King <[email protected]>
Date: Tue Jul 15 16:33:07 2025 -0400
chore: Move examples/cli to lib/bindings/examples/cli (#1952)
commit 40d40dd
Author: Graham King <[email protected]>
Date: Tue Jul 15 16:02:19 2025 -0400
chore(multi-modal): Rename frontend.py to web.py (#1951)
commit a9e0891
Author: Ryan Olson <[email protected]>
Date: Tue Jul 15 12:30:30 2025 -0600
feat: adding http clients and recorded response stream (#1919)
commit 4128d58
Author: Biswa Panda <[email protected]>
Date: Tue Jul 15 10:30:47 2025 -0700
feat: allow helm upgrade using deploy script (#1936)
commit 4da078b
Author: Graham King <[email protected]>
Date: Tue Jul 15 12:57:38 2025 -0400
fix: Remove OpenSSL dependency, use Rust TLS (#1945)
commit fc004d4
Author: jthomson04 <[email protected]>
Date: Tue Jul 15 08:45:42 2025 -0700
fix: Fix TRT-LLM container build when using a custom pip wheel (#1825)
commit 3c6fc6f
Author: ishandhanani <[email protected]>
Date: Mon Jul 14 22:35:20 2025 -0700
chore: fix typo (#1938)
commit de7fe38
Author: Alec <[email protected]>
Date: Mon Jul 14 21:47:12 2025 -0700
feat: add vllm e2e integration tests (#1935)
commit 860f3f7
Author: Keiven C <[email protected]>
Date: Mon Jul 14 21:44:19 2025 -0700
chore: metrics endpoint variables renamed from HTTP_SERVER->SYSTEM (#1934)
Co-authored-by: Keiven Chang <[email protected]>
commit fc402a3
Author: Biswa Panda <[email protected]>
Date: Mon Jul 14 21:21:20 2025 -0700
feat: configurable namespace for vllm v1 example (#1909)
commit df40d2c
Author: ZichengMa <[email protected]>
Date: Mon Jul 14 21:11:29 2025 -0700
docs: fix typo and add mount-workspace to vllm doc (#1931)
Signed-off-by: ZichengMa <[email protected]>
Co-authored-by: Alec <[email protected]>
commit 901715b
Author: Tanmay Verma <[email protected]>
Date: Mon Jul 14 20:14:51 2025 -0700
refactor: Refactor the TRTLLM examples remove dynamo SDK (#1884)
commit 5bf23d5
Author: hhzhang16 <[email protected]>
Date: Mon Jul 14 18:29:19 2025 -0700
feat: update DynamoGraphDeployments for vllm_v1 (#1890)
Co-authored-by: mohammedabdulwahhab <[email protected]>
commit 9e76590
Author: ishandhanani <[email protected]>
Date: Mon Jul 14 17:29:56 2025 -0700
docs: organize sglang readme (#1910)
commit ef59ac8
Author: KrishnanPrash <[email protected]>
Date: Mon Jul 14 16:16:44 2025 -0700
docs: TRTLLM Example of Llama4+Eagle3 (Speculative Decoding) (#1828)
Signed-off-by: KrishnanPrash <[email protected]>
Co-authored-by: Iman Tabrizian <[email protected]>
commit 053041e
Author: Jorge António <[email protected]>
Date: Tue Jul 15 00:06:38 2025 +0100
fix: resolve incorrect finish reason propagation (#1857)
commit 3733f58
Author: Graham King <[email protected]>
Date: Mon Jul 14 19:04:22 2025 -0400
feat(backends): Python llama.cpp engine (#1925)
commit 6a1350c
Author: Tushar Sharma <[email protected]>
Date: Mon Jul 14 14:56:36 2025 -0700
build: minor improvements to sglang dockerfile (#1917)
commit e2a619b
Author: Neelay Shah <[email protected]>
Date: Mon Jul 14 14:52:53 2025 -0700
fix: remove environment variable passing (#1911)
Signed-off-by: Neelay Shah <[email protected]>
Co-authored-by: Neelay Shah <[email protected]>
commit 3d17a49
Author: Schwinn Saereesitthipitak <[email protected]>
Date: Mon Jul 14 14:41:56 2025 -0700
refactor: remove dynamo build (#1778)
Signed-off-by: Schwinn Saereesitthipitak <[email protected]>
commit 3e0cb07
Author: Anant Sharma <[email protected]>
Date: Mon Jul 14 15:43:48 2025 -0400
fix: copy attributions and license to trtllm runtime container (#1916)
commit fc36bf5
Author: ishandhanani <[email protected]>
Date: Mon Jul 14 12:31:49 2025 -0700
feat: receive kvmetrics from sglang scheduler (#1789)
Co-authored-by: zixuanzhang226 <[email protected]>
commit df91fce
Author: Yan Ru Pei <[email protected]>
Date: Mon Jul 14 12:24:04 2025 -0700
feat: prefill aware routing (#1895)
commit ad8ad66
Author: Graham King <[email protected]>
Date: Mon Jul 14 15:20:35 2025 -0400
feat: Shrink the ai-dynamo wheel by 35 MiB (#1918)
Remove http and llmctl binaries. They have been unused for a while.
commit 480b41d
Author: Graham King <[email protected]>
Date: Mon Jul 14 15:06:45 2025 -0400
feat: Python frontend / ingress node (#1912)1 parent 5055bcd commit e8b9331
File tree
234 files changed
+318039
-16614
lines changed- .cargo
- .devcontainer
- .github/workflows
- components
- backends
- llama_cpp
- src/dynamo/llama_cpp
- mocker
- src/dynamo/mocker
- trtllm
- engine_configs
- deepseek_r1
- mtp
- simple
- wide_ep
- llama4/eagle
- launch
- multinode
- src/dynamo/trtllm
- utils
- request_handlers
- utils
- frontend
- src/dynamo/frontend
- http/src
- metrics/src
- bin
- container
- deps/vllm
- deploy
- cloud
- helm
- operator
- inference-gateway/example
- resources
- metrics
- sdk/src/dynamo/sdk/cli
- docs
- architecture
- guides
- dynamo_deploy
- planner_benchmark
- examples
- multimodal_v1
- components
- configs
- connect
- graphs
- utils
- multimodal
- components
- graphs
- sglang
- components
- docs
- utils
- tensorrt_llm
- common
- components
- configs
- deepseek_r1
- mtp
- vllm
- components
- deploy
- launch
- launch
- dynamo-run
- src
- subprocess
- llmctl
- src
- lib
- bindings/python
- examples/cli
- rust
- llm
- src/dynamo
- llm
- llm
- src
- discovery
- entrypoint/input
- http
- kv_router
- mocker
- model_card
- perf
- preprocessor/prompt
- protocols
- common
- openai
- chat_completions
- embeddings
- tests
- data/replays/deepseek-r1-distill-llama-8b
- runtime
- examples
- src
- component
- pipeline
- network/egress
- nodes
- sinks
- sources
- protocols
- tests
- common
- tests
- serve
- utils
Some content is hidden
Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.
234 files changed
+318039
-16614
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
| 1 | + | |
| 2 | + | |
| 3 | + | |
| 4 | + | |
| 5 | + | |
| 6 | + | |
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
55 | 55 | | |
56 | 56 | | |
57 | 57 | | |
58 | | - | |
59 | | - | |
60 | 58 | | |
61 | 59 | | |
62 | 60 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
52 | 52 | | |
53 | 53 | | |
54 | 54 | | |
| 55 | + | |
55 | 56 | | |
56 | 57 | | |
57 | | - | |
| 58 | + | |
58 | 59 | | |
59 | 60 | | |
60 | 61 | | |
| 62 | + | |
61 | 63 | | |
62 | 64 | | |
63 | 65 | | |
| |||
0 commit comments