feat: Adding nixl read() multimodal support for vLLM backend #4271
Conversation
async def _read_decoded_image_via_nixl(
    self, decoded_meta: Dict[str, Any]
) -> PIL.Image.Image:
    """Read decoded image via NIXL RDMA and convert to PIL.Image."""
    # Lazy-init connector
    if self._connector is None:
        self._connector = connect.Connector()
        await self._connector.initialize()
        logger.info("NIXL connector initialized for decoded media")

    # Extract fields
    meta_str = decoded_meta["nixl_metadata"]
    desc = decoded_meta["nixl_descriptor"]
    shape = decoded_meta["shape"]

    # Create tensor to receive RDMA data
    tensor = torch.empty(shape, dtype=torch.uint8)

    # Build RdmaMetadata from frontend-provided descriptor
    # Frontend sends compressed metadata (matches Python nixl_connect)
    rdma_meta = RdmaMetadata(
        descriptors=[
            SerializedDescriptor(
                device="cpu"
                if desc.get("mem_type") == "Dram"
                else f"cuda:{desc.get('device_id', 0)}",
                ptr=desc["addr"],
                size=desc["size"],
            )
        ],
        nixl_metadata=meta_str,
        notification_key=f"img-{shape}",
        operation_kind=int(OperationKind.READ),
    )

    # RDMA read
    read_op = await self._connector.begin_read(
        rdma_meta, connect.Descriptor(tensor)
    )
    await read_op.wait_for_completion()
Not a NIXL expert, so please let me know if there is anything I could be doing better here.
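For completeness, here is a minimal sketch of the tensor-to-PIL conversion that the docstring promises but the diff view cuts off before showing. The HWC uint8 layout and the use of PIL.Image.fromarray are assumptions for illustration, not necessarily what the PR does.

import PIL.Image
import torch


def tensor_to_pil(tensor: torch.Tensor) -> PIL.Image.Image:
    """Convert the uint8 tensor filled by the RDMA read into a PIL image.

    Assumes the frontend decoded the image into HWC (height, width, channels)
    uint8 layout; adjust if the actual layout differs.
    """
    array = tensor.cpu().numpy()
    if array.ndim == 3 and array.shape[2] == 1:
        # Squeeze single-channel images so PIL treats them as grayscale.
        array = array.squeeze(2)
    return PIL.Image.fromarray(array)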
// Compress metadata before base64 encoding (matches Python nixl_connect behavior)
// Backend expects: b64:<base64_of_compressed_bytes>
let mut encoder = ZlibEncoder::new(Vec::new(), Compression::new(6));
encoder.write_all(&nixl_md)?;
let compressed = encoder.finish()?;
Once again, I welcome any suggestions on correct NIXL usage.
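As a side note for reviewers of the wire format: the b64:<base64_of_compressed_bytes> convention described above round-trips with plain zlib plus base64, as in the sketch below. This is only an illustration of the format; in the handler shown earlier the compressed string is passed to RdmaMetadata untouched, so the actual inflation presumably happens inside nixl_connect.

import base64
import zlib


def encode_nixl_metadata(raw: bytes) -> str:
    """Mirror of the Rust snippet: zlib-compress (level 6), base64-encode, prefix with 'b64:'."""
    return "b64:" + base64.b64encode(zlib.compress(raw, level=6)).decode("ascii")


def decode_nixl_metadata(meta_str: str) -> bytes:
    """Inverse: strip the 'b64:' prefix, base64-decode, then zlib-inflate."""
    if not meta_str.startswith("b64:"):
        raise ValueError("expected NIXL metadata prefixed with 'b64:'")
    return zlib.decompress(base64.b64decode(meta_str[len("b64:"):]))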
Open Question for Testing: Ideally, we would like to cover both test cases:
Based on my conversation with @nv-tusharma, IIUC they suggested creating a separate workflow outside
# Build RdmaMetadata from frontend-provided descriptor
# Frontend sends compressed metadata (matches Python nixl_connect)
rdma_meta = RdmaMetadata(
Does this work? Have you tested it?
The "normal flow" is to create a passive operation (ReadableOp or WritableOp) and use its .metadata property to get the set of SerializedDescriptors, rather than composing them manually.
Given that this is an active operation (ReadOp), it should be taking in the metadata to perform the read, not sending the metadata.
Usually the metadata comes from the secondary connection, which in turn got it from its ReadableOperation.
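To make the suggested flow concrete, here is a rough sketch of the passive/active split the reviewers describe. The passive-side names (create_readable, .metadata) and the import path are assumptions taken from the comments above rather than verified nixl_connect API, so treat this as an illustration of the shape of the flow, not a drop-in implementation.

import torch

# Same nixl_connect module used in the diff above; the exact import path
# and the passive-side method names are assumptions, not verified API.
from dynamo import nixl_connect as connect  # import path assumed


def expose_decoded_image(connector, image_tensor: torch.Tensor):
    """Passive side (owner of the decoded pixels): publish the buffer as readable
    and hand back serialized metadata for the peer. `create_readable` and
    `.metadata` are placeholder names based on the reviewers' description."""
    readable_op = connector.create_readable(connect.Descriptor(image_tensor))
    return readable_op.metadata, readable_op


async def fetch_decoded_image(connector, serialized_metadata, shape):
    """Active side (vLLM backend): consume the metadata produced by the passive
    side instead of hand-building RdmaMetadata in the handler."""
    target = torch.empty(shape, dtype=torch.uint8)
    read_op = await connector.begin_read(serialized_metadata, connect.Descriptor(target))
    await read_op.wait_for_completion()
    return target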
# Extract fields
meta_str = decoded_meta["nixl_metadata"]
desc = decoded_meta["nixl_descriptor"]
What type is desc?
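From the handler code above, desc looks like a plain dict deserialized from the frontend's descriptor; the keys it is accessed with are collected below. This is inferred from the .get()/[] calls in the diff, not from an actual schema, and the values are purely illustrative.

# Shape of decoded_meta["nixl_descriptor"] as the handler consumes it
# (inferred from the accesses in the diff; not an authoritative schema).
desc = {
    "mem_type": "Dram",      # "Dram" maps to device "cpu"; anything else maps to cuda:<device_id>
    "device_id": 0,          # used only in the non-Dram (cuda) case
    "addr": 0x7F0000000000,  # remote buffer address (illustrative value)
    "size": 3 * 224 * 224,   # remote buffer size in bytes (illustrative value)
}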
# Frontend sends compressed metadata (matches Python nixl_connect)
rdma_meta = RdmaMetadata(
    descriptors=[
        SerializedDescriptor(
Why not pass desc to a Descriptor and then serialize that descriptor to get the metadata?
Overview:
With #3988, we have functional image decoding in the frontend for any b64 or HTTP URLs passed with the inference request. This PR builds on top of #3988 and implements the nixl read() portion of the image decoding workflow for the backend.
Details:
Look at handlers.py for the additions to the DECODED workflow.
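For readers who want a mental model before opening handlers.py, here is a purely hypothetical sketch of how the DECODED branch could hand off to the method shown earlier. Only _read_decoded_image_via_nixl itself appears in this diff; the branch condition, field names, and fallback helper below are invented for illustration.

from typing import Any, Dict

import PIL.Image


async def _load_multimodal_item(self, item: Dict[str, Any]) -> PIL.Image.Image:
    # Hypothetical dispatch: only the DECODED/NIXL branch is grounded in this PR.
    if item.get("kind") == "DECODED":
        # Frontend already decoded the image; pull the raw pixels over NIXL RDMA.
        return await self._read_decoded_image_via_nixl(item["decoded_meta"])
    # Otherwise fall back to the pre-existing (#3988) url/b64 decoding path
    # (helper name is a placeholder).
    return await self._fetch_and_decode(item)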