Optimize function call serializing #778

ludfjig · 2025-08-11T22:01:28Z

Closes #789

This PR optimizes serialization of FunctionCalls by:

Making the serialization call return a &[u8] instead of a Vec<u8>, which saves a memory allocation when writing the result to memory.
Avoiding an unnecessary step in which VecBytes(Vec<u8>) was converted to a Vec through a needless iterator.
Preallocating a FlatBufferBuilder with a specific capacity to avoid reallocations as it grows. This comes at the small runtime cost of estimating the capacity that will be needed. In practice this estimation is very fast and almost always worth it.
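
The first and third items can be illustrated with a minimal, std-only sketch. It is not the code from this PR: `Builder`, `Call`, `estimate_capacity` and the length-prefixed byte layout are hypothetical stand-ins for the FlatBufferBuilder, FunctionCall and capacity-estimation utility, chosen only to show the shape of the idea (encode into a caller-owned, presized buffer and hand back a borrowed slice).

```rust
/// Hypothetical stand-in for a FlatBufferBuilder: owns one reusable byte buffer.
struct Builder {
    buf: Vec<u8>,
}

impl Builder {
    /// Preallocate so that encoding does not need to grow the buffer.
    fn with_capacity(capacity: usize) -> Self {
        Builder { buf: Vec::with_capacity(capacity) }
    }
}

/// Hypothetical, simplified function call: a name plus one byte-vector parameter.
struct Call {
    name: String,
    payload: Vec<u8>,
}

impl Call {
    /// Cheap size estimate: two 4-byte length prefixes plus the data itself.
    fn estimate_capacity(&self) -> usize {
        8 + self.name.len() + self.payload.len()
    }

    /// Encode into the caller's builder and return a borrowed view of the bytes.
    /// No new Vec is allocated for the result; the slice borrows from `builder`.
    fn encode<'a>(&self, builder: &'a mut Builder) -> &'a [u8] {
        builder.buf.clear();
        builder.buf.extend_from_slice(&(self.name.len() as u32).to_le_bytes());
        builder.buf.extend_from_slice(self.name.as_bytes());
        builder.buf.extend_from_slice(&(self.payload.len() as u32).to_le_bytes());
        builder.buf.extend_from_slice(&self.payload);
        &builder.buf
    }
}

fn main() {
    let call = Call { name: "Echo".to_string(), payload: vec![0u8; 24 * 1024] };
    let mut builder = Builder::with_capacity(call.estimate_capacity());
    let capacity_before = builder.buf.capacity();

    let bytes = call.encode(&mut builder);
    let len = bytes.len();

    assert_eq!(len, call.estimate_capacity());
    // The buffer never had to grow, so no reallocation happened during encoding.
    assert_eq!(builder.buf.capacity(), capacity_before);
}
```

Because the returned slice borrows from the builder, the borrow checker prevents the builder from being mutated or reused while the encoded bytes are still in use.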

Future Todos:

This PR only affects the host side, and there is a lot of room for improvement on the guest side: guest functions should not need to return a Vec<u8>. Instead, they should be passed some kind of Writer to which they write their return values.
FunctionCall should contain borrowed &str/&[u8] instead of String and Vec<u8>.
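
One possible shape for these two todos, sketched under loud assumptions: `BorrowedCall` and `echo_upper` are hypothetical names, and `std::io::Write` is used only for illustration. A real guest would likely be `no_std` and need its own writer trait.

```rust
use std::io::{self, Write};

/// Hypothetical borrowed view of a call: building it allocates no String or Vec<u8>.
struct BorrowedCall<'a> {
    name: &'a str,
    payload: &'a [u8],
}

/// Hypothetical guest function that writes its return value into a caller-supplied
/// writer instead of allocating and returning a Vec<u8>.
fn echo_upper<W: Write>(call: &BorrowedCall<'_>, out: &mut W) -> io::Result<()> {
    for byte in call.payload {
        out.write_all(&[byte.to_ascii_uppercase()])?;
    }
    Ok(())
}

fn main() -> io::Result<()> {
    let input = b"hello".to_vec();
    let call = BorrowedCall { name: "EchoUpper", payload: &input };
    assert_eq!(call.name, "EchoUpper");

    // A fixed-size buffer acts as the writer; `&mut [u8]` implements `Write`
    // and shrinks as bytes are written into it.
    let mut out = [0u8; 16];
    let mut cursor = &mut out[..];
    echo_upper(&call, &mut cursor)?;
    let written = 16 - cursor.len();

    assert_eq!(&out[..written], b"HELLO");
    Ok(())
}
```

The caller decides where the bytes land (a stack buffer, shared memory, or a growable vector), so the function itself never needs to allocate for its result.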

Relevant benchmark results compared to the main branch (but with c2e6cdd, which adds the first two benchmarks):

guest_functions_with_large_parameters/guest_call_with_large_parameters
                        time:   [723.24 ms 756.56 ms 793.92 ms]
                        change: [−15.950% −10.778% −5.4238%] (p = 0.00 < 0.05)
                        Performance has improved.
Found 15 outliers among 100 measurements (15.00%)
  5 (5.00%) high mild
  10 (10.00%) high severe

function_call_serialization/serialize_function_call
                        time:   [5.5803 ms 5.6351 ms 5.7012 ms]
                        change: [−80.248% −79.529% −78.899%] (p = 0.00 < 0.05)
                        Performance has improved.
Found 6 outliers among 100 measurements (6.00%)
  5 (5.00%) high mild
  1 (1.00%) high severe
function_call_serialization/deserialize_function_call
                        time:   [8.2955 ms 8.3794 ms 8.4724 ms]
                        change: [−58.447% −57.639% −56.842%] (p = 0.00 < 0.05)
                        Performance has improved.
Found 12 outliers among 100 measurements (12.00%)
  4 (4.00%) high mild
  8 (8.00%) high severe

sample_workloads/24K_in_8K_out_c
                        time:   [28.390 µs 28.675 µs 29.009 µs]
                        change: [−45.020% −42.425% −39.887%] (p = 0.00 < 0.05)
                        Performance has improved.
Found 4 outliers among 100 measurements (4.00%)
  2 (2.00%) high mild
  2 (2.00%) high severe
sample_workloads/24K_in_8K_out_rust
                        time:   [27.788 µs 27.997 µs 28.237 µs]
                        change: [−38.325% −36.829% −35.245%] (p = 0.00 < 0.05)
                        Performance has improved.
Found 7 outliers among 100 measurements (7.00%)
  4 (4.00%) high mild
  3 (3.00%) high severe

jsturtevant

LGTM, thanks for the extensive tests!

src/hyperlight_common/src/flatbuffer_wrappers/util.rs

src/hyperlight_guest/Cargo.toml

src/hyperlight_host/benches/benchmarks.rs

Copilot

Pull Request Overview

This PR optimizes FunctionCall serialization to improve performance by reducing memory allocations and improving buffer capacity estimation. The changes focus on making serialization more efficient by returning borrowed slices instead of owned vectors and pre-allocating FlatBufferBuilder capacity.

Key changes:

Refactored FunctionCall serialization to use encode() method returning &[u8] instead of TryFrom<FunctionCall> for Vec<u8>
Added capacity estimation function to pre-allocate FlatBufferBuilder capacity and avoid reallocations
Updated function signatures to accept &[u8] instead of Vec<u8> where possible

Reviewed Changes

Copilot reviewed 11 out of 16 changed files in this pull request and generated 3 comments.

File	Description
`src/hyperlight_common/src/flatbuffer_wrappers/function_call.rs`	Refactored serialization from `TryFrom` trait to `encode()` method returning borrowed slice
`src/hyperlight_common/src/flatbuffer_wrappers/util.rs`	Added capacity estimation function with comprehensive tests
`src/hyperlight_common/src/flatbuffer_wrappers/function_types.rs`	Fixed VecBytes deserialization to use `.bytes().to_vec()` instead of iterator
`src/hyperlight_host/src/sandbox/initialized_multi_use.rs`	Updated to use new encode method with capacity estimation
`src/hyperlight_guest/src/guest_handle/io.rs`	Changed parameter from `Vec<u8>` to `&[u8]`
`src/hyperlight_guest/src/guest_handle/host_comm.rs`	Updated to use new encode method and pass references
`src/hyperlight_guest_bin/src/guest_function/call.rs`	Updated function call to pass reference instead of owned vector
`src/hyperlight_host/benches/benchmarks.rs`	Added benchmarks for serialization performance and sample workloads
`src/tests/rust_guests/simpleguest/src/main.rs`	Added benchmark test function for 24K input/8K output scenario
`src/tests/c_guests/c_simpleguest/main.c`	Added C version of benchmark test function
`src/hyperlight_guest/Cargo.toml`	Added flatbuffers dependency

src/hyperlight_common/src/flatbuffer_wrappers/util.rs

src/hyperlight_common/src/flatbuffer_wrappers/function_call.rs

danbugs

Nice optimization! Mostly LGTM. Just a couple of questions/nits here and there 👍

src/hyperlight_common/src/flatbuffer_wrappers/util.rs

src/tests/c_guests/c_simpleguest/main.c

jsturtevant

Feel free to address the remaining comments but this LGTM

ludfjig requested review from danbugs, dblnz, devigned, syntactically, marosset, jprendes and simongdavies as code owners August 11, 2025 22:01

ludfjig force-pushed the big_param_opt branch 2 times, most recently from bf0445e to 4196a2f August 11, 2025 22:45

ludfjig added the kind/enhancement (for PRs adding features, improving functionality, docs, tests, etc.) and area/performance (addresses performance) labels Aug 11, 2025

ludfjig force-pushed the big_param_opt branch 7 times, most recently from 1ffa58d to 0c83035 August 14, 2025 19:26

jsturtevant reviewed Aug 14, 2025

ludfjig force-pushed the big_param_opt branch from 0c83035 to ea5fa1a August 19, 2025 01:01

ludfjig mentioned this pull request Aug 20, 2025

Optimize serialization of function calls #444

Closed

danbugs requested a review from Copilot August 21, 2025 22:51

Copilot AI reviewed Aug 21, 2025

src/hyperlight_common/src/flatbuffer_wrappers/util.rs

src/hyperlight_common/src/flatbuffer_wrappers/util.rs

src/hyperlight_common/src/flatbuffer_wrappers/function_call.rs

danbugs previously approved these changes Aug 21, 2025

src/hyperlight_common/src/flatbuffer_wrappers/util.rs

src/tests/c_guests/c_simpleguest/main.c

src/tests/c_guests/c_simpleguest/main.c

jsturtevant previously approved these changes Aug 22, 2025

ludfjig dismissed stale reviews from jsturtevant and danbugs via fb898f7 August 25, 2025 18:55

ludfjig force-pushed the big_param_opt branch from ea5fa1a to fb898f7 August 25, 2025 18:55

ludfjig added 2 commits August 25, 2025 11:56

Add FunctionCall serialization benchmarks

3251f22

and a c+rust sample workload benchmark Signed-off-by: Ludvig Liljenberg <[email protected]>

Avoid intermediary iterator for vecbytes parametervalue

39ff6f9

Signed-off-by: Ludvig Liljenberg <[email protected]>

ludfjig added 4 commits August 25, 2025 11:56

Add util for estimating capacity needed for flatbuffer-encoding funct…

4360562

…ioncall Signed-off-by: Ludvig Liljenberg <[email protected]>

Make serializing a FunctionCall return a &[u8] instead of Vec<u8>.

6c7dccf

This should save memory allocations, but require that a FlatBufferBuilder is passed in. Signed-off-by: Ludvig Liljenberg <[email protected]>

pr feedback: permalink in comment

3ddae68

Signed-off-by: Ludvig Liljenberg <[email protected]>

Add some notes about not reusing builder after calling FunctionCall::…

e20eac3

…encode Signed-off-by: Ludvig Liljenberg <[email protected]>

ludfjig force-pushed the big_param_opt branch from fb898f7 to e20eac3 August 25, 2025 18:56

jsturtevant approved these changes Aug 25, 2025

jsturtevant enabled auto-merge (squash) August 25, 2025 19:00

jsturtevant merged commit 85b4510 into hyperlight-dev:main Aug 25, 2025
33 checks passed

ludfjig mentioned this pull request Aug 25, 2025

Build c guests as required by benchmarks #822

Merged
