Compile API: disable optimizations by default #25474

adrianlizarraga · 2025-07-21T16:12:17Z

Description

Disables graph optimizations by default when using the explicit compiling API.
Adds ModelCompilationOptions_SetGraphOptimizationLevel to allow the user to set an optimization level.
Adds C++, Python, and C# bindings for the new API function.
Updates ModelCompilationOptions_SetFlags to take in a uint32_t flags parameter instead of size_t flags to ensure the same size across platforms. This API is not yet in a public ORT release, so safe to modify.

Motivation and Context

When compiling, prefer allowing the EP to do the optimizations instead of ORT.

…fault

Copilot

Pull Request Overview

This PR disables graph optimizations by default in the explicit compiling API, preferring to let execution providers handle optimizations instead. It adds a new method to allow users to manually enable optimizations when needed.

Changes the default graph optimization level to TransformerLevel::Default (minimum optimizations) instead of the previous behavior
Adds ModelCompilationOptions_SetGraphOptimizationLevel API to allow users to explicitly set optimization levels
Updates test expectations to account for different EPContext node input counts based on optimization settings

Reviewed Changes

Copilot reviewed 8 out of 8 changed files in this pull request and generated 2 comments.

Show a summary per file

File	Description
onnxruntime/core/session/model_compilation_options.h	Declares new SetGraphOptimizationLevel method
onnxruntime/core/session/model_compilation_options.cc	Implements optimization level setting and sets default to L0
onnxruntime/core/session/compile_api.h	Declares C API wrapper for graph optimization level setting
onnxruntime/core/session/compile_api.cc	Implements C API wrapper function
include/onnxruntime/core/session/onnxruntime_c_api.h	Adds C API function declaration and documentation
include/onnxruntime/core/session/onnxruntime_cxx_api.h	Adds C++ wrapper method declaration
include/onnxruntime/core/session/onnxruntime_cxx_inline.h	Implements C++ wrapper method
onnxruntime/test/providers/qnn/qnn_ep_context_test.cc	Updates tests to handle different optimization behaviors and add explicit optimization setting

_{Tip: Customize your code reviews with copilot-instructions.md. Create the file or learn how to get started.}

onnxruntime/core/session/model_compilation_options.cc

onnxruntime/core/session/compile_api.h

onnxruntime/test/providers/qnn/qnn_ep_context_test.cc

…fault

csharp/src/Microsoft.ML.OnnxRuntime/CompileModel.shared.cs

### Description - Disables graph optimizations by default when using the explicit compiling API. - Adds `ModelCompilationOptions_SetGraphOptimizationLevel` to allow the user to set an optimization level. - Adds C++, Python, and C# bindings for the new API function. - Updates `ModelCompilationOptions_SetFlags` to take in a `uint32_t flags` parameter instead of `size_t flags` to ensure the same size across platforms. This API is not yet in a public ORT release, so safe to modify. ### Motivation and Context When compiling, prefer allowing the EP to do the optimizations instead of ORT.

### Description Cherry-pick the following PRs: #25943 #25937 #25917 #25909 #25898 #25897 #25888 #25881 #25830 #25619 #25575 #25572 #25558 #25530 #25474 #25455 #25110 Also two dependent PRs for qMoE cpu: #25877 #25822 --------- Co-authored-by: xiaomsft <[email protected]> Co-authored-by: Xiaoyan Hu <[email protected]> Co-authored-by: Akshay Sonawane <[email protected]> Co-authored-by: Kunal Vaishnavi <[email protected]> Co-authored-by: Pradeep Sakhamoori <[email protected]> Co-authored-by: mingyue <[email protected]> Co-authored-by: Maximilian Müller <[email protected]> Co-authored-by: Adrian Lizarraga <[email protected]> Co-authored-by: Dmitri Smirnov <[email protected]> Co-authored-by: Emmanuel <[email protected]> Co-authored-by: Emmanuel Assumang <[email protected]> Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: praneshgo <[email protected]> Co-authored-by: Hariharan Seshadri <[email protected]> Co-authored-by: Jing Fang <[email protected]> Co-authored-by: Ishwar Raut <[email protected]>

Compile API: disable optimizations by default

c46fd7a

jywu-msft added the release:1.23.1 label Jul 22, 2025

jywu-msft added release:1.23.0 and removed release:1.23.1 labels Aug 2, 2025

Merge branch 'main' into adrianl/compile-api-disable-optimizations-de…

8eaa205

…fault

yuslepukhin requested a review from Copilot August 29, 2025 22:08

Copilot AI reviewed Aug 29, 2025

View reviewed changes

onnxruntime/core/session/model_compilation_options.cc Show resolved Hide resolved

onnxruntime/core/session/model_compilation_options.cc Outdated Show resolved Hide resolved

yuslepukhin requested changes Aug 29, 2025

View reviewed changes

onnxruntime/core/session/compile_api.h Show resolved Hide resolved

onnxruntime/core/session/compile_api.h Show resolved Hide resolved

onnxruntime/test/providers/qnn/qnn_ep_context_test.cc Outdated Show resolved Hide resolved

adrianlizarraga added 5 commits September 1, 2025 21:24

Merge branch 'main' into adrianl/compile-api-disable-optimizations-de…

9e832fb

…fault

Address some review comments

b2fb4b2

python bindings

4e5a1dd

Change flags from size_t to uint32_t

e5e82d0

c# bindings

cb14180

adrianlizarraga commented Sep 2, 2025

View reviewed changes

csharp/src/Microsoft.ML.OnnxRuntime/CompileModel.shared.cs Show resolved Hide resolved

yuslepukhin approved these changes Sep 2, 2025

View reviewed changes

adrianlizarraga merged commit 45ffd99 into main Sep 3, 2025
92 checks passed

adrianlizarraga deleted the adrianl/compile-api-disable-optimizations-default branch September 3, 2025 15:48

gedoensmax mentioned this pull request Sep 4, 2025

[Draft] TRT RTX documentation update thevishalagarwal/onnxruntime#6

Open

tianleiwu mentioned this pull request Sep 4, 2025

cherry picks for 1.23.0 release #25959

Merged

tianleiwu added cherry-picked Cherry-picked for a cherrypicks branch and removed release:1.23.0 labels Sep 4, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Compile API: disable optimizations by default #25474

Compile API: disable optimizations by default #25474

Uh oh!

adrianlizarraga commented Jul 21, 2025 •

edited

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Compile API: disable optimizations by default #25474

Compile API: disable optimizations by default #25474

Uh oh!

Conversation

adrianlizarraga commented Jul 21, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Motivation and Context

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull Request Overview

Reviewed Changes

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

adrianlizarraga commented Jul 21, 2025 •

edited

Loading