Skip to content

Conversation

@snnn
Copy link
Member

@snnn snnn commented Aug 1, 2025

This PR cherry-picks some pipeline changes from the main branch to the 1.23.0 release branch.

And a WebGPU change:

fs-eire and others added 3 commits July 31, 2025 17:55
### Description
We have a big packaging pipeline that build nuget/java/nodejs packages.
After that we run these. This PR split the tests to a dedicated pipeline
and refactored the code that use maven to download deps instead of using
direct HTTP fetch. The new approach allows us to use Azure DevOps
artifacts as an internal mirror to meet network isolation requirements.
Thsi PR also enabled WebGPU and CoreML EP tests for java package on macOS.

This PR also updated tools/python/run_packaging_pipelines.py a little
bit to add the support for RC releases.

### Motivation and Context
Make the packaging pipelines smaller and easier to use.
@snnn snnn changed the title users/snnn/rel 1.23.0 Cherry-picks for ORT 1.23.0 Aug 1, 2025
…gth (#25594)

### Description
<!-- Describe your changes. -->
#25372 adds sliding window support for Group Query Attention, disabling
Flash Attention as it's not yet supported.

This PR adds a check for the sliding window and applies Flash Attention
when the window size exceeds the KV cache length or total sequence
length.

### Motivation and Context
See above.
@snnn snnn merged commit a033ad6 into rel-1.23.0 Aug 1, 2025
80 checks passed
@snnn snnn deleted the users/snnn/rel-1.23.0 branch August 1, 2025 20:50
@snnn snnn mentioned this pull request Sep 16, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

5 participants