Member

@hariharans29 hariharans29 commented Sep 6, 2025

Description

The memory alignment of the pre-packed weights buffer for 8-bit Gemms on x86 was accidentally changed in #25110, which added support for the equivalent ARM64 8-bit Gemm kernel. Depending on the platform, the changed alignment can cause either a performance penalty or a segfault when the Gemm kernel executes the corresponding aligned data load instruction.
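For readers unfamiliar with this failure mode, the following is a minimal, illustrative sketch of how a buffer can be allocated with an explicit alignment and how that alignment can be checked before a kernel consumes it. It is not the MLAS code. The names `kAssumedPackedBAlignment`, `IsAligned`, `AllocatePacked`, and `FreePacked` are hypothetical, and the 64-byte value is an assumption chosen only for the example, since the alignment MLAS actually requires is not stated in this PR.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <new>

// Hypothetical value for illustration only; the alignment MLAS actually
// requires for the pre-packed weights buffer is not stated in this PR.
constexpr std::size_t kAssumedPackedBAlignment = 64;

// Returns true when `ptr` sits on an `alignment`-byte boundary.
inline bool IsAligned(const void* ptr, std::size_t alignment) {
    return reinterpret_cast<std::uintptr_t>(ptr) % alignment == 0;
}

// Allocates `bytes` of storage on an `alignment`-byte boundary using the
// C++17 aligned form of operator new. `alignment` must be a power of two.
inline void* AllocatePacked(std::size_t bytes, std::size_t alignment) {
    assert(alignment != 0 && (alignment & (alignment - 1)) == 0);
    return ::operator new(bytes, static_cast<std::align_val_t>(alignment));
}

// Releases storage obtained from AllocatePacked with the same alignment.
inline void FreePacked(void* ptr, std::size_t alignment) {
    ::operator delete(ptr, static_cast<std::align_val_t>(alignment));
}
```

A caller could assert `IsAligned(buffer, kAssumedPackedBAlignment)` right after allocation, so that a regression in the allocator's alignment shows up as a test failure rather than as a fault inside an aligned SIMD load.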

This change fixes the alignment, adds back a couple of tests to the MLAS 8-bit Gemm test suite, and fixes a minor nit in the test file.

Motivation and Context

Resolve packaging pipeline crash

@hariharans29 hariharans29 merged commit 96f4595 into main Sep 8, 2025
128 checks passed
@hariharans29 hariharans29 deleted the hari/debug_8bitGemm_crash branch September 8, 2025 16:11
snnn pushed a commit to snnn/onnxruntime that referenced this pull request Sep 8, 2025
…ts buffer (microsoft#25971)

### Description
The memory alignment of the pre-packed weights buffer for 8-bit Gemms on
x86 was accidentally changed in microsoft#25110, which added support for
the equivalent ARM64 8-bit Gemm kernel. Depending on the platform, the
changed alignment can cause either a performance penalty or a segfault
when the Gemm kernel executes the corresponding aligned data load
instruction.

This change fixes the alignment, adds back a couple of tests to the MLAS
8-bit Gemm test suite, and fixes a minor nit in the test file.

### Motivation and Context
Resolve packaging pipeline crash

(cherry picked from commit 96f4595)
snnn pushed a commit that referenced this pull request Sep 8, 2025
…ts buffer (#25971)

(cherry picked from commit 96f4595)
snnn pushed a commit that referenced this pull request Sep 8, 2025
…ts buffer (#25971)

(cherry picked from commit 96f4595)
snnn added a commit that referenced this pull request Sep 8, 2025
This PR cherry-picks the following PRs to the rel-1.23.0 branch:

* #25938
* #25957
* #25960
* #25968
* #25971

---------

Co-authored-by: Dmitri Smirnov
Co-authored-by: Adrian Lizarraga
Co-authored-by: Hariharan Seshadri
Member

snnn commented Sep 8, 2025

This PR has been cherry-picked into the rel-1.23.0 branch in PR #25985. Removing the release:1.23.0 label.
