-
Notifications
You must be signed in to change notification settings - Fork 14k
Vulkan: Add Integer Dot Product mul_mat_vec shader for legacy quants #14903
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Merged
Merged
Changes from 1 commit
Commits
Show all changes
16 commits
Select commit
Hold shift + click to select a range
c9d62a2
vulkan: Add Integer Dot Product mul_mat_vec shader for legacy quants
0cc4m 2147817
vulkan: use subgroup operations for quantize_q8_1 shader
0cc4m 3a57953
vulkan: add q8_1_x4 type with 128-bit alignment, use in mul_mat_vecq …
0cc4m 7c5f8de
vulkan: use q8_1_x4 blocks in mul_mmq shader
0cc4m 14ccd94
vulkan: do 8 calculations per invocation instead of 32 in mul_mat_vec…
0cc4m 75da350
vulkan: tune mul_mat_vecq performance for Intel
0cc4m ed079a3
vulkan: fix quantizing issue when tensor is not divisible by 128
0cc4m cdf21be
vulkan: adapt integer dot mmv to mmv small m optimization (#15355)
0cc4m 301d79d
vulkan: allow all subgroup modes for mmv and mmvq
0cc4m 7291c7b
vulkan: use prealloc intermediate reuse for mmvq path
0cc4m b637c76
vulkan: tune mmvq for Intel, AMD GCN and Nvidia RTX 3090
0cc4m ab3ab2a
vulkan: adapt mmv quantize_y path to conditional sync logic
0cc4m adc8bac
vulkan: disable q8_0 mmvq on Nvidia
0cc4m 0f12d7b
vulkan: enable q8_0 on Nvidia pre-turing
0cc4m ed5cb48
fix prealloc sync condition
0cc4m bffe00d
fix llvmpipe subgroup 8 issue
0cc4m File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
vulkan: fix quantizing issue when tensor is not divisible by 128
- Loading branch information
commit ed079a300138030a96d883ebf273fb70851f290f
There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Uh oh!
There was an error while loading. Please reload this page.