Fix CRC32 encoding #60329

BruceForstall · 2021-10-13T02:59:31Z

On x64, when the crc32 instruction 2nd operand is a memory address
(such as for a static field), and that address is containable
(which normally doesn't happen, because the address will be above
the 4GB lower address space), then the instruction was being
improperly encoded.

Fixes #59714
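
For reference, here is a minimal sketch of the byte layout one would expect for `crc32 r32, dword ptr [disp32]` with an absolute 32-bit address in 64-bit mode. It is not the JIT's emitter and does not reproduce the faulty output; `EncodeCrc32AbsMem32` is a hypothetical helper, and it assumes a destination register in the range eax..edi (no REX prefix).

```cpp
#include <cstdint>
#include <vector>

// Hypothetical helper (not JIT code): encodes
//     crc32 r32, dword ptr [disp32]
// In 64-bit mode, ModRM mod=00 rm=101 means RIP-relative addressing, so an
// absolute disp32 needs a SIB byte with no base and no index.
std::vector<uint8_t> EncodeCrc32AbsMem32(uint8_t reg, uint32_t disp32)
{
    std::vector<uint8_t> bytes;
    bytes.push_back(0xF2); // mandatory prefix for CRC32
    bytes.push_back(0x0F); // two-byte opcode escape
    bytes.push_back(0x38); // selects the 0F 38 opcode map
    bytes.push_back(0xF1); // CRC32 r32, r/m32 (0xF0 is the r/m8 form)

    // ModRM: mod=00, reg=destination register, rm=100 (a SIB byte follows)
    bytes.push_back(static_cast<uint8_t>(((reg & 7) << 3) | 0x04));

    // SIB: scale=00, index=100 (none), base=101 (disp32 with no base register)
    bytes.push_back(0x25);

    // disp32, little-endian
    for (int i = 0; i < 4; i++)
    {
        bytes.push_back(static_cast<uint8_t>((disp32 >> (8 * i)) & 0xFF));
    }
    return bytes;
}
```

Calling `EncodeCrc32AbsMem32(0, 0x12345678)` yields a 10-byte sequence beginning `F2 0F 38 F1 04 25`, followed by the four address bytes.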

ghost · 2021-10-13T02:59:37Z

Tagging subscribers to this area: @JulieLeeMSFT
See info in area-owners.md if you want to be subscribed.

Author:	BruceForstall
Assignees:	-
Labels:	`area-CodeGen-coreclr`
Milestone:	-

BruceForstall · 2021-10-13T03:00:23Z

@tannergooding @dotnet/jit-contrib PTAL

tannergooding · 2021-10-13T15:00:58Z

src/coreclr/jit/emitxarch.cpp

I'm not quite sure this is the right fix....

I would have expected EncodedBySSE38OrSSE3A to be returning true here, because CRC32 is F2 0F 38 F0 or F2 0F 38 F1.

However, looking at https://github.com/dotnet/runtime/blob/745fa1c7b30f1b9107084f0baecc325527fa9561/src/coreclr/jit/emitxarch.cpp#L1583-L1611, it might be failing because the "prefix" is 0xF2 and not 0x66.

0x66, 0xF2, and 0xF3 are all encounterable "prefix" bytes here, and this is probably something that was missed when instructions covering the other two were added. In particular:

0x66 - the most prominent; covers every instruction we support today except CRC32
0xF2 - only used for SSE42.CRC32
0xF3 - no instructions exposed here yet; it is used by:
    ADX.ADOX
    AESKLE.AESDEC128KL
    AESKLE.AESDEC256KL
    AESKLEWIDE_KL.AESDECWIDE128KL
    AESKLEWIDE_KL.AESDECWIDE256KL
    AESKLE.AESENC128KL
    AESKLE.AESENC256KL
    AESKLEWIDE_KL.AESENCWIDE128KL
    AESKLEWIDE_KL.AESENCWIDE256KL
    AESKLE.ENCODEKEY128
    AESKLE.ENCODEKEY256
    KL.LOADIWKEY
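
To make the prefix point concrete, the following is a small sketch that classifies a raw legacy-encoded byte sequence by its mandatory prefix and opcode map. It is an illustration only: `ClassifyLegacyEncoding`, `OpcodeMap`, and `EncodingClass` are hypothetical names, the JIT stores encodings in its own packed form rather than as a byte array, and any REX byte between the prefix and the `0F` escape is ignored here.

```cpp
#include <cstddef>
#include <cstdint>

enum class OpcodeMap { Other, Map0F38, Map0F3A };

struct EncodingClass
{
    uint8_t   mandatoryPrefix; // 0x66, 0xF2, 0xF3, or 0 when none is present
    OpcodeMap map;
};

// Hypothetical helper: accepts all three mandatory prefixes instead of
// assuming that a 0F 38 / 0F 3A instruction always starts with 0x66.
EncodingClass ClassifyLegacyEncoding(const uint8_t* bytes, size_t len)
{
    size_t  i      = 0;
    uint8_t prefix = 0;

    if ((len > 0) && ((bytes[0] == 0x66) || (bytes[0] == 0xF2) || (bytes[0] == 0xF3)))
    {
        prefix = bytes[0];
        i      = 1;
    }

    if ((i + 1 < len) && (bytes[i] == 0x0F))
    {
        if (bytes[i + 1] == 0x38)
        {
            return {prefix, OpcodeMap::Map0F38};
        }
        if (bytes[i + 1] == 0x3A)
        {
            return {prefix, OpcodeMap::Map0F3A};
        }
    }
    return {prefix, OpcodeMap::Other};
}
```

With this shape, `F2 0F 38 F1` (CRC32) and `F3 0F 38 F6` (ADOX) land in the same map as `66 0F 38 00` (PSHUFB) and differ only in the recorded prefix.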

BruceForstall · 2021-10-13

> I'm not quite sure this is the right fix

Do you mean you don't know if this fixes the issue? Or you think there might be a more general fix? (I did of course verify that it fixes the issue. It's also nice that it's very simple and contained, as I want to port this to .NET 6.)

More details:

EncodedBySSE38OrSSE3A returns false for crc32 because crc32 is not in the SSE or AVX instruction range, so IsSSEOrAVXInstruction returns false. (Not sure why it's defined that way; tzcnt/lzcnt/popcnt also aren't in that set.)

Looking through the code base, there are 17 places that use the condition (EncodedBySSE38orSSE3A(ins) || (ins == INS_crc32)) and only 8 that use the condition (EncodedBySSE38orSSE3A(ins)). In particular, the places without the || (ins == INS_crc32) logic handle emission of:

IF_RRW_ARD_CNS, IF_RWR_ARD_CNS, IF_RWR_RRD_ARD_CNS, IF_RWR_RRD_ARD_RRD
IF_RRW_SRD_CNS, IF_RWR_SRD_CNS, IF_RWR_RRD_SRD, IF_RWR_RRD_SRD_CNS, IF_RWR_RRD_SRD_RRD
IF_RRW_MRD_CNS, IF_RWR_MRD_CNS, IF_RWR_RRD_MRD, IF_RWR_RRD_MRD_CNS, IF_RWR_RRD_MRD_RRD

Since crc32 is a binary operator, none of these apply, so we could let EncodedBySSE38OrSSE3A handle crc32 as well (and rename it to match).

The only question then is whether Is4ByteSSEInstruction should also include it (that's the only other use of EncodedBySSE38OrSSE3A). That's a little trickier to unravel: there is a case in emitOutputRR I'm not sure about, a case in emitOutputInstr dealing with IF_RRW_RRW_CNS that doesn't matter, and a case in emitGetAdjustedSize to fix. Presumably we could define it as:

    bool emitter::Is4ByteSSEInstruction(instruction ins)
    {
        return (!UseVEXEncoding() && EncodedBySSE38orSSE3A(ins)) || (ins == INS_crc32);
    }

as crc32 is not affected by VEX encoding.
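
The two ideas in this comment can be sketched side by side. Everything below is a toy stand-in rather than JIT code: the `instruction` enum is reduced to a few members, the two existing predicates are hard-coded, `EncodedBy0F38or0F3A` is a hypothetical name for the renamed predicate, and the VEX state is passed as a parameter instead of being read from the emitter.

```cpp
// Toy stand-ins for illustration only; the real enum and tables are much larger.
enum instruction
{
    INS_add,
    INS_addps,   // SSE, but not in the 0F 38 / 0F 3A maps
    INS_pshufb,  // SSE, 0F 38 map
    INS_palignr, // SSE, 0F 3A map
    INS_popcnt,  // outside the SSE/AVX range
    INS_crc32    // outside the SSE/AVX range, yet encoded in the 0F 38 map
};

// Stand-in for the range check: crc32 and popcnt are not in the range.
bool IsSSEOrAVXInstruction(instruction ins)
{
    return (ins == INS_addps) || (ins == INS_pshufb) || (ins == INS_palignr);
}

// Stand-in for the existing predicate: false for crc32 because of the range check.
bool EncodedBySSE38orSSE3A(instruction ins)
{
    return IsSSEOrAVXInstruction(ins) && ((ins == INS_pshufb) || (ins == INS_palignr));
}

// Hypothetical renamed predicate that folds the repeated
// "|| (ins == INS_crc32)" test into one place.
bool EncodedBy0F38or0F3A(instruction ins)
{
    return EncodedBySSE38orSSE3A(ins) || (ins == INS_crc32);
}

// The Is4ByteSSEInstruction variant from the comment above: crc32 counts
// regardless of VEX because it has no VEX form.
bool Is4ByteSSEInstruction(instruction ins, bool useVexEncoding)
{
    return (!useVexEncoding && EncodedBySSE38orSSE3A(ins)) || (ins == INS_crc32);
}
```

Call sites that currently test `EncodedBySSE38orSSE3A(ins) || (ins == INS_crc32)` would then use the single folded predicate, assuming none of the remaining call sites depend on crc32 being excluded.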

tannergooding · 2021-10-13

> Do you mean you don't know if this fixes the issue

I meant this as, there might be additional locations that need equivalent fixups; however, adjusting the existing method that handles SSE38 and SSE3A encodings may be a better overall fix.

> EncodedBySSE38OrSSE3A returns false for crc32 because crc32 is not in the SSE or AVX instruction range

Ah I see, that would impact this as well: https://github.com/dotnet/runtime/blob/main/src/coreclr/jit/instrsxarch.h#L615

For what it's worth, it is actually an SSE4.2 instruction (with an SSE38-based encoding), even if it's not a vector instruction, and should likely be in the list (much as BMI1 and BMI2 are in the overall AVX range). I'm guessing that crc32, tzcnt, lzcnt, and popcnt fell into some special category that forced them out of the SSE/AVX ranges, possibly due to more complicated fixups that would've been required. In particular:

crc32 is explicitly SSE4.2

tzcnt is explicitly BMI1

lzcnt is implicitly BMI1 on Intel; it's separate because it has its own bit for compat with AMD, which exposed it as part of ABM ~2007 (so 6 years earlier)

popcnt is implicitly SSE4.2 on Intel; it's separate for the same reason as lzcnt
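
As a rough illustration of the "separate bit" point, the sketch below decodes the CPUID feature bits that gate the four instructions listed above. The bit positions are the architectural ones; `CpuidSnapshot`, `ScalarIsaSupport`, and `DecodeScalarIsaSupport` are hypothetical names, and the register values are assumed to have been captured elsewhere (this code does not execute CPUID itself).

```cpp
#include <cstdint>

// Raw register values, assumed to be captured elsewhere.
struct CpuidSnapshot
{
    uint32_t leaf1Ecx;    // CPUID.01H:ECX
    uint32_t leaf7Ebx;    // CPUID.(EAX=07H, ECX=0):EBX
    uint32_t extLeaf1Ecx; // CPUID.80000001H:ECX
};

struct ScalarIsaSupport
{
    bool crc32;  // gated by SSE4.2
    bool popcnt; // has its own bit
    bool lzcnt;  // has its own bit (ABM on AMD)
    bool tzcnt;  // gated by BMI1
};

ScalarIsaSupport DecodeScalarIsaSupport(const CpuidSnapshot& regs)
{
    ScalarIsaSupport s;
    s.crc32  = ((regs.leaf1Ecx >> 20) & 1) != 0;   // SSE4.2
    s.popcnt = ((regs.leaf1Ecx >> 23) & 1) != 0;   // POPCNT
    s.lzcnt  = ((regs.extLeaf1Ecx >> 5) & 1) != 0; // LZCNT / ABM
    s.tzcnt  = ((regs.leaf7Ebx >> 3) & 1) != 0;    // BMI1
    return s;
}
```

Because popcnt and lzcnt have their own bits, a snapshot can report them independently of SSE4.2 and BMI1.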

Based on your comments above; it sounds like this fix is fine for .NET 6. It's probably worth seeing if we should adjust the overall handling for .NET 7 to properly account for the SSE38 prefix difference (as that is likely required for future instruction expansion regardless).

BruceForstall · 2021-10-13T17:53:31Z

@tannergooding Do you have any further changes you think are required for this, or are you willing to approve this PR?

tannergooding

I think this looks good for a minimal fix.

I think, in terms of potential future instructions and the like, we may need a more complex fix that properly recognizes CRC32 as EncodedBySSE38.

BruceForstall · 2021-10-13T19:09:58Z

/backport to release/6.0

github-actions · 2021-10-13T19:10:11Z

Started backporting to release/6.0: https://github.com/dotnet/runtime/actions/runs/1338896860

ghost added the area-CodeGen-coreclr label (CLR JIT compiler in src/coreclr/src/jit and related components such as SuperPMI) Oct 13, 2021

BruceForstall requested a review from tannergooding October 13, 2021 03:00

tannergooding reviewed Oct 13, 2021

BruceForstall force-pushed the FixCrc32Encoding branch from 745fa1c to 2e286ba October 13, 2021 15:26

tannergooding approved these changes Oct 13, 2021

BruceForstall merged commit 5b1ebf7 into dotnet:main Oct 13, 2021

github-actions bot mentioned this pull request Oct 13, 2021

[release/6.0] Fix CRC32 encoding #60360

Merged

ghost locked as resolved and limited conversation to collaborators Nov 13, 2021

BruceForstall deleted the FixCrc32Encoding branch December 28, 2022 01:13
