Skip to content

Rewrite the Haswell SROT/DROT kernel tail loop with AVX2 to get consistent FMA rounding#5660

Merged
martin-frbg merged 1 commit intoOpenMathLib:developfrom
martin-frbg:issue5658
Mar 4, 2026
Merged

Rewrite the Haswell SROT/DROT kernel tail loop with AVX2 to get consistent FMA rounding#5660
martin-frbg merged 1 commit intoOpenMathLib:developfrom
martin-frbg:issue5658

Conversation

@martin-frbg
Copy link
Collaborator

as suggested by Ilayn in #5658 , and using his code sample
fixes #5658

@martin-frbg martin-frbg added this to the 0.3.32 milestone Mar 3, 2026
@martin-frbg martin-frbg merged commit db6bbc7 into OpenMathLib:develop Mar 4, 2026
103 of 105 checks passed
@ilayn
Copy link
Contributor

ilayn commented Mar 4, 2026

Thank you for the fix @martin-frbg !

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

(S/D)ROT Haswell kernel does not round the tail as its SIMD body.

2 participants