Improve performance of genericWordQuotRem2Op (#22966)
Implements the algorithm from compiler-rt's udiv128by64to64default. The rewrite yields a roughly 24x runtime improvement on AArch64 (and likely on any other architecture that uses genericWordQuotRem2Op).
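For readers unfamiliar with the referenced routine, here is a minimal sketch of the schoolbook 128-by-64-bit division it is based on (two 32-bit quotient digits, each estimated and then corrected). It is written in C because the compiler-rt routine is C. It is not the Cmm that Prim.hs generates, and the names `QuotRem64`, `clz64` and `quot_rem2_sketch` are hypothetical, chosen only for illustration. It assumes `v != 0` and `hi < v`, so that the quotient fits in 64 bits.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical result pair, mirroring the (quotient, remainder) of quotRem2. */
typedef struct { uint64_t quot; uint64_t rem; } QuotRem64;

/* Portable count of leading zero bits; x must be nonzero. */
static unsigned clz64(uint64_t x) {
    unsigned n = 0;
    while ((x & (UINT64_C(1) << 63)) == 0) { x <<= 1; n++; }
    return n;
}

/* Divide the 128-bit value (hi:lo) by v. Requires v != 0 and hi < v. */
QuotRem64 quot_rem2_sketch(uint64_t hi, uint64_t lo, uint64_t v) {
    const uint64_t b = UINT64_C(1) << 32;   /* digit base: 2^32 */
    assert(v != 0 && hi < v);

    /* Normalize so that the divisor's top bit is set. */
    unsigned s = clz64(v);
    uint64_t un64, un10;
    if (s > 0) {
        v <<= s;
        un64 = (hi << s) | (lo >> (64 - s));
        un10 = lo << s;
    } else {                                /* avoid the undefined shift lo >> 64 */
        un64 = hi;
        un10 = lo;
    }

    /* Split the divisor and the low dividend word into 32-bit digits. */
    uint64_t vn1 = v >> 32, vn0 = v & 0xFFFFFFFFu;
    uint64_t un1 = un10 >> 32, un0 = un10 & 0xFFFFFFFFu;

    /* First quotient digit: estimate from the top digits, then correct. */
    uint64_t q1 = un64 / vn1;
    uint64_t rhat = un64 - q1 * vn1;
    while (q1 >= b || q1 * vn0 > b * rhat + un1) {
        q1--;
        rhat += vn1;
        if (rhat >= b) break;
    }

    /* Partial remainder; intermediate wraparound modulo 2^64 is intended. */
    uint64_t un21 = un64 * b + un1 - q1 * v;

    /* Second quotient digit, same estimate-and-correct step. */
    uint64_t q0 = un21 / vn1;
    rhat = un21 - q0 * vn1;
    while (q0 >= b || q0 * vn0 > b * rhat + un0) {
        q0--;
        rhat += vn1;
        if (rhat >= b) break;
    }

    QuotRem64 result;
    result.quot = q1 * b + q0;
    result.rem = (un21 * b + un0 - q0 * v) >> s;  /* undo the normalization */
    return result;
}
```

For example, `quot_rem2_sketch(1, 0, 3)` divides 2^64 by 3 and yields quotient `0x5555555555555555` with remainder 1. Each correction loop runs at most twice, so the routine performs a fixed, small number of 64-bit operations.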
Changed files:
- compiler/GHC/StgToCmm/Prim.hs: 172 additions, 46 deletions
- testsuite/tests/numeric/should_run/all.T: 1 addition, 0 deletions
- testsuite/tests/numeric/should_run/quotRem2Large.hs: 2250 additions, 0 deletions