(Derived) Ord instances generate terrible code

added compiler perf deriving needs triage pointer tagging labels

mentioned in commit 96a74164

mentioned in issue #16578

removed compiler perf label

added Tbug label and removed needs triage label

added compiler perf label

This seems like an important issue. Could it be scheduled for GHC 9.4?!

changed milestone to %9.4.1

Yes, I agree that getting decent code for Ord is fairly important.

One point @AndreasK. Suppose we have

f x y = case x of (a,b,c,d,e) -> 
        case y of (p,q,r,s) ->   a+b+c+d+e+p+q+r+s

There is a danger than in Cmm we'll

do the eval on x
load a,b,c,d,e into register
then spill them all to the Haskell stack
then do the eval on y
then load a,b,c,d,e,p,q,r,s and add them.

It would be better to sink those loads on x down past the eval of y. Do we do that? Relevant to your work on sinking (#20334). This is a very common pattern, and if we expect it to work we should document it clearly in a Note, and have regression tests.

(Slightly by-the-way to the Ord code.)

Since the tuple elements would be boxed elements it's not that simple.

It would be more like:

do the eval on x
load a,b,c,d,e into register
then spill them all to the Haskell stack
then do the eval on y
...
then load a (from stack)
then do the eval on a
then unbox a, spill the result to stack.
then load b
then do the eval on b
then unbox b, add a+b, spill the result to stack.
...

This repeating until we added them all.

I guess the question is can we defer loading a from x. Probably. I think the is the same problem as in #20333

Ping @nineonine who might be interested in this (similar to #17240 (closed)/!6955 (closed)).

mentioned in issue #20333

marked this issue as related to #16578

Doesn't seem like we have made any progress for 9.4.1 but will remilestone for the next release.

changed milestone to %9.6.1

removed milestone %9.6.1

Could we get a priority tag for this issue, e.g. high?!

Could we get a priority tag for this issue, e.g. high?!

High priority = will make a significant difference to lots of users. Does anyone have evidence that effort invested here would have high reward? At the moment we have no idea if we'd get a big perf win at all, let alone if that perf win would benefit our users.

(Derived) Ord instances generate terrible code

Child items ...

Activity

(Derived) Ord instances generate terrible code

Relates to

Activity