Optimize Pretty for infinite ribbon case

You have assigned yourself, Ben. Does that mean you will work on this? On what timescale?

Hi, may I know if this is something I could try to work on?

@dexterleng, absolutely, this is an open task. Do let me know if you would like further guidance.

Do let me know if you would like further guidance.

That would be great.

Could you point to the part of the code where the length computation takes place? I'm currently reading the Pretty print paper.

Which version of Pretty shall I work on? The one in utils or the repo?

Optimize Pretty for infinite ribbon case

Could you elaborate on what the "infinite ribbon case" is?

@dexterleng, the "infinite ribbon case" refers to the rendering configuration where the ribbon width is infinite (that is, we have no desire to insert line breaks that weren't explicitly present in the document). This is the case used by the GHC code generator, which produces very large documents.

It shouldn't be a problem to work on the pretty library on GitHub. We should be able to easily port any changes made there to GHC.

Could you point to the part of the code where the length computation takes place?

I'm not sure I follow this question. Which length are you referring to?

@bgamari

Could you point to the part of the code where the length computation takes place?

I'm referring to this:

This would be fine except it means that codegen performance regresses since it must walk every FastString we print to compute its length when building a Beside node. This is wasted effort in the case of codegen.

EDIT: Looking at compiler/cmm it appears functions like ftext are being used, which use the length function.

https://gitlab.haskell.org/ghc/ghc/blob/master/compiler/utils/Pretty.hs#L313

https://gitlab.haskell.org/ghc/ghc/blob/master/compiler/cmm/CLabel.hs#L1282

@bgamari what is the use case of pretty during codegen, and why is it "wasted effort" to compute the length of a FastString during codegen?

@dexterleng the code generator uses pretty to emit all of the assembler that we produce. pretty only uses the length of the FastString to inform the layout algorithm. However, when producing assembler there is no reason to perform layout (since the result is going to be consumed by as, not a human).

Looking at compiler/cmm it appears functions like ftext are being used, which use the length function.

Precisely. The Doc type's Beside node always includes its (strictly-evaluated) length.
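To illustrate why a strictly-evaluated length field costs something even when nothing reads it, here is a minimal sketch using a toy node type. It is not GHC's actual Doc; the names StrictDoc, LazyDoc, expensiveLength, and lengthDemanded are hypothetical and exist only for this example. The "length" deliberately raises an error, so we can observe whether it was ever demanded.

```haskell
import Control.Exception (ErrorCall, evaluate, try)

-- Toy stand-ins for a document node; NOT GHC's actual Doc type.
data StrictDoc = StrictText String !Int   -- length is forced when the node is built
data LazyDoc   = LazyText   String Int    -- length stays an unevaluated thunk

-- A stand-in for an expensive length computation that fails loudly
-- if anything ever demands it.
expensiveLength :: String -> Int
expensiveLength _ = error "length was computed"

-- Layout-free rendering: only the text is used, never the length.
renderStrict :: StrictDoc -> String
renderStrict (StrictText s _) = s

renderLazy :: LazyDoc -> String
renderLazy (LazyText s _) = s

-- Fully evaluate a rendered string and report whether doing so
-- demanded the (error-raising) length.
lengthDemanded :: String -> IO Bool
lengthDemanded rendered = do
  r <- try (evaluate (length rendered)) :: IO (Either ErrorCall Int)
  pure (either (const True) (const False) r)
```

With the strict field, merely constructing the node evaluates the length, even though the renderer ignores it. With the lazy field, the length is never computed when rendering skips layout.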

Just ran some benchmarks by compiling cabal with and without hardcoded lengths. I've modified text, ftext, ptext, ztext.

./inplace/bin/ghc-stage2 -ilibraries/Cabal/Cabal libraries/Cabal/Cabal/Setup.hs -fforce-recomp -ddump-timings +RTS -s

MUT timings when computing the length are 132.480s, 134.979s, 137.359s, 138.998s, and 139.445s.

MUT timings without computing the length are 134.351s, 134.739s, 135.198s, 132.811s, 131.600s, and 131.560s.

So a mean of 136.65s vs 133.38s, which is a 2.4% decrease in mutator time.
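For anyone who wants to re-check the arithmetic, the following sketch recomputes the two means and the relative decrease from the timings listed above. The helper names (mean, percentDecrease) are just for this example. Note that the two samples have different sizes (five runs vs six), and that the spread within the first sample is larger than the difference between the means, so the figure is probably best read as indicative rather than conclusive.

```haskell
-- MUT timings in seconds, copied from the runs reported above.
withLength, withoutLength :: [Double]
withLength    = [132.480, 134.979, 137.359, 138.998, 139.445]
withoutLength = [134.351, 134.739, 135.198, 132.811, 131.600, 131.560]

-- Arithmetic mean of a non-empty list.
mean :: [Double] -> Double
mean xs = sum xs / fromIntegral (length xs)

-- Relative decrease from 'before' to 'after', in percent.
percentDecrease :: Double -> Double -> Double
percentDecrease before after = (before - after) / before * 100
```

Evaluating `percentDecrease (mean withLength) (mean withoutLength)` gives roughly 2.4.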

mentioned in merge request !1675 (closed)

@bgamari could you list some potential solutions to address this?

What if we just had

ftext0width :: FastString -> Doc
ftext0width s = textBeside_ (PStr s) 0 Empty

and use that in CodeGen? Or extend Doc for a variant of TextBeside that is lazy?

data Doc
  = ...
  | TextBesideLazyLength !TextDetails Int Doc
  | ...

textBesideLazyLength_ :: TextDetails -> Int -> RDoc -> RDoc
textBesideLazyLength_ = TextBesideLazyLength


ftext :: FastString -> Doc
ftext s = textBesideLazyLength_ (PStr s) (lengthFS s) Empty

Done, except maybe a few more allocations, which we currently do anyway due to caching of the length of all FastStrings. Or even

data Doc
  = ...
  | TextBesideFastStringNoLength !FastString Doc
  | ...

textBesideFastStringNoLength_ :: FastString -> RDoc -> RDoc
textBesideFastStringNoLength_ = TextBesideFastStringNoLength


ftext :: FastString -> Doc
ftext s = textBesideFastStringNoLength_ s Empty

And re-compute lengthFS every time it is needed.
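The last variant can be sketched end to end with a toy document type. This is an illustration under simplifying assumptions, not GHC's Pretty: String stands in for FastString, and the names TextBesideNoLength, ftextToy, beside, renderLeft, and width are hypothetical. The point is that a layout-free renderer never calls the width function, so only code paths that actually perform layout pay for it.

```haskell
-- Toy model, NOT GHC's Pretty: a text node that stores no width at all.
data Doc
  = Empty
  | TextBesideNoLength String Doc   -- toy analogue of TextBesideFastStringNoLength

-- Stand-in for ftext: building the node never inspects the string's length.
ftextToy :: String -> Doc
ftextToy s = TextBesideNoLength s Empty

-- Horizontal composition for this toy type.
beside :: Doc -> Doc -> Doc
beside Empty q = q
beside (TextBesideNoLength s p) q = TextBesideNoLength s (beside p q)

-- Layout-free rendering: concatenates the text and never asks for a width.
renderLeft :: Doc -> String
renderLeft Empty = ""
renderLeft (TextBesideNoLength s p) = s ++ renderLeft p

-- Width is recomputed on demand, only by callers that do layout.
width :: Doc -> Int
width Empty = 0
width (TextBesideNoLength s p) = length s + width p
```

The trade-off is that a layout-performing renderer may recompute the same width more than once, which is the cost the comment above accepts in exchange for cheaper construction.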

FWIW, prettyprinter's layout algorithm for unbounded page widths has become quite a bit simpler after some refactorings.

I suspect that the situation in pretty is quite different though.

@sgraf812 Your idea is certainly worth a try; it seems like about an hour's work at most.

mentioned in issue #17097

assigned to @sgraf812
