When it gets to the file src-auto-generated/Text/DescriptorProtos/FileOptions.hs it compiles for several minutes using up more and more memory until it finally crashes with out of memory (I have 16 GB + swap)
$ ghc --version8.10.1$ cabal --version3.2.0.0$ git clone https://github.com/k-bx/protocol-buffers.git$ cd protocol-buffers$ git checkout 5986692b2de759458738049a484d5cda72b71f10$ cd descriptor$ cabal update$ cabal new-build........[19 of 29] Compiling Text.DescriptorProtos.FileOptions ( src-auto-generated/Text/DescriptorProtos/FileOptions.hs, /home/bitc/tmp/protocol-buffers/descriptor/dist-newstyle/build/x86_64-linux/ghc-8.10.1/protocol-buffers-descriptor-2.4.13/build/Text/DescriptorProtos/FileOptions.o, /home/bitc/tmp/protocol-buffers/descriptor/dist-newstyle/build/x86_64-linux/ghc-8.10.1/protocol-buffers-descriptor-2.4.13/build/Text/DescriptorProtos/FileOptions.dyn_o )
(...hangs and uses up tons of memory)
Standalone reproducer
The following program is a condensed version of what goes wrong:
{-# LANGUAGE BangPatterns #-}moduleT18140whereclassMergeableawheremergeAppend::a->a->amergeAppend_ab=binstanceMergeableBoolinstanceMergeablea=>Mergeable(Maybea)wheremergeAppend=mayMergemayMerge::(Mergeableb)=>Maybeb->Maybeb->MaybebmayMergeNothingy=ymayMergexNothing=xmayMerge(Justx)(Justy)=Just(mergeAppendxy)dataD=D!(MaybeBool)!(MaybeBool)!(MaybeBool)!(MaybeBool)!(MaybeBool)!(MaybeBool)!(MaybeBool)!(MaybeBool)!(MaybeBool)!(MaybeBool)!(MaybeBool)!(MaybeBool)!(MaybeBool)!(MaybeBool)!(MaybeBool)!(MaybeBool)!(MaybeBool)!(MaybeBool)instanceMergeableDwheremergeAppend(Dx'1x'2x'3x'4x'5x'6x'7x'8x'9x'10x'11x'12x'13x'14x'15x'16x'17x'18)(Dy'1y'2y'3y'4y'5y'6y'7y'8y'9y'10y'11y'12y'13y'14y'15y'16y'17y'18)=D(mergeAppendx'1y'1)(mergeAppendx'2y'2)(mergeAppendx'3y'3)(mergeAppendx'4y'4)(mergeAppendx'5y'5)(mergeAppendx'6y'6)(mergeAppendx'7y'7)(mergeAppendx'8y'8)(mergeAppendx'9y'9)(mergeAppendx'10y'10)(mergeAppendx'11y'11)(mergeAppendx'12y'12)(mergeAppendx'13y'13)(mergeAppendx'14y'14)(mergeAppendx'15y'15)(mergeAppendx'16y'16)(mergeAppendx'17y'17)(mergeAppendx'18y'18)
The behavior is exponential in the number of fields n, roughly 2^n or 3^n.
lines. When I change each of those lines to (Prelude'.undefined) then the file successfully compiles. I changed all of them (from 6 to 18) to (Prelude'.undefined) and then put them back one by one. the compile time becomes longer and longer (feels like exponentially longer)
I can confirm that it built happily with 8.8.3 (using under 1 Gb of RAM) and fails under 8.10.1 with out of memory on a 64 Gb machine with the same architecture as reported.
Here is a self-contained reproducer (it seems). To be compiled with -O1. The critical parts seem to be: strict fields of the datatype and wrapping fields in Maybe (removing any of those make the problem disappear).
Every additional !(Maybe Bool) field of D essentially triples the amount of terms after the second Simplifier run. So clearly unwanted exponential behavior.
I updated the OP with the standalone reproducer. Feel free to change back if your original problem isn't fixed when we fix the reproducer, @bitc.
Apparently the fact that DataCon wrappers live longer now ofr some reason means that the Simplifier inlines join points more aggressively in the definition of mergeAppend, or doesn't create join points in the first place.
Previously (I used 8.4.3), we get this with 4 fields:
T18140.$fMergeableD_$cmergeAppend = \ (w_s14h :: D) (w1_s14i :: D) -> case w_s14h of { D ww1_s14l ww2_s14m ww3_s14n ww4_s14o -> case w1_s14i of { D ww6_s14s ww7_s14t ww8_s14u ww9_s14v -> join { $j_s12I [Dmd=<C(S),1*C1(U(U,U,U,U))>] :: Maybe Bool -> D [LclId[JoinId(1)], Arity=1, Str=<L,U>m, Unf=OtherCon []] $j_s12I (dt_XYH [OS=OneShot] :: Maybe Bool) = join { $j1_s12F [Dmd=<C(S),1*C1(U(U,U,U,U))>] :: Maybe Bool -> D [LclId[JoinId(1)], Arity=1, Str=<L,U>m, Unf=OtherCon []] $j1_s12F (dt1_XYJ [OS=OneShot] :: Maybe Bool) = case ww3_s14n of wild_X1r { Nothing -> case ww4_s14o of wild1_X2E { Nothing -> T18140.D dt_XYH dt1_XYJ ww8_s14u ww9_s14v; Just ipv_s11s -> case ww9_s14v of wild2_Xa { Nothing -> T18140.D dt_XYH dt1_XYJ ww8_s14u wild1_X2E; Just ipv1_s11v -> T18140.D dt_XYH dt1_XYJ ww8_s14u wild2_Xa } }; Just ipv_s11s -> case ww8_s14u of wild1_Xa { Nothing -> case ww4_s14o of wild2_X2J { Nothing -> T18140.D dt_XYH dt1_XYJ wild_X1r ww9_s14v; Just ipv1_X12M -> case ww9_s14v of wild3_X1x { Nothing -> T18140.D dt_XYH dt1_XYJ wild_X1r wild2_X2J; Just ipv2_s11v -> T18140.D dt_XYH dt1_XYJ wild_X1r wild3_X1x } }; Just ipv1_s11v -> case ww4_s14o of wild2_X2K { Nothing -> T18140.D dt_XYH dt1_XYJ wild1_Xa ww9_s14v; Just ipv2_X12N -> case ww9_s14v of wild3_X1y { Nothing -> T18140.D dt_XYH dt1_XYJ wild1_Xa wild2_X2K; Just ipv3_X12V -> T18140.D dt_XYH dt1_XYJ wild1_Xa wild3_X1y } } } } } in case ww2_s14m of wild_X1r { Nothing -> jump $j1_s12F ww7_s14t; Just ipv_s11s -> case ww7_s14t of wild1_Xa { Nothing -> jump $j1_s12F wild_X1r; Just ipv1_s11v -> jump $j1_s12F wild1_Xa } } } in case ww1_s14l of wild_X1r { Nothing -> jump $j_s12I ww6_s14s; Just ipv_s11s -> case ww6_s14s of wild1_Xa { Nothing -> jump $j_s12I wild_X1r; Just ipv1_s11v -> jump $j_s12I wild1_Xa } } } }
Now we get this:
T18140.$fMergeableD_$cmergeAppend = \ (w_sNj :: D) (w1_sNk :: D) -> case w_sNj of ww_sNm { D ww1_sNn ww2_sNo ww3_sNp ww4_sNq -> case w1_sNk of ww5_sNt { D ww6_sNu ww7_sNv ww8_sNw ww9_sNx -> case ww1_sNn of wild_X8 { Nothing -> case ww2_sNo of wild1_X9 { Nothing -> case ww3_sNp of wild2_Xa { Nothing -> case ww4_sNq of wild3_Xb { Nothing -> ww5_sNt; Just ipv_sCI -> case ww9_sNx of { Nothing -> T18140.D ww6_sNu ww7_sNv ww8_sNw wild3_Xb; Just ipv1_sCL -> ww5_sNt } }; Just ipv_sCI -> case ww8_sNw of wild3_Xb { Nothing -> case ww4_sNq of wild4_Xc { Nothing -> T18140.D ww6_sNu ww7_sNv wild2_Xa ww9_sNx; Just ipv1_Xd -> case ww9_sNx of wild5_Xe { Nothing -> T18140.D ww6_sNu ww7_sNv wild2_Xa wild4_Xc; Just ipv2_sCL -> T18140.D ww6_sNu ww7_sNv wild2_Xa wild5_Xe } }; Just ipv1_sCL -> case ww4_sNq of wild4_Xc { Nothing -> ww5_sNt; Just ipv2_Xd -> case ww9_sNx of { Nothing -> T18140.D ww6_sNu ww7_sNv wild3_Xb wild4_Xc; Just ipv3_Xf -> ww5_sNt } } } }; Just ipv_sCI -> case ww7_sNv of wild2_Xa { Nothing -> case ww3_sNp of wild3_Xb { Nothing -> case ww4_sNq of wild4_Xc { Nothing -> T18140.D ww6_sNu wild1_X9 ww8_sNw ww9_sNx; Just ipv1_Xd -> case ww9_sNx of wild5_Xe { Nothing -> T18140.D ww6_sNu wild1_X9 ww8_sNw wild4_Xc; Just ipv2_sCL -> T18140.D ww6_sNu wild1_X9 ww8_sNw wild5_Xe } }; Just ipv1_Xc -> case ww8_sNw of wild4_Xd { Nothing -> case ww4_sNq of wild5_Xe { Nothing -> T18140.D ww6_sNu wild1_X9 wild3_Xb ww9_sNx; Just ipv2_Xf -> case ww9_sNx of wild6_Xg { Nothing -> T18140.D ww6_sNu wild1_X9 wild3_Xb wild5_Xe; Just ipv3_sCL -> T18140.D ww6_sNu wild1_X9 wild3_Xb wild6_Xg } }; Just ipv2_sCL -> case ww4_sNq of wild5_Xe { Nothing -> T18140.D ww6_sNu wild1_X9 wild4_Xd ww9_sNx; Just ipv3_Xf -> case ww9_sNx of wild6_Xg { Nothing -> T18140.D ww6_sNu wild1_X9 wild4_Xd wild5_Xe; Just ipv4_Xh -> T18140.D ww6_sNu wild1_X9 wild4_Xd wild6_Xg } } } }; Just ipv1_sCL -> case ww3_sNp of wild3_Xb { Nothing -> case ww4_sNq of wild4_Xc { Nothing -> ww5_sNt; Just ipv2_Xd -> case ww9_sNx of { Nothing -> T18140.D ww6_sNu wild2_Xa ww8_sNw wild4_Xc; Just ipv3_Xf -> ww5_sNt } }; Just ipv2_Xc -> case ww8_sNw of wild4_Xd { Nothing -> case ww4_sNq of wild5_Xe { Nothing -> T18140.D ww6_sNu wild2_Xa wild3_Xb ww9_sNx; Just ipv3_Xf -> case ww9_sNx of wild6_Xg { Nothing -> T18140.D ww6_sNu wild2_Xa wild3_Xb wild5_Xe; Just ipv4_Xh -> T18140.D ww6_sNu wild2_Xa wild3_Xb wild6_Xg } }; Just ipv3_Xe -> case ww4_sNq of wild5_Xf { Nothing -> ww5_sNt; Just ipv4_Xg -> case ww9_sNx of { Nothing -> T18140.D ww6_sNu wild2_Xa wild4_Xd wild5_Xf; Just ipv5_Xi -> ww5_sNt } } } } } }; Just ipv_sCI -> case ww6_sNu of wild1_X9 { Nothing -> case ww2_sNo of wild2_Xa { Nothing -> case ww3_sNp of wild3_Xb { Nothing -> case ww4_sNq of wild4_Xc { Nothing -> T18140.D wild_X8 ww7_sNv ww8_sNw ww9_sNx; Just ipv1_Xd -> case ww9_sNx of wild5_Xe { Nothing -> T18140.D wild_X8 ww7_sNv ww8_sNw wild4_Xc; Just ipv2_sCL -> T18140.D wild_X8 ww7_sNv ww8_sNw wild5_Xe } }; Just ipv1_Xc -> case ww8_sNw of wild4_Xd { Nothing -> case ww4_sNq of wild5_Xe { Nothing -> T18140.D wild_X8 ww7_sNv wild3_Xb ww9_sNx; Just ipv2_Xf -> case ww9_sNx of wild6_Xg { Nothing -> T18140.D wild_X8 ww7_sNv wild3_Xb wild5_Xe; Just ipv3_sCL -> T18140.D wild_X8 ww7_sNv wild3_Xb wild6_Xg } }; Just ipv2_sCL -> case ww4_sNq of wild5_Xe { Nothing -> T18140.D wild_X8 ww7_sNv wild4_Xd ww9_sNx; Just ipv3_Xf -> case ww9_sNx of wild6_Xg { Nothing -> T18140.D wild_X8 ww7_sNv wild4_Xd wild5_Xe; Just ipv4_Xh -> T18140.D wild_X8 ww7_sNv wild4_Xd wild6_Xg } } } }; Just ipv1_Xb -> case ww7_sNv of wild3_Xc { Nothing -> case ww3_sNp of wild4_Xd { Nothing -> case ww4_sNq of wild5_Xe { Nothing -> T18140.D wild_X8 wild2_Xa ww8_sNw ww9_sNx; Just ipv2_Xf -> case ww9_sNx of wild6_Xg { Nothing -> T18140.D wild_X8 wild2_Xa ww8_sNw wild5_Xe; Just ipv3_sCL -> T18140.D wild_X8 wild2_Xa ww8_sNw wild6_Xg } }; Just ipv2_Xe -> case ww8_sNw of wild5_Xf { Nothing -> case ww4_sNq of { Nothing -> T18140.D wild_X8 wild2_Xa wild4_Xd ww9_sNx; Just ipv3_Xh -> case ww9_sNx of wild7_Xi { Nothing -> ww_sNm; Just ipv4_sCL -> T18140.D wild_X8 wild2_Xa wild4_Xd wild7_Xi } }; Just ipv3_sCL -> case ww4_sNq of wild6_Xg { Nothing -> T18140.D wild_X8 wild2_Xa wild5_Xf ww9_sNx; Just ipv4_Xh -> case ww9_sNx of wild7_Xi { Nothing -> T18140.D wild_X8 wild2_Xa wild5_Xf wild6_Xg; Just ipv5_Xj -> T18140.D wild_X8 wild2_Xa wild5_Xf wild7_Xi } } } }; Just ipv2_sCL -> case ww3_sNp of wild4_Xd { Nothing -> case ww4_sNq of wild5_Xe { Nothing -> T18140.D wild_X8 wild3_Xc ww8_sNw ww9_sNx; Just ipv3_Xf -> case ww9_sNx of wild6_Xg { Nothing -> T18140.D wild_X8 wild3_Xc ww8_sNw wild5_Xe; Just ipv4_Xh -> T18140.D wild_X8 wild3_Xc ww8_sNw wild6_Xg } }; Just ipv3_Xe -> case ww8_sNw of wild5_Xf { Nothing -> case ww4_sNq of wild6_Xg { Nothing -> T18140.D wild_X8 wild3_Xc wild4_Xd ww9_sNx; Just ipv4_Xh -> case ww9_sNx of wild7_Xi { Nothing -> T18140.D wild_X8 wild3_Xc wild4_Xd wild6_Xg; Just ipv5_Xj -> T18140.D wild_X8 wild3_Xc wild4_Xd wild7_Xi } }; Just ipv4_Xg -> case ww4_sNq of wild6_Xh { Nothing -> T18140.D wild_X8 wild3_Xc wild5_Xf ww9_sNx; Just ipv5_Xi -> case ww9_sNx of wild7_Xj { Nothing -> T18140.D wild_X8 wild3_Xc wild5_Xf wild6_Xh; Just ipv6_Xk -> T18140.D wild_X8 wild3_Xc wild5_Xf wild7_Xj } } } } } }; Just ipv1_sCL -> case ww2_sNo of wild2_Xa { Nothing -> case ww3_sNp of wild3_Xb { Nothing -> case ww4_sNq of wild4_Xc { Nothing -> ww5_sNt; Just ipv2_Xd -> case ww9_sNx of { Nothing -> T18140.D wild1_X9 ww7_sNv ww8_sNw wild4_Xc; Just ipv3_Xf -> ww5_sNt } }; Just ipv2_Xc -> case ww8_sNw of wild4_Xd { Nothing -> case ww4_sNq of wild5_Xe { Nothing -> T18140.D wild1_X9 ww7_sNv wild3_Xb ww9_sNx; Just ipv3_Xf -> case ww9_sNx of wild6_Xg { Nothing -> T18140.D wild1_X9 ww7_sNv wild3_Xb wild5_Xe; Just ipv4_Xh -> T18140.D wild1_X9 ww7_sNv wild3_Xb wild6_Xg } }; Just ipv3_Xe -> case ww4_sNq of wild5_Xf { Nothing -> ww5_sNt; Just ipv4_Xg -> case ww9_sNx of { Nothing -> T18140.D wild1_X9 ww7_sNv wild4_Xd wild5_Xf; Just ipv5_Xi -> ww5_sNt } } } }; Just ipv2_Xb -> case ww7_sNv of wild3_Xc { Nothing -> case ww3_sNp of wild4_Xd { Nothing -> case ww4_sNq of wild5_Xe { Nothing -> T18140.D wild1_X9 wild2_Xa ww8_sNw ww9_sNx; Just ipv3_Xf -> case ww9_sNx of wild6_Xg { Nothing -> T18140.D wild1_X9 wild2_Xa ww8_sNw wild5_Xe; Just ipv4_Xh -> T18140.D wild1_X9 wild2_Xa ww8_sNw wild6_Xg } }; Just ipv3_Xe -> case ww8_sNw of wild5_Xf { Nothing -> case ww4_sNq of wild6_Xg { Nothing -> T18140.D wild1_X9 wild2_Xa wild4_Xd ww9_sNx; Just ipv4_Xh -> case ww9_sNx of wild7_Xi { Nothing -> T18140.D wild1_X9 wild2_Xa wild4_Xd wild6_Xg; Just ipv5_Xj -> T18140.D wild1_X9 wild2_Xa wild4_Xd wild7_Xi } }; Just ipv4_Xg -> case ww4_sNq of wild6_Xh { Nothing -> T18140.D wild1_X9 wild2_Xa wild5_Xf ww9_sNx; Just ipv5_Xi -> case ww9_sNx of wild7_Xj { Nothing -> T18140.D wild1_X9 wild2_Xa wild5_Xf wild6_Xh; Just ipv6_Xk -> T18140.D wild1_X9 wild2_Xa wild5_Xf wild7_Xj } } } }; Just ipv3_Xd -> case ww3_sNp of wild4_Xe { Nothing -> case ww4_sNq of wild5_Xf { Nothing -> ww5_sNt; Just ipv4_Xg -> case ww9_sNx of { Nothing -> T18140.D wild1_X9 wild3_Xc ww8_sNw wild5_Xf; Just ipv5_Xi -> ww5_sNt } }; Just ipv4_Xf -> case ww8_sNw of wild5_Xg { Nothing -> case ww4_sNq of wild6_Xh { Nothing -> T18140.D wild1_X9 wild3_Xc wild4_Xe ww9_sNx; Just ipv5_Xi -> case ww9_sNx of wild7_Xj { Nothing -> T18140.D wild1_X9 wild3_Xc wild4_Xe wild6_Xh; Just ipv6_Xk -> T18140.D wild1_X9 wild3_Xc wild4_Xe wild7_Xj } }; Just ipv5_Xh -> case ww4_sNq of wild6_Xi { Nothing -> ww5_sNt; Just ipv6_Xj -> case ww9_sNx of { Nothing -> T18140.D wild1_X9 wild3_Xc wild5_Xg wild6_Xi; Just ipv7_Xl -> ww5_sNt } } } } } } } } } }
Make constructor wrappers inline only during the final phaseFor case-of-known constructor to continue triggering early,exprIsConApp_maybe is now capable of looking through lets and cases.See #15840