Making template-haskell refactorable

added TemplateHaskell core libraries labels

I get the sense that most of these package are only depending on template-haskell for the sake of defining Lift instances (or, in the case of exceptions, to define an instance for the Q data type). My hope is that it would that CPP required to avoid compiling these instances with a stage-0 compiler wouldn't be too horrible to maintain.

I ran into a similar but different issue with !8973, you can see the submodule bumps there which added CPP to stop stage-0 depending on TH.

Why is this a problem with the TH package though and not other wired-in things? What about if we changed where Monad was located for example, I don't think we would run into the same issues.

Yes because we don't recompile base with the stage0 compiler.

Is there a reason we compile template-haskell with the stage0 compiler? It seems to me that the stage1 compiler should depend on the boot template-haskell version (for this reason). I'm having a little play around to see what happens if you try to do this.

I suppose the reason is that modules which turn TH syntax into Hs syntax would need to support multiple template-haskell library versions using CPP if you did this.

If we could avoid build template-haskell with the stage0 compiler, that would be better yes. Even if CPP is required.

I suppose the reason is that modules which turn TH syntax into Hs syntax would need to support multiple template-haskell library versions using CPP if you did this.

Does the stage0 compiler actually need that code to work though? Or is it used only on stage1? I'm just wondering if we could have stage0 depend on template-haskell from the boot compiler, but then use CPP to replace the TH-to-HS implementation with an undefined stub, rather than supporting multiple template-haskell versions.

I'm currently trying out this stubbing out with CPP approach. A complication seems to be that template-haskell the library uses "variable brackets" ie the 'Foo notation, which goes through the bracket type-checking operations in GHC/Tc/Gen/Splice.hs

See this branch: https://gitlab.haskell.org/teo/ghc/-/commits/wip/T23536-teo-stubbing-out

added Pnormal Tquestion labels

marked this issue as related to #23647 (closed)

mentioned in issue #23647 (closed)

mentioned in issue #20828

mentioned in issue #24021

I've been staring for almost two hours at this issue and failed to see what problem it is that we want to solve.

Until I found out about GHC.Builtin.Names.TH. So the wiring-in problematised here has nothing to do with those definitions trueName = 'True in ...TH.Syntax, but everything to do with the compiler expecting certain names to be present in certain modules in the in-tree template-haskell!

Since there are names in template-haskell that are wired-into GHC, it makes great sense that template-haskell is coupled to GHC in a similar manner as base (used to be; now it's ghc-internal.). For example, Just used to live in base:GHC.Maybe in GHC 9.8 (a supported boot compiler), now it lives in ghc-internal:GHC.Maybe. What #22229 (closed) and #20828 presumably tried to do was move around similarly wired-in names in the in-tree template-haskell such as Lift into another module (e.g., Internal). Of course in doing so, we must update the hard-wiring in the compiler in GHC.Builtin.Names.TH. For the boot compiler, we cannot update this wiring! It has been fixed when it was released, just like the hard-wiring into base (now ghc-internal).

So here are some ideas for fixing the situation (that is, enable those refactorings), in order of highest perceived desirability:

Move the wired-in names into ghc-internal to begin with?! That would allow for a reinstallable template-haskell, and it is also where I think they belong; their location and implementation is specific to GHC. Of course that would temporarily amount to the same as (2) or (3) for at least 3 release cycles in order to phase out the old wired-in references to template-haskell. After that, we can freely reinstall template-haskell because it is merely a support library building on ghc-internal, although one that will experience a lot of churn in its implementation. Which is OK when it is isolated to this one library.
If that is not an option: we can operate less granularly and try not to build in-tree template-haskell with boot GHC 9.8, with judicious use of CPP in the boot libraries containers, etc. Perhaps some define NO_TH would suffice. Doing so effectively creates another ghc-internal library, though: the version of template-haskell would be tightly coupled to a particular GHC release because of the hard-wiring, so perhaps less desirable than (1), but a good bridging solution for the phase out.
If that is not possible, we can still try to use CPP to move around definitions inside template-haskell, much like we do in the compiler's code base (MIN_VERSION_ghc) for the range of supported boot compilers in order to do refactorings. This still means that GHC 8.10 is unlikely to build with templat-haskell-2.21 because it is not within the support window of bootstrap compiler versions that GHC 9.8 (or whatever version shipped 2.21) shipped with, hence it presumably won't have any CPP ifdefs to support the paricular hard-wiring of GHC 8.10. It is also a lot of churn on GHC devs, which is why (2) seems preferable.

That said, to get the ball rolling I would recommend @teo to try (2). If we can build the stage0 compiler without template-haskell, there is no need for any CPP whatsoever. In particular, I don't think that teo/ghc@2fcb478d will work; ghc-internal is not a library known to a boot compiler GHC 9.8. If we can manage not to build template-haskell with the boot compiler, we don't need to worry either, not even touch it and keep using 'Foo resolution, which should always resolve to names exported from the base library that template-haskell depends on. (Whether these point into ghc-internal is inconsequential, IIUC.)

The next step then would be to try (1) and move definitions into ghc-internal.

Thanks for taking a look! (2) then (1) sounds like a good shout.

After my previous comment I ended up changing strategy towards something that is a bit closer to your (2). I just use the boot compiler's template-haskell for stage0. This meant that the code in ghc that dealt with converting to/from the TH ASTs needed some CPP to deal with the older template-haskell.

A trickier thing was that the canonical list of GHC extensions lives in ghc-boot-th. As this is a dependency of template-haskell we also had to use the boot version of this (annoying!). So, I ended up writing a bunch of CPP to avoid mentioning new extensions. In retrospect this was the wrong approach, but it got me a working stage1 ghc. Unfortunately stage2 needs some of the new extensions to build.

Tomorrow I'll see if I can make things work by duplicating the extension list (ghc-boot has one copy and ghc-boot-th has a distinct one).

FWIW my rather rough WIP implementation is here: 7079b3e9

I was a bit surprised I didn't have to add CPP to disable generating Lift instances in boot libraries, but so far it seems like everything works fine with them there. Though perhaps there's some trouble lurking that I'll encounter when I fix the extensions issue.

I very recently edited the top post to explain the problem a bit more clearly. @teo I think you are pursuing possible solution 2 from the new top post? The boot libraries are generally already compatible with a wide range of compilers' wired-in template-haskell version, so it's not a surprise to me that they "just work" when you use the bootstrap compiler's template-haskell instead of the in-tree template-haskell.

Thanks for updating it. It looks great! Yes I think solution 2 is the one I'm pursuing.

Thanks. If I understand correctly, (2) and (3) would even allow GHC to use TH, which is nice.

What is also worth noting in this scenario is that the version of template-haskell that stage-1 GHC (I previously called it stage0, but that would be the boot compiler) will be built against and that is listed in the .cabal file is not the same as the version of template-haskell that GHC wires in. The latter is always the in-tree version. It's just as what it used to be for base/ghc-internal, but it's worth clarifying because the matrix of artifacts * different stages is uttlerly confusing.

Do note that my proposal (1) from #23536 (comment 554445) to move all wired-in definitions into ghc-internal has the potential to fix #22229 (closed) as well if we move everything that -XDeriveLift needs there as well. I'll post this idea over there.

Edit: Nevermind; Lift obviously transitiely needs the TH AST and it seems antithetical to move that into ghc-internal when the whole point abou TTG is to decouple the syntax tree from GHC. Besides, #22229 (closed) will easily be solved as described in #22229 (comment 495381) once we fix this issue. So the proposal (1) of mine was a bit unrealistic. Nevertheless, template-haskell now enjoys the same wired-in status as ghc-internal, and every GHC release will fix a particular version of it. Perhaps we can introduce a separate, wired-in package ghc-th-internal that defines Lift and friends. That would still pin down a fixed version of its dependency template-haskell.

Perhaps we can introduce a separate, wired-in package ghc-th-internal that defines Lift and friends

Sounds like we are back to the main proposal in #24021.

I've now implemented what I've described in #23536 (comment 554483) and put up a merge request: !12306 (closed)

It should build now

changed the description

changed title from Compile stage0 without dependency on template-haskell? to Making template-haskell refactorable

mentioned in commit 860a0a1b

mentioned in commit ebec0964

mentioned in commit e6edfa0d

mentioned in commit 986509bb

mentioned in commit 8da93468

mentioned in commit d5c5016c

What strikes me as a bit dangerous about proposal (2)/!12306 (closed) is that the in-tree template-haskell changes will not be tested during bootstrapping at all. Although this is similar to the situation with base, I think we have vastly fewer test that will exercise the #if branches in compiler/GHC/HsToCore/Quote.hs, for example. I suppose we can still rely on head.hackage pipelines building aeson, containers, etc. to squeeze out bugs. Besides, there isn't really an alternative.

mentioned in commit 72ad09ed

Making template-haskell refactorable

Background

Problem

Possible solutions

Child items ...

Activity

Making template-haskell refactorable

Background

Problem

Possible solutions

Relates to

Activity