
Make the design of defaulting explicit

GHC does some "defaulting" on kind and runtime-rep variables, but the design has never really been described. Yet in !6851 (closed), !6866 (merged) it became clear that we didn't really know what we were doing. This ticket sets out a design, and articulates a choice.

See also:

Common ground, especially for inferring the type of a let-bound definition without a type signature

  • There are three places that defaulting can take place

    • GeneralisationDefaulting, when generalising a user-written type or kind signature, or when inferring the type of a (possibly mutually-recursive) let-binding. Mainly done in quantifyTyVars today. This defaulting happens before any quantification takes place; it can thus affect which variables end up quantified in a variable's type.

    • TypeClassDefaulting. Done at top-level only (simplifyTop). Driven by ‘default’ declarations: e.g. with the standard default (Integer, Double), an ambiguous Num a constraint is defaulted by fixing a := Integer.

    • FinalZonkDefaulting. Done by the final zonk. This is a catch-all defaulting step, looking for any variable not yet filled in. Poster-child example: (length @alpha ([] @alpha)) :: Int. It makes absolutely no difference what alpha is. Today, it works by:

      • Default (alpha :: RuntimeRep) := LiftedRep
      • Default (alpha :: Levity) := Lifted
      • Default (alpha :: k) := Any @k
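
The last two sites can be illustrated with a small sketch; the comments describe the intended behaviour, assuming the standard default declaration is in effect:

```haskell
module DefaultingExamples where

-- TypeClassDefaulting: the literal has type  Num a => a,  and the
-- ambiguous 'a' in  show (1 + 2)  is fixed to Integer, driven by the
-- standard  default (Integer, Double)  declaration.
s :: String
s = show (1 + 2)

-- FinalZonkDefaulting: elaborated, this is  length @alpha ([] @alpha);
-- nothing constrains alpha, so the final zonk fills it in
-- (here alpha :: Type, so alpha := Any @Type).
n :: Int
n = length []
```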

Generalisation

Properties of generalisation. Normally, we expect type inference to be order-independent: changing the order of definitions in a file (or, e.g., the order of arguments in a tuple) should not affect type inference. However, this is hard to achieve, precisely because we are not trying to infer principal types (they're too general for most uses). So we hope to be order-independent only when all of the following extensions are in effect. When one of these is not in effect, we offer no guarantees (though we want the algorithm to be convenient in practice):

  • PolyKinds
  • PolyMultiplicities
  • PolyRepresentations
  • MonoLocalBinds
  • NoMonomorphismRestriction

Generalisation. When generalising a type, every local type variable free in the type being examined must be:

  • (P) Promoted to the outer level (thereby becoming global), or
  • (D) Defaulted to a closed type (GeneralisationDefaulting), or
  • (Q) Quantified

Definition: top-level. Generalisation happens at top-level when we are generalising a top-level type signature or inferring the type of a top-level variable definition.

Generalisation consists of these steps, each explicated below:

  1. Determine the types we are going to work with.

    1. taus are the types to be generalised.

    2. theta are the constraints arising from looking at the RHS of the variable declarations (simplifyInfer) or from kind-checking the type signature (kindGeneralize). We further split up theta into singleton sing_theta and non-singleton constraints non_sing_theta. This ensures we quantify over non-singleton classes such as IP. See § Singleton constraints below.

    3. cand_tys is taus ++ non_sing_theta.

  2. Collect the type variables we are working with. These are the deep free vars of cand_tys. (Here, "deep" means that we get kind variables in type variable occurrences.) We do not normalise cand_tys, even though doing so might eliminate some type variables via type family reductions. (Currently, we normalise the type later in mkInferredPolyId, in order to present simple types in e.g. GHCi. We could imagine doing it earlier, which would affect generalisation.)

    1. Only collect local variables; these are the type variables whose level number equals the ambient level. Variables with an outer (= lower) level number are called global variables and will certainly not be generalised.

    2. Carve up these local variables into disjoint subsets, as described below. If a tyvar could end up in multiple subsets, the first bin to catch it wins. A variable in any of these subsets is called a mono tyvar and will not be generalised; each must be promoted (P) or defaulted (D).

      1. Restricted tyvars. The free vars of the non-quantifiable constraints of theta. See "non-quantifiable constraints" below. (This handles the MR.)

      2. Concrete tyvars. Any "concrete" tyvars (ConcreteTv). We never quantify over a concrete tyvar.

      3. Escaping-kind tyvars. The free vars of the kind of the type(s) to be generalised. We cannot quantify over these (#23051 (closed)). See Note [Inferred type with escaping kind] in GHC.Tc.Gen.Bind.

      4. Component tyvars. Avoid unnecessary polymorphism (see "Avoiding unnecessary polymorphism").

        1. The free vars of the kind components of tau (if -XNoPolyKinds)
        2. The free vars of the runtime-rep components of tau (if -XNoPolyRepresentations)
        3. The free vars of the multiplicity components of tau (if -XNoPolyMultiplicities)
  3. Identify and promote (P) the promotable tyvars, according to these rules:

    1. All restricted tyvars are promotable tyvars. (This is necessary to preserve backward compatibility, even though it allows spooky action at a distance.)
    2. If we are not at top-level, the concrete tyvars and component tyvars are promotable tyvars.
    3. Any non-global variable determined by the promotable tyvars unioned with global tyvars using sing_theta (see Section "Determined type variables" below) is a promotable tyvar.

    All promotable tyvars are promoted (P). Promoted variables are now global.

    Why promote in nested definitions? Because this allows us to determine the unquantified variable by call sites, which are all, necessarily, nearby. Example: #17201: a = I# (let x = undefined in x). Here we should be able to infer x :: Int#.

  4. Identify and default (D) the defaultable tyvars, according to these rules:

    1. Any remaining (i.e. unpromoted) mono tyvar is defaultable.
    2. Any non-global (and thus also non-promoted) variable determined by the mono tyvars unioned with global tyvars using sing_theta (see Section "Determined type variables" below) is a defaultable tyvar.

    All defaultable tyvars are defaulted (D).

    Why default at top level? Because this ensures no "spooky action at a distance" where the type of a top-level function is fixed by its call sites. It ensures that top-level functions always get fully-settled types.

  5. If any defaulting took place in step (4), then: re-simplify theta. Reason: the defaulting might make some constraints soluble by instances.

    In principle, solving some of these constraints could expose some new constraints with functional dependencies and therefore lead to new tyvars being determined (as in step 3.3 or 4.2). However, this is rare. This is another way the infelicity documented in "determined type variables" below might arise. We could imagine looping, repeating 2, 3, and 4 if any simplifications happen. (Why 2? Because simplification might lead to new non-quantifiable constraints.) We do not do this loop, currently.

  6. Identify and skolemise (Q) the quantifiable type variables.

    1. The remaining local type variables (that have neither been promoted nor defaulted) are quantifiable.
    2. Any non-global variable obtained by growing the set of quantifiable type variables using sing_theta (using growThetaTyVars, not closeWrtFunDeps) is also quantifiable.
    3. The constraints over which we quantify are the subset of theta that mentions the qtvs.

    All quantifiable (Q) type variables are skolemised. No defaulting here (unlike today).
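
As a compact walk-through (a sketch, assuming NoPolyRepresentations and a top-level binding), the steps above play out on the simplest example as follows:

```haskell
-- Inferring  f x = x  at top level:
--   Step 1: taus = {alpha -> alpha}, theta = {}, with alpha :: TYPE rho
--   Step 2: local tyvars are alpha and rho; rho is a mono tyvar
--           (it is free in a runtime-rep component, and is ConcreteTv)
--   Step 3: we are at top level, so rho is not promoted
--   Step 4: rho is defaulted: rho := LiftedRep
--   Step 5: no constraints to re-simplify
--   Step 6: alpha is quantified
-- Result:  f :: forall (a :: TYPE LiftedRep). a -> a
f x = x
```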

Note carefully that we do not default all type variables of kind RuntimeRep, nor do we restrict quantification over them. Rather, we promote and default the runtime-rep components as defined above. This makes #17201 work as expected. E.g.

f1 :: forall (p :: RuntimeRep -> Type) (r :: RuntimeRep). p r -> p r
f1 x = x

-- Inferred type g1 :: forall (p :: RuntimeRep -> Type) (r :: RuntimeRep). p r -> p r
g1 = f1

When inferring g1's type we will quantify over r :: RuntimeRep; in this example there are no RuntimeRep components.

Singleton constraints (from Step 1.2 above)

A singleton constraint is one for which we guarantee coherence. We should strive to generalise non-singleton constraints as much as possible, to allow the caller to determine the evidence as opposed to making a unilateral decision at the definition site.

This is important in particular for implicit parameters, as we want to allow the following (test tc219):

{-# LANGUAGE ImplicitParams, NoMonomorphismRestriction #-}

bar = show ?c

foo1 = let { ?c = 'x' } in bar
foo2 = let { ?c = True } in bar

This means we should NOT make use of non-singleton constraints when deciding which type variables are determined by other type variables in closeWrtFunDeps.

Finding the free variables (from Step 2 above)

Question: Do we look for variables after synonym expansion or before? And also what happens if we do some solving for type family reductions?

Example around type synonyms:

type P a = Int
x :: P a
x = 5
f y = if y then x else 6

GHC 9.2 says f :: forall {a}. Bool -> P a.

Answer: It doesn't really matter. We might end up getting an extraneous free variable (by not expanding a type synonym or type family application), but that causes no harm. Recall that generalised type variables are inferred and thus are not subject to visible type instantiation. In practice we do not expand type synonyms, but we believe that doing so would have no effect on users (beyond, say, :info).

Local and global type variables (from Step 2.1 above)

  • A local type variable has a level number equal to the ambient level of the binding; equivalently, these are the variables that do not appear in the environment.
  • A global type variable has a level number strictly less than the ambient level of the binding; equivalently, these are free in the environment.
  • Local variables are candidates for quantification. Global variables are definitely not available for quantification.

Example of something that is (surprisingly, perhaps) not a local variable:

tau = alpha[0] -> Int
[W] C alpha[0] beta[1]
class C a b | a -> b

Here, beta is not a local variable, because it does not appear free in tau. The [W] C alpha beta constraint will end up in the residual implication. beta does not need to be promoted. Later, if we learn alpha[0], when the solver looks at the residual implication, it will choose beta accordingly. beta is not a candidate for generalisation.

Avoiding unnecessary polymorphism (from Step 2.2.4 above)

  • The kind components of a type ty are those kinds ki where

    • ki is the kind of a unification variable free in ty. (A skolem is certain to be global.)
    • ki is the kind of a local-forall’d binder in ty. e.g. (forall (a::ki). blah) -> Int
    • ki is the kind of a coercion in ty. e.g. ty |> (co :: ki)
  • The “runtime-rep components” of a type ty are a set of types (all of kind RuntimeRep) that are

    • The argument r of TYPE r, which is itself a kind component of ty
  • The “multiplicity components” of a type ty are a set of types (all of kind Multiplicity) that appear as the multiplicity argument of (->) in ty.
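
As an illustration (the type below is hypothetical), suppose beta :: TYPE r is a unification variable and we examine:

```haskell
-- ty = (forall (a :: ki). a -> Int) -> (Int %m -> beta)
--
-- Kind components:         ki       (kind of a local-forall'd binder)
--                          TYPE r   (kind of the unification variable beta)
-- Runtime-rep components:  r        (argument of the kind component TYPE r)
-- Multiplicity components: m        (multiplicity argument of (->))
```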

All of these are a bit arbitrary. For example a kind component might be F (Proxy (TYPE r)), where F removes the Proxy, but we do not consider r to be a runtime-rep component. We're just trying to get the vastly common cases right here -- we won't get other cases right.

Non-quantifiable constraints (from Step 2.2.1 above)

These are the non-quantifiable constraints:

  1. If the MR applies, all constraints.
  2. HasCallStack constraints
  3. If -XNoFlexibleContexts: any constraint that does not look like C a b (c Int) (d e). (Note that arguments can be applications of type variables, just not applications of tycons.) Actually, we seem not to do this. See #10608 (closed) and #10351 (closed).
  4. A nominal equality that does not have a type family at the head on one side or the other.

Determined type variables (from Steps 3.3 and 4.2 above)

Definition. A type variable tv is determined by a set gtvs using theta iff, given specific instantiations for gtvs, no more than one instantiation for tv can possibly satisfy theta.

This happens via fundeps: given class C a b | a -> b and a constraint C t1 t2 where tyvars(t1) are all in gtvs, the injective-tyvars(t2) are determined by gtvs. We must be careful to compute a fixpoint here in order to find all determined tyvars.

Recall that equality constraints ~ have a bidirectional fundep, as if we had class a ~ b | a -> b, b -> a.

We avoid quantifying over determined tyvars so as to avoid Henry Ford polymorphism ("any type a as long as it's Int"). Henry Ford polymorphism is not just unnecessary, but it can be actively harmful due to an infelicity in the solver. If we end up inferring a type quantified over, say, C Int a (where C has the fundep above and we have instance C Int Bool), then we will have a residual implication like forall a. C Int a => C Int a. The solver might see the [W] C Int a, notice the functional dependency, and then spit out [W] Bool ~ a, which is not soluble. This does not happen in practice, but is a potential problem with Henry Ford polymorphism. (In the olden days, we had Derived constraints, and the fundep would give rise to [D] Bool ~ a, which the solver would then discard if unsolved.)

Note that we cannot conclusively determine all determined variables, according to the declarative definition above. This is because further solving might uncover new functional dependencies that we do not currently know. So we might miss some determined variables, infer Henry Ford polymorphism, and then fall afoul of the problem in the paragraph just above. We propose not losing sleep over this remote possibility.
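
A small illustrative instance of the Henry Ford problem (the class and instance here are hypothetical):

```haskell
{-# LANGUAGE FunctionalDependencies, FlexibleInstances #-}
module HenryFord where

class C a b | a -> b where
  methC :: a -> b

instance C Int Bool where
  methC = (> 0)

-- Were inference to produce  g :: forall b. C Int b => Int -> b
-- ("any b, as long as it's Bool"), the residual implication
--   forall b. C Int b => C Int b
-- could lead the solver, via the fundep and the instance, to emit the
-- insoluble  [W] Bool ~ b.  Because b is determined (via the fundep),
-- we promote or default it rather than quantify over it.
```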

Defaulting during generalisation (from Step 4 above)

When defaulting a type variable (a :: ki) in Step 4 of generalisation, we use the kind:

  • (alpha :: Type) := Type
  • (alpha :: RuntimeRep) := LiftedRep
  • (alpha :: Levity) := Lifted
  • (alpha :: Multiplicity) := Many

If none of these apply, we

  • Report an error
  • Set alpha to be a SkolemTv -- that is, a new, top-level constant

We don't want alpha to stay as a unification variable, because it appears free in the type of the function f we are generalising, and we don't want the calls of f to subsequently fix alpha. Making it a skolem solves that. We are going to report an error anyway. NB: skolems carry their own SkolemInfo, so an unbound skolem of this sort should not cause trouble.

NB: We do no type family simplification when figuring out the kinds of unification variables above. If we did, there could be subtle order dependencies among the defaulted variables, when defaulting one variable allows the kind of another variable to reduce to e.g. Type. But we just don't do this. (We do expand type synonyms, as usual.)

Note. I nearly wrote "Report an error -- unless alpha is already mentioned in an insoluble constraint". Sam suggested

-- coerce :: forall r (a :: TYPE r) (b :: TYPE r). Coercible a b => a -> b
f :: forall rr (a :: TYPE rr). a -> a
f = coerce

We get an unsolved alpha[conc] ~ rr[sk]. But that's fine. We are not generalising (because this function has a type signature) so none of the stuff in this section applies.

A worry. This kind-driven defaulting is vulnerable to order-sensitivity. E.g. what if the kind is (F Int)?

Note. At one stage we said "We always default multiplicity variables (with NoPolyMultiplicities); we never promote them. Why? It is very hard to learn more about a multiplicity from usage sites." But

  • What does "multiplicity variables" mean? Perhaps "variables free in a multiplicity component and not free anywhere else"?
  • It seems very ad hoc.

Maybe treating them uniformly is OK?


Old stuff now out of date

To default (D) we take two steps: For each kind component ki

  1. Promote its free variables, just as in (P)
  2. Add a new wanted constraint e.g. rr ~ LiftedRep (RAE: We have agreed that this step is Just Wrong: It does not take into account that the runtime-rep component might be F Int where F Int = UnliftedRep, for example.)

Notes

  • If rr is a local unification variable, a very common case, we can just unify rr := LiftedRep. This is just an efficiency shortcut. But if we can't unify it we still emit the equality constraint, which ensures that even if ki is F blah, which doesn’t yet reduce, we’ll check that it eventually reduces to LiftedRep, the default.

Similarly for

  • each runtime-rep component rr (constraint is rr ~ LiftedRep);
  • each multiplicity component m (constraint: m ~ Many);
  • each kind component (with NoPolyKinds) (constraint Concrete# ki);

We do not do this for Concrete# constraints, which can be controlled by the class-based default mechanism.

Alternative, less preferred defaulting strategy

An alternative approach to defaulting is to use kind-directed defaulting during quantification. This is what is done today, where quantifyTyVars takes variables it cannot quantify and tries to default them:

  • (alpha :: Type) := Type (with NoPolyKinds)
  • (alpha :: RuntimeRep) := LiftedRep
  • (alpha :: Levity) := Lifted
  • (alpha :: Multiplicity) := Many

The problem with this approach is that it requires looking at the kind during quantification. If that kind is F beta, where beta will be learned later, we might miss an opportunity to default, because we learn only later that, say, F beta really is Multiplicity. This alternative seems bound to be fragile (this can be observed in GHC today). Example:

{-# LANGUAGE RankNTypes, StandaloneKindSignatures #-}
{-# LANGUAGE DataKinds, LinearTypes, TypeFamilies, PartialTypeSignatures #-}
module Bug where
import Data.Kind
import GHC.Exts
import GHC.Types
import Data.Proxy

type Multy :: () -> Type
type family Multy a where
  Multy '() = Multiplicity

type Fun :: forall (a :: ()) -> Multy a -> Type -> Type -> Type
type family Fun a m where
  Fun '() m = FUN m

myId1 :: Proxy _u -> Fun _u _ a a
myId1 (Proxy :: Proxy '()) x = x

myId2 :: Proxy '() -> Fun '() _ a a
myId2 (Proxy :: Proxy '()) x = x

GHC infers myId1 to be multiplicity polymorphic, because it doesn't learn that _u := '() until after kind-generalizing the type. On the other hand myId2 is multiplicity-monomorphic.

The Choice: promoting vs defaulting

Finally we have a free design choice: should we promote, or default? That is, do we add the extra constraints used for defaulting? (We always promote; the only question is whether to also add those extra constraints.)

Simon prefers (D) Default. Reasons

  1. It makes reasoning more local. You can figure out what a type signature means without looking to see how it is used.

  2. It eliminates the nasty possibility of an inferred type like f :: forall a. Foo LiftedRep a => blah, where we have an instance Foo LiftedRep t. This can happen if we infer f :: forall a. Foo rr a => blah, where rr is still a unification variable, ultimately set to LiftedRep

On point (2) note that we have this possibility today:

class C a b where
  meth :: a -> b -> ()

instance C a Integer where
  meth _ _ = ()

x = 5 -- MR

f z y = (meth z y, x + y)

GHC infers f :: C a Integer => a -> Integer -> ((), Integer), which is exactly what point (2) worries about.

Richard prefers (P) Promote. Reasons:

  • It allows GHC to accept more programs. Example:
f x = x
g () = f 3#

We see from the usage site that f should have type forall (a :: TYPE IntRep). a -> a. Another, more elaborate example is below. Note that the representation-polymorphic Num is a fine idea. See proposal at https://github.com/ghc-proposals/ghc-proposals/pull/30 and ticket #12708.

class Num (a :: TYPE rep) where ...   -- methods as today

someFun :: Int# -> ...
someFun z = ... let frob x y = x + (y * 2) in frob 3 4 + z ...

We need to know that frob should work over unboxed numbers, but this knowledge comes only from usage sites.

  • (P) is more in keeping with the original spirit of Haskell. The monomorphism restriction (MR) imposes a "do not quantify" dictum like e.g. NoPolyRepresentations would. It currently uses strategy (P). But we could imagine (we are not doing this. thought experiment only!) changing the MR to use (D) instead. Then, x = 5 would always be Integer: this is more local and simpler. But it would be a real pain to users, and not just around legacy code.

    I (Richard) want a future where more people use unlifted and unboxed types. (Unlifted especially.) If we promote rather than default, it will be easier to write functions over unlifted and unboxed types.

Compromise: Simon and Richard have both agreed to the following: Promote (P) when a declaration is local; Default (D) when a declaration is top-level. This means that decisions around defaulting can take into account an entire top-level definition -- but no more.

Other points

When kind-generalising an explicit user-written type signature: If we promote (P) we could allow

f :: forall a. a -> a
f @(a :: UnliftedType) x = x

where the use-site in f’s defn fixes the monomorphic (promoted) representation variable in f’s type sig. Or we could default more aggressively to make the “complete” type signature more complete.


Implementation

Turning to implementation, here is a plan for GHC.Tc.Solver.decideQuantification

  1. Identify all defaultable "components", namely

    • (a) kind components (with NoPolyKinds)
    • (b) runtime-rep components (with NoPolyRepresentations)
    • (c) multiplicity components (with NoPolyMultiplicities)
    • (d) the argument of a Concrete# constraint
    • (e) constraints that fall under the Monomorphism Restriction, e.g. Num a
  2. Promotion and defaulting. For each such component,

    • Promote all its free variables and
    • If we are doing defaulting (D), then for (a-c), emit a constraint, thus
      • (a) Emit Concrete# ki for a kind component ki; this ensures that ki will eventually be concrete.
      • (b) Emit rr ~ LiftedRep for a runtime-rep component rr.
      • (c) Emit mu ~ Many for a multiplicity component mu

    Question: what do we do with the evidence for these equalities? Simply discard it: the equalities are not required in order to ensure the program is well-typed, according to our basic typing rules. (Here, "basic" means our typing rules without the not-quite-compositional levity/representation monomorphism restrictions.) So there's no place we will need the evidence. The equality is there only to force defaulting.

    Note: we say "emit a constraint" but we really mean "take a short cut if possible, and emit a constraint if not". Example: rather than emit rr ~ LiftedRep do unifyDerived rr LiftedRep. That will try to unify rr (which can be an arbitrary type, remember) with LiftedRep. This will often succeed, but perhaps not always; e.g. maybe rr is F alpha and alpha gets unified later; in this latter (rare) case we do emit a constraint.

    Note: we need a short-cut for emitting a Concrete# constraint. For this short cut it might be helpful to default before promoting, because for local unification variables we know that no other constraints will affect them, whereas for global ones we might learn more. Also, if short-cut unification with LiftedRep succeeds, there was no point in promoting (efficiency).

    Note: we need no special handling of variables of type/kind RuntimeRep. Instead, we now treat runtime-rep components specially, not variables.

  3. If we emitted any constraints, simplify the constraints again, in the hope of unlocking further simplification.

    Question: when wanteds can simplify wanteds, maybe that means we won't get to (Eq a, Ord a) and can get rid of mkMinimalBySCs?

  4. Find and promote any tyvars that are functionally dependent (via a constraint) on a promoted (global, non-local) type variable. This step follows (2) because (2) promotes some variables, which might transitively make others determined.

  5. Find which type variables to quantify: all local (non-promoted) tyvars free in tau-tvs, or transitively connected to those by any constraint. (See current decideQuantifiedTyVars.) The "transitively dependent" bit is so that we quantify over b in C a b => a -> a, but not over a in C a => b -> b.

  6. Find which constraints to quantify: the ones that mention any quantified tyvar. Simple.

We should remove all the defaulting from quantifyTyVars, and use a simplified version of (1-5) for the other calls to quantifyTyVars.
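
In outline, the pipeline might look like the following sketch. All of these names (Component, Theta, promoteFreeVars, emitDefaultEq, and so on) are illustrative stand-ins, not GHC's actual API:

```haskell
-- Illustrative sketch only; types and helpers are hypothetical.
decideQuantification :: [Component] -> Theta -> TcM ([TcTyVar], Theta)
decideQuantification comps theta = do
  mapM_ promoteFreeVars comps              -- step 2: promote each component's free vars
  whenDefaulting (mapM_ emitDefaultEq comps)
                                           -- step 2: (D) emit rr ~ LiftedRep, mu ~ Many, ...
  theta' <- simplifyAgain theta            -- step 3: re-simplify if anything was emitted
  promoteDetermined theta'                 -- step 4: promote fundep-determined tyvars
  qtvs   <- decideQuantifiedTyVars theta'  -- step 5: local tyvars (transitively) in tau
  pure (qtvs, filter (mentionsAny qtvs) theta')
                                           -- step 6: keep constraints mentioning a qtv
```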


Examples

We lay out a number of examples here and say how we would like them to behave. As we continue to refine the proposal, we will use these examples as a test-suite.

f1 x = x

We must infer f1 :: forall (a :: TYPE LiftedRep). a -> a.

Jun 2022: We get x :: alpha with alpha :: TYPE rho for a ConcreteTv rho. We will not quantify rho. This is top-level, so we default rho := LiftedRep because rho :: RuntimeRep. Good.

f2 :: a -> a
f2 x = x

We must understand that type signature to be f2 :: forall (a :: TYPE LiftedRep). a -> a.

Jun 2022: When kind-checking f2's signature, we get a :: TYPE rho. rho is free in a runtime-rep component and will not be generalised. This is top-level, so we default rho := LiftedRep because rho :: RuntimeRep. Good.

f3 :: a -> a
f3 = f3

We still want to understand that type signature to be f3 :: forall (a :: TYPE LiftedRep). a -> a, even though the more general f3 :: forall r (a :: TYPE r). a -> a is possible. (RAE: What goes wrong if we infer the more general type here? Nothing, but there seems to be no way to do this while retaining the principle that a type signature is authoritative. A design that allowed generalisation of f3's type would seem to also do the same for f2, which would not work.)

Jun 2022: Same as f2 because the action is all in the type signature.

f3b :: a -> a
f3b (x :: (b :: TYPE IntRep)) = x

It would be possible to allow the type signature to promote the unknown kind of a, learning what it is in the definition. But we choose not to do this: seems more complicated for little gain. Let's reject. This also retains the principle that a type signature stands alone: it tells you the full type of the variable, without any wiggle room.

Jun 2022: We indeed reject.

f4 = (f4 :: a -> a)

The expression signature :: a -> a means forall (a :: TYPE r0). a -> a. It seems inferring types f4 :: forall r (a :: TYPE r). a -> a or f4 :: forall (a :: TYPE LiftedRep). a -> a would both be acceptable.

Jun 2022: When kind-checking that signature, we get a :: TYPE rho. rho is free in a runtime-rep component, and is thus not generalised in the signature (we do normally do kind-generalisation in expression type signatures). We promote rho. We then instantiate f4 :: forall a. a -> a to have type alpha -> alpha (for alpha :: TYPE rho). There are no constraints, so we now must infer the type of f4. rho is free in a runtime-rep component of alpha -> alpha and so is not generalised. The definition of f4 is top-level, and so we default rho := LiftedRep because rho :: RuntimeRep. Done. (We could imagine not looking in runtime-rep components in simplifyInfer -- and thus inferring a more general type here -- but there seems to be little incentive for this wrinkle. Let's not.)

f4b :: forall (a :: TYPE IntRep). a -> a
f4b = (f4b :: b -> b)

The behaviour of this example depends on our defaulting-vs-promoting strategy for expression type signatures. Either choice seems palatable here. If we default in expression type signatures, this will be rejected. If we promote, this will be accepted.

Jun 2022: The signature b -> b becomes forall b. b -> b (by the renamer). Then we infer b :: TYPE rho. rho is free in a runtime-rep component and so is not generalised. The expression signature is not top-level, and so we promote (no defaulting yet). We soon unify rho := IntRep and all is well.

f5 :: forall (a :: TYPE (F Int)). a -> a
f5 = f5

We want to accept, with f5 having the type given. The user is asking for something unusual, but let's not deny them. Note that f5 x = x (or any other definition that binds the argument) would be rejected, as the type is insufficiently concrete.

f6 :: forall x (a :: TYPE (F x)). a -> a
f6 = f6  -- cannot bind the argument here

This, too, should be accepted.

f7 :: forall r (a :: TYPE r). a -> a
f7 = f7   -- cannot bind the argument here

This must be accepted in order to support user-written levity polymorphism.

f8 = (undefined :: forall (a :: TYPE (F Int)). a -> a)

It's not clear what to do here. The type given might just be too exotic to infer, especially if there is no equation for F Int. Rejection is OK. It should probably be accepted if F Int = LiftedRep. Maybe even if F Int reduces to some other concrete RuntimeRep.

f9 = (undefined :: forall r (a :: TYPE r). a -> a)

This should be accepted, but with f9 :: forall (a :: TYPE LiftedRep). a -> a inferred. The RHS will just be instantiated appropriately.

f10 = (f10 :: forall r (a :: TYPE r). a -> a)

This should be rejected, as f10's type is not as general as its expression type signature.

f11 = (undefined :: forall x (a :: TYPE (F x)). a -> Proxy x -> a)

This is unclear. Should probably treat like f8: maybe accept and maybe not, but f11 is probably more specific than the type given would suggest.

type Phant x = LiftedRep
f12 = (f12 :: forall x (a :: TYPE (Phant x)). a -> Proxy x -> a)

This should be accepted. We should do our analysis after expanding type synonyms. If we did it beforehand, then we might think we need to promote or default x, which would make f12 more specific than its expression type signature.

type family TF1 a where
  TF1 _ = LiftedRep
f13 = (f13 :: forall x (a :: TYPE (TF1 x)). a -> Proxy x -> a)

This is unclear. Do we normalise type families before determining free variables? If we do, then we're liable to be affected by givens (or the lack thereof) in our analysis.

f14 :: forall (r :: RuntimeRep) (a :: TYPE r). a -> a
f14 = g
  where
    g :: b -> b
    g = g

This is unclear. We know that the variable b in the type of g should not be representation-polymorphic. But could its representation be the r in scope? Put another way, if we're not quantifying a runtime-rep component, does that mean that the runtime-rep component is necessarily concrete?

Properties

If the above examples are unit tests against our ideas, then here are property-based tests.

Order independence. The type inferred for a variable definition should not depend on whether variables have been unified eagerly or are delayed. Similarly, it should not depend on whether type families have been reduced or not.

Edited by sheaf