Counterintuitive ScopedTypeVariables behaviour

I have just noticed that with ScopedTypeVariables one can write the following bizarre definition:

    -- Inferred signature:
    -- add :: Num a => a -> a -> a
    add (x :: a) (y :: b) = x + y

The reason for this behaviour is that

    in all patterns other than pattern bindings, a pattern type
    signature may mention a type variable that is not in scope; in this
    case, the signature brings that type variable into scope.

https://downloads.haskell.org/~ghc/latest/docs/html/users_guide/glasgow_exts...

and the justification is that

    Bringing type variables into scope is particularly important for
    existential data constructors

But this leads to a rather puzzling user experience. Was it really not possible to design this extension in a way that would allow bringing new type variables into scope locally, but not allow them to unify with other type variables from outer scopes?

To be more concrete, what I would like to see is a design where `k` below is allowed, because the type `a` (bound within a pattern) does not need to unify with anything, but `k2` is not allowed, because `a` would be forbidden to unify with `b`. `k3` would also be forbidden (the type variables called `a` are actually distinct), as would `k4` (the type variable for the inferred signature would be fresh and distinct from `a`).

I believe this design might lead to much less puzzling behaviour. Would it be possible (within a new extension, of course) or have I overlooked something?

Tom

    data T = forall a. MkT [a]

    k :: T -> T
    k (MkT [t::a]) = MkT t3
      where (t3::[a]) = [t,t,t]

    data T2 a = MkT2 [a]

    k2 :: T2 b -> T2 b
    k2 (MkT2 [t::a]) = MkT2 t3
      where (t3::[a]) = [t,t,t]

    k3 :: T2 a -> T2 a
    k3 (MkT2 [t::a]) = MkT2 t3
      where (t3::[a]) = [t,t,t]

    k4 (MkT2 [t::a]) = MkT2 t3
      where (t3::[a]) = [t,t,t]

I think this behavior is obscured by the way Haskell uses the term “type variable” to refer to three different things:

* Variables that refer to types and are bound within the type namespace.
* Skolem type constants introduced by a `forall`.
* Solver/unification variables, aka “metavariables”, which aren’t really types at all but holes to be filled in with types.

These are all subtly different, but they are so frequently treated as if they were the same that I suspect listing them out is not actually enough for many people to understand the distinctions! So let me elaborate.

Suppose we view `forall` like a type-level lambda, so `forall a. a -> a` is analogous to this term:

    \a -> a :-> a

Clearly, there is a difference between the variable `a` in the above expression and the value we eventually substitute for it when we apply the lambda. When we instantiate the function by doing, say, `id @Int`, we substitute `Int` for `a` to get `Int -> Int`. So `a` itself isn’t a type, it’s a name that refers to a type—a type variable.

Under this interpretation, there’s nothing wrong with your example. In

    add (x :: a) (y :: b) = x + y

the variable `a` names `x`’s type, and the variable `b` names `y`’s type, and there’s nothing at all wrong with having two different names that refer to the same thing. It would be like writing this term:

    \t -> let { a = t; b = t } in a :-> b

So why does it seem confusing? Well, because if we write

    add :: Num t => a -> b -> t

we get a type error, not two aliases for t. This is because GHC will implicitly quantify all the variables in the type signature to get

    add :: forall t a b. Num t => a -> b -> t

so `a` and `b` are now distinct, “lambda-bound” variables, not aliases for `t`.

This implicit quantification means we don’t often think about the type variables we write in type signatures as being like term variables, which are just names that reference other things. Rather, we think of type variables as being like type constants, and in fact GHC encourages this interpretation in its error reporting.

To explain, consider what happens when GHC checks `add` against the type we wrote above. Somehow, it needs to figure out the types of `x` and `y`, but it can’t just say

    x :: a
    y :: b

because those variables were bound by the `forall` (they are “lambda-bound”), and if we move them into some other context where they appear free, they no longer have any meaning. GHC has to somehow pick actual types for x and y, not type variables. A naïve idea would be to invent two new solver variables, aka “holes to be filled in with inferred types”, yielding something like

    a1, b1 fresh
    x :: a1
    y :: b1

but this doesn’t work. Imagine we did that while typechecking this function:

    bad :: a -> a
    bad _ = True

If we invented a new solver variable to use in place of `a` while typechecking `bad`, GHC would just say “oh, I can fill in the hole with Bool” and carry on, which would be quite bad! The whole point of parametricity is that the function must work with any type.

So GHC actually does something else. Behind the scenes, it creates two new type definitions, distinct from all other types in the program:

    data FakeTypeA
    data FakeTypeB

Now it typechecks the function using these fake types substituted for `a` and `b`:

    x :: FakeTypeA
    y :: FakeTypeB

Now when GHC goes to typecheck `x + y`, it will raise a type error, since FakeTypeA and FakeTypeB are obviously not the same type. This enforces parametricity.

Of course, it would be very confusing to the user to see names like “FakeTypeA” in their type errors. So GHC pulls some sleight of hand, and it prints the new types it generates as just “a” and “b”, the same as the forall-bound variables it created them for. These don’t look like type constants, because in source Haskell, all type constants start with UpperCase. But behind the scenes, they really are type constants, not type variables. GHC calls these fake type constants “rigid variables” or “skolem variables,” but don’t be fooled: they’re not variables at all, they’re constants.

GHC tells this white lie to keep programmers from having to think about all this behind-the-scenes business. When GHC says “could not match ‘a’ with ‘b’,” it’s natural for programmers to think “ah, these are the ‘a’ and ‘b’ I wrote in my type signature,” and they learn to treat “type variables” and “rigid type constants” as one and the same. They don’t think about how this is a meaningfully different definition of “variable” than what they’re used to at the term level. This works out okay because the only way to bind type variables in source Haskell 98/2010 is universal quantification.

But now ScopedTypeVariables and ExistentialQuantification enter the picture, and there’s a problem: we suddenly need to be able to bind type variables in a place that isn’t `forall`. Now we have type variable binding sites that look just like type variable uses, but they aren’t implicitly quantified the way they are in standalone type signatures, and confusion ensues.

This confusion actually goes much deeper than you might expect. Because Haskell historically hasn’t syntactically distinguished type variable uses from binding sites, we have new ambiguous situations like this:

    \(x :: Maybe a) -> ...

Is this a type annotation on x, which states that x should have type `Maybe a`, where `a` is a type variable already in scope? Or is it a pattern binding that matches against x’s type, which asserts that it has the shape `Maybe t` and binds a new variable `a` to the type `t`? And it gets even worse! Suppose we have this:

    f (Foo @a x) = ...

Is `@a` a visible type application of `Foo` to the in-scope type `a`, instantiating the (universally-quantified) `Foo` constructor at a particular type? Or is it a binder for an existential variable “stored inside” `Foo`? Both are reasonable (and useful) interpretations.

A GHC proposal that presents a design that sorts all these ambiguities out would be most welcome. To my knowledge, no such proposal currently exists, but I could be wrong—I know this issue has been discussed on some proposals (such as visible type applications in patterns), so maybe someone has come up with something I’m not aware of.
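To see the first of those ambiguities concretely, compare these two definitions (a minimal sketch, assuming ScopedTypeVariables on GHC 8.8 or later):

    {-# LANGUAGE ScopedTypeVariables #-}

    -- No `a` is in scope here, so the pattern signature *binds* `a`
    -- to the argument's element type, and `a` ends up naming Int:
    g1 (x :: Maybe a) = x :: Maybe Int   -- accepted: `a` aliases Int

    -- Here `a` is already bound by the explicit forall, so the same
    -- syntax is a *use*: it merely constrains x to the type `Maybe a`:
    g2 :: forall a. Maybe a -> Maybe a
    g2 (x :: Maybe a) = x

Alexis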

Adding to my previous email, I’ve noticed that the behavior I’ve described appears to be inconsistent with the documentation. From the GHC 8.10.1 User’s Guide, § 9.17.1, Lexically scoped type variables » Overview (http://downloads.haskell.org/ghc/8.10.1/docs/html/users_guide/glasgow_exts.h...):

    The design follows the following principles:

    * A scoped type variable stands for a type variable, and not for a
      type. (This is a change from GHC’s earlier design.)

    * Furthermore, distinct lexical type variables stand for distinct
      type variables. This means that every programmer-written type
      signature (including one that contains free scoped type
      variables) denotes a rigid type; that is, the type is fully known
      to the type checker, and no inference is involved.

    * Lexical type variables may be alpha-renamed freely, without
      changing the program.

Emphasis mine. This suggests that your program should not be accepted (and that GHC shouldn’t work the way I describe, even if that’s how it works in practice today), so either the current implementation is buggy, or the documentation is buggy. I am not sure which.
I have opened GHC issue #18591 https://gitlab.haskell.org/ghc/ghc/-/issues/18591 to track this inconsistency. Alexis

…and now Krzysztof Gogolewski has already chimed in with an explanation:
    The documentation is outdated (it was written in 2006). Scoped type
    variables can use arbitrary types since #15050 (closed):
    https://github.com/ghc-proposals/ghc-proposals/blob/scoped-type-variables-ty...
So the issue is a documentation bug, and my explanation is how GHC is intended to operate. The linked proposal includes a little additional context, including a short paper, Type variables in patterns https://arxiv.org/pdf/1806.03476.pdf (published at Haskell ’18). Mystery resolved—though the docs still need fixing. Alexis

I added some follow up comments to the ticket you opened. I don't agree with the analysis. Best I can tell (with some confidence, after also testing with GHC 8.0, which predates the above proposal), what's happening is instead:

1. The function has no explicit type signature; the below is just a term-level formula for its value:

       add (x :: a) (y :: b) = x + y

2. The type signature is inferred as usual:

       add :: Num c => c -> c -> c

   unifying the type `c` with the types of both of the terms `x` and `y`.

3. But the formula contains two pattern type signatures, binding the lexical type variable `a` to the type of `x` and the lexical type variable `b` to the type of `y`.

4. However, the types of `x` and `y` are both `c` (really some arbitrary fixed type variable, modulo alpha-renaming).

5. Therefore, `a` and `b` are both the same type, not by virtue of type inference between `a` and `b`, but by virtue of simple pattern matching to an ultimately common type.

The only thing that changed after the proposal was a broadening of the acceptable pattern matching, to allow the pattern to match something other than a type *variable*.

Before:

    f :: a -> Int
    f (x :: b) = 1    -- OK: b ~ a

    f :: Int -> Int
    f (x :: b) = 1    -- Bad: b ~ Int, not a type variable!

After:

    f :: a -> Int
    f (x :: b) = 1    -- OK: b ~ a

    f :: Int -> Int
    f (x :: b) = 1    -- OK: b ~ Int

However, Tom's example works both with GHC 8.0.x -- 8.6.x, which fall into the *before* case, and with GHC 8.8 and up, which are in the *after* case.
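A quick way to see (5) in action: since `a` and `b` are mere aliases for the one declared type, they can be used interchangeably in the body (a sketch; assumes ScopedTypeVariables):

    {-# LANGUAGE ScopedTypeVariables #-}

    -- Both pattern-bound variables alias the single signature type `c`,
    -- so cross-annotating with the "other" name still typechecks:
    add :: Num c => c -> c -> c
    add (x :: a) (y :: b) = (x :: b) + (y :: a)

-- Viktor.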

What you say may be true, but the behavior you describe is still inconsistent with the documentation. After all, the documentation states these things must all be simultaneously true:

1. Scoped type variables stand for type variables, not arbitrary types.
2. Distinct scoped type variables correspond to distinct type variables.
3. The type variables are always rigid.

Tom’s program can’t possibly obey all three of these things, because that would imply that `x :: a` and `y :: b` where `a` and `b` are distinct skolems, so `add` could not possibly typecheck. But it does—so it sounds like older versions of GHC just diverged from the documentation in other, less significant ways.
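Indeed, if `a` and `b` really were distinct skolems, the analogous program with two genuinely distinct variables would be rejected (a minimal sketch):

    -- With genuinely distinct, rigid `a` and `b`, this is rejected:
    add' :: (Num a, Num b) => a -> b -> a
    add' x y = x + y
    -- error: Couldn't match expected type ‘a’ with actual type ‘b’

Alexis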

On Wed, Aug 19, 2020 at 04:58:00PM -0500, Alexis King wrote:
> What you say may be true, but the behavior you describe is still inconsistent with the documentation. After all, the documentation states these things must all be simultaneously true:
> 1. Scoped type variables stand for type variables, not arbitrary types.
> 2. Distinct scoped type variables correspond to distinct type variables.
> 3. The type variables are always rigid.
Though perhaps much too careful/complete a read of the documentation is needed to see support in the documentation for the conclusions I ultimately reached, I do believe that the text is on some level there (that is, the behaviour essentially matches the documentation when all of it is read with some care, to some extent "between the lines"), but it could/should be much more explicit.

Specifically, though reading the high-level overview indeed leads to your summary above:

    The design follows the following principles

    * A scoped type variable stands for a type variable, and not for a
      type. (This is a change from GHC’s earlier design.)

      [ This is the part that's later relaxed in #15050 ]

    * Furthermore, distinct lexical type variables stand for distinct
      type variables. This means that every programmer-written type
      signature (including one that contains free scoped type
      variables) denotes a rigid type; that is, the type is fully known
      to the type checker, and no inference is involved.

      [ Items 2 + 3 ]

    * Lexical type variables may be alpha-renamed freely, without
      changing the program.

      [ More of 3. ]

these principles are later refined in a way that reduces the universality of 2 and 3. First we have:

    A declaration type signature that has explicit quantification
    (using forall) brings into scope the explicitly-quantified type
    variables, in the definition of the named function.

which shows that scoped type variables are typically introduced via an explicit "forall", and of course in this case, given "forall a b. ...", the "a" and "b" in question are distinct. But when we get to "9.17.4. Pattern type signatures":

http://downloads.haskell.org/ghc/8.10.1/docs/html/users_guide/glasgow_exts.h...

    In the case where all the type variables in the pattern type
    signature are already in scope (i.e. bound by the enclosing
    context), matters are simple: the signature simply constrains the
    type of the pattern in the obvious way.

    Unlike expression and declaration type signatures, pattern type
    signatures are not implicitly generalised.

    The pattern in a pattern binding may only mention type variables
    that are already in scope.  ...

    However, in all patterns other than pattern bindings, a pattern
    type signature may mention a type variable that is not in scope; in
    this case, the signature brings that type variable into scope.

we see that they are rather different beasts that deviate from the description in the overview. In particular, "2" no longer holds, because these are neither generalised nor universally quantified. Instead, most of the time they are just references to already-bound lexical variables, constraining the type of the term to be the type of some variable already in scope. These are different provided the earlier definitions are different (e.g. forall a b. ...). However, when a pattern type signature brings a new variable into scope, it is simply a type alias for the type of the term, and, by virtue of not being universally quantified or generalised, it is a mere alias of whatever type variable (after #15050, relaxed to also allow actual types) is associated with the term in question.

Now admittedly, this involves some "reading between the lines": inferring that the crucial thing that (unavoidably) gives us "2" and "3" is introduction via universal quantification, and that the overview does not adequately carve out an exemption from that far more common case for the case where a pattern type signature instead just pattern matches the existing type of a term and gives it a name, so we can reuse that name in other terms.
And it is these pattern-matched type aliases that, while not directly unified, might under various conditions refer to the same actual type even when bound to different lexical names, because their bindings at the point of introduction are not to new universally quantified types, but to the known (possibly inferred) types of the terms in the pattern.

So my conclusion is that enough information is present in the documentation to understand the observed behaviour as consistent with the detailed case-by-case breakdown, even while not matching the general overview.
> Tom’s program can’t possibly obey all three of these things because that would imply that `x :: a` and `y :: b` where `a` and `b` are distinct skolems, so `add` could not possibly typecheck. But it does—so it sounds like older versions of GHC just diverged from the documentation in other, less significant ways.
Well, in fact the major deviations from the overview are there even as far back as GHC 7.10. All that changed more recently (GHC 8.8) is a more minor shift to allow the "aliased" term type to be a concrete type, rather than just a type variable.

I do agree that the documentation should be clearer; in particular, the overview is NOT a good (correct) description of lexicals brought into scope via pattern type signatures.

-- Viktor.

I wonder if the rule comes down to simply this? For every variable, or rather every name or identifier (term level, type level, meta level, state variables, any subject matter that lets you create names), you always want to do two things:

* Define it, or at least introduce it (e.g., quantifier, lambda).
* Use it, and it refers to something you defined/introduced.

Perhaps the rule is simply:

* When the LHS has (pattern :: typevar), this defines typevar to be [an alias of] the type of the term that matches that pattern. (BTW, this also introduces term-level vars in the pattern.)

* When the RHS has (termexpr :: typeexpr), type variables in typeexpr are references, use sites. (BTW, term-level vars in termexpr are also references, use sites.)

I guess the latter is marred by the Haskell Report (or NoScopedTypeVariables) rule, e.g., (termexpr :: [t]) is like (termexpr :: forall t. [t]). Still, a reference is involved, it just refers to something else.

Here is an example of my view. Given ScopedTypeVariables,

    f :: forall a. a -> [a]
    f ((x :: b) :: c) = [x :: a, x :: b, x :: c]

I introduce 1 var and define 2 aliases, and go on to use all 3 in chorus.

On Aug 19, 2020, at 18:22, Viktor Dukhovni wrote:

> Though perhaps much too careful/complete a read of the documentation is needed to see support in the documentation for the conclusions I ultimately reached, I do believe that the text is on some level there
I do not agree with your interpretation. :)
> But when we get to "9.17.4. Pattern type signatures" […] we see that they are rather different beasts that deviate from the description in the overview. In particular, "2" no longer holds, because these are neither generalised nor universally quantified.
It is true that such variables are not generalized, but the documentation does not say “generalized”—it says rigid. If we consider that the primary purpose of this feature is to bring existentials into scope, then there is no violation of the principle when used in the intended way. After all, existentials brought into scope via `case` are very much rigid (and distinct from all other variables).

It is true that GHC implements a much more flexible behavior that allows entirely different things to be brought into scope. But that is just part of the divergence from the documentation—there’s nothing that says pattern signatures are exempt from that restriction. It only states that pattern signatures that refer to unbound variables bring those variables into scope, nothing more.

Of course, this is all somewhat immaterial—there is no way to objectively determine which interpretation is “correct,” and the docs are unambiguously unclear regardless, so they should be changed. But purely as a matter of friendly debate, I stand by my interpretation.
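For instance, in the intended existential use, the variable brought into scope really is rigid (a minimal sketch, assuming ScopedTypeVariables and ExistentialQuantification):

    {-# LANGUAGE ScopedTypeVariables, ExistentialQuantification #-}

    data Some = forall a. Show a => Some a

    describe :: Some -> String
    describe s = case s of
      Some (x :: a) -> show x      -- `a` names the rigid existential type
      -- Some (x :: a) -> x ++ ""  -- rejected: rigid `a` is not [Char]

Alexis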

On Fri, Aug 21, 2020 at 06:13:23PM -0500, Alexis King wrote:
> On Aug 19, 2020, at 18:22, Viktor Dukhovni wrote:
>
> > Though perhaps much too careful/complete a read of the documentation is needed to see support in the documentation for the conclusions I ultimately reached, I do believe that the text is on some level there
>
> I do not agree with your interpretation. :)
Fair enough, the text is ultimately not clear. [ TL;DR the rest is inessential detail for the morbidly curious. ]
> > But when we get to "9.17.4. Pattern type signatures" […] we see that they are rather different beasts that deviate from the description in the overview. In particular, "2" no longer holds, because these are neither generalised nor universally quantified.
>
> It is true that such variables are not generalized, but the documentation does not say “generalized”—it says rigid. If we consider that the primary purpose of this feature is to bring existentials into scope, then there is no violation of the principle when used in the intended way. After all, existentials brought into scope via `case` are very much rigid (and distinct from all other variables).
The leap I was making is that it is specifically universal quantification that leaves the design no choice but to make the variables rigid. With "forall a b." naturally "a" and "b" vary *independently*, and so must be kept distinct.

On the other hand, given that pattern type signatures are not universally quantified, there is no longer a *prior requirement* to keep them distinct, and so I am ultimately not surprised that they may not be rigid in the same way as those brought into scope via "forall". Indeed, we can now (as of GHC 8.8) write:

    foo :: Int -> Int -> String
    foo (x :: a) (y :: b) = "(" ++ (show @a x) ++ ", " ++ (show @b y) ++ ")"

    λ> foo 1 2
    "(1, 2)"

where "a" becomes a synonym for "Int" and so does "b", so they are surely not distinct types. But even with GHC 8.0 (7.10 and perhaps older would also be OK were this example changed to avoid TypeApplications), we can write:

    foo :: Show c => c -> c -> String
    foo (x :: a) (y :: b) = "(" ++ (show @a x) ++ ", " ++ (show @b y) ++ ")"

    λ> foo 1 2
    "(1, 2)"

where "a" and "b" are aliases for "c". I don't think you disagree that this is what the actual implementation is doing. My point was just that with some effort one might read the actual behaviour as consistent with the spirit of the documentation, and that the leap required to read it that way may not be too great.
> It is true that GHC implements a much more flexible behavior that allows entirely different things to be brought into scope. But that is just part of the divergence from the documentation—there’s nothing that says pattern signatures are exempt from that restriction. It only states that pattern signatures that refer to unbound variables bring those variables into scope, nothing more.
Yes, the literal text does not actually carve out an exception; I'm making a retroactive extrapolation from the inferred constraints on the implementation, given the context.
> Of course, this is all somewhat immaterial—there is no way to objectively determine which interpretation is “correct,” and the docs are unambiguously unclear regardless, so they should be changed. But purely as a matter of friendly debate, I stand by my interpretation.
Yes, it would be nice for someone to craft a proposed change to the docs. Any volunteers?

My attempt to read the docs liberally was largely in order to better understand the actual implementation, to intuit how and why the actual choice was made, and to see how it might be viewed as consistent with the overall design.

-- Viktor.

The intended semantics here is from the paper (https://richarde.dev/papers/2018/pat-tyvars/pat-tyvars.pdf). See the paragraph right before section 5.6, which describes the change from the outdated documentation (saying that each scoped type variable must refer to a distinct variable) to the current implementation (where a scoped type variable can stand for any type). The idea behind the change is that, when we write f x = not x, we intend for x to stand for any Bool. So, when we write f (x :: a) = not x, we should allow a to stand for any type -- in this case, it will stand for Bool.

Going back to Tom's original post, suggesting a refinement: consider

    data Ex2 = forall a. Ex2 a a

    f (Ex2 (x :: b) (y :: c)) = False

Is that accepted or rejected? I think Tom's rules would accept it, because neither b nor c is bound to a variable from an outer scope. Yet this suggests that b and c are different. Or what about

    data Eql = forall a. (a ~ Int) => Eql a

    g :: Eql -> Int
    g (Eql (x :: b)) = x + x

Hrm. Now it seems we can allow `b` to stand for `a`... but `a` is also the same as Int. So that's confusing. We could also have `a` equal a type variable from an outer scope, thus circumventing the check Tom wants to introduce. It all gets very murky. Simplest to allow variables in that position to unify with any type, as GHC does today.

Regardless, the documentation must be updated. Thanks for making the ticket.
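For what it's worth, the Eql example above is accepted by GHC today, precisely because `b` may stand for the existential `a`, which the given equality then identifies with Int. A compilable sketch (assuming GADTs for the equality constraint):

    {-# LANGUAGE ScopedTypeVariables, ExistentialQuantification, GADTs #-}

    data Eql = forall a. (a ~ Int) => Eql a

    g :: Eql -> Int
    g (Eql (x :: b)) = x + x   -- `b` aliases `a`, and `a ~ Int` is given

Richard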

Thanks, Alexis, for the detailed explanation about how "type variables" in Haskell are doing a sort of "triple duty". I think my unconscious understanding of type variables with distinct names was:

1. they can be chosen independently
2. they don't unify

1 is not true, not least because of MultiParamTypeClasses. I can't choose `a` and `b` completely independently in the following type signature

    foo :: C a b => a -> b

I can only choose them independently to the extent that a `C` instance exists. Moreover, type equality constraints mean that 2 is not true! The `a` and `b` in the following do indeed unify [1]

    bar :: a ~ b => a -> b

The simple world that Hindley-Milner-Damas created for us becomes more complex when extensions that affect the type system come into play.

I'm reassured that type variable bindings in patterns in case alternatives can now bind monomorphic types, that is,

    case () of (() :: t) -> ()

is now permitted. The previous behaviour seemed inconsistent. On the other hand, I'm still somewhat concerned that the meaning of the syntax can vary depending on whether the type variable is in scope or not. That is, I'm not comfortable that the same syntax is used in baz1 to assert that an expression *has a type* as is used in baz2 to *create a fresh name* for a type that an expression has.

    baz1 :: forall a. a -> ()
    baz1 a = case () of (() :: a) -> ()

    baz2 :: forall a. a -> ()
    baz2 a = case () of (() :: b) -> ()

"let" does not suffer this inconsistency. The latter is simply not supported. I realise that some syntax must offer the ability to come up with a fresh name, I'm just not convinced that this overloading is the way to do it. To me it feels like "let x = exp" meaning "if `x` is not in scope then bind the value of `exp` to the variable `x`. If `x` is in scope, then check that it is equal to `exp` and if not then crash."

Anyway, I'm sure that opens a whole new can of worms, so thanks for your clarifying remarks to help get my understanding to here!

Tom

[1] Perhaps I'm conflating an implementation ("unification") with a specification (some semantic notion of what "type equality" means) but I hope my intention here is clear enough.

On Sep 7, 2020, at 8:04 AM, Tom Ellis wrote:

> On the other hand I'm still somewhat concerned that the meaning of the syntax can vary depending on whether the type variable is in scope or not.
Yes. This has been a spot of discomfort for some time (at least for me, and I know I'm not completely alone). A recently accepted but unimplemented proposal (https://github.com/ghc-proposals/ghc-proposals/blob/master/proposals/0285-no...) describes -XNoPatternSignatureBinds, which disallows not-in-scope variables from appearing in pattern signatures. Instead, the variable must come into scope some other way. That "other way" depends on proposals not yet completely formed/accepted, but once the dust has settled, I think you'll get what you want with -XNoPatternSignatureBinds.

Richard

On Sun, Aug 16, 2020 at 01:17:11PM +0100, Tom Ellis wrote:
> -- Inferred signature:
> -- add :: Num a => a -> a -> a
> add (x :: a) (y :: b) = x + y
> [...]
> But this leads to a rather puzzling user experience. Was it really not possible to design this extension in a way that would allow bringing new type variables into scope locally but not allow them to unify with other type variables from outer scopes?
Back to Tom's original question: the answer is "no", because in fact no unification of `a` and `b` is taking place. The type variables `a` and `b` in the above lexical pattern type signatures are not "generalised":

http://downloads.haskell.org/ghc/8.10.1/docs/html/users_guide/glasgow_exts.h...

    Unlike expression and declaration type signatures, pattern type
    signatures are not implicitly generalised.

So what the example definition of `add` says (beyond giving a formula for add x y) is that the lexical type variables `a` and `b` also satisfy:

    a ~ TypeOf(x)
    b ~ TypeOf(y)

but since we've already inferred TypeOf(x) ~ TypeOf(y), bringing `a` and `b` into scope does not change that in any way.
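One can check in GHCi that bringing `a` and `b` into scope leaves the inferred signature exactly as Tom reported at the start of the thread (a sketch; GHC 8.0 or later assumed):

    λ> :set -XScopedTypeVariables
    λ> add (x :: a) (y :: b) = x + y
    λ> :type add
    add :: Num a => a -> a -> a

-- Viktor.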
participants (5):

- Albert Y. C. Lai
- Alexis King
- Richard Eisenberg
- Tom Ellis
- Viktor Dukhovni