
Hello,
You guys have been great taking my questions. Thank you. Now I have another one :)
What can go wrong if I make a Monad instance but don't follow the Monad laws (identity and associativity)? Sure, it would be misleading for someone using the non-conforming instance; they may write code that assumes those laws even though they don't hold. But could it make the compiler generate bad code, or make some standard functions behave badly or even break?
[]'s
Rafael

On 06/25/2014 05:54 AM, Rafael Almeida wrote:
Hello,
You guys have been great taking my questions. Thank you. Now I have another one :)
What can go wrong if I make a Monad instance but don't follow the Monad laws (identity and associativity)? Sure, it would be misleading for someone using the non-conforming instance. They may write code that assumes those laws even though they don't hold. However, could it make the compiler generate bad code
The compiler makes no assumption (at least that I know of) that laws are followed. It should never generate ‘bad’ code (whatever that might mean) anyway.
or some standard functions to behave badly or even break?
Yes, namely they wouldn't do ‘the expected thing’ if they rely on the laws. Also someone might be overeager and use RULES based on just type-class laws.
[]'s Rafael
Basically, if you don't have a monad, don't make it an instance of Monad; it defeats the whole point of having a type class that expects some laws to be satisfied to begin with.
-- Mateusz K.
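To make the RULES remark concrete, here is a hypothetical rewrite rule that is justified only by the left-identity law; the rule name is invented, and in practice GHC is often reluctant to match rules over class methods, so treat this as a sketch of the failure mode rather than a recipe:

    module RulesSketch where

    -- A rewrite rule that is sound only if every Monad instance obeys
    -- left identity (return x >>= f  ==  f x). GHC never checks the law,
    -- so for a law-breaking instance this rule would silently change the
    -- meaning of optimised code.
    {-# RULES
    "monad/left-identity" forall x f. return x >>= f = f x
      #-}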

On Wed, Jun 25, 2014 at 12:33 PM, Mateusz Kowalczyk wrote:
On 06/25/2014 05:54 AM, Rafael Almeida wrote:
What can go wrong if I make a Monad instance but don't follow the Monad laws (identity and associativity)? Sure, it would be misleading for someone using the non-conforming instance. They may write code that assumes those laws even though they don't hold. However, could it make the compiler generate bad code
The compiler makes no assumption (at least that I know of) that laws are followed. It should never generate ‘bad’ code (whatever that might mean) anyway.
The compiler makes assumptions about associativity when de-sugaring do-notation. If the monad laws aren't followed, it's possible for these two blocks to show different behavior (given that a, b, c are all values of the misbehaved Monad instance):
do { a; b; c }
a >> b >> c
I think everyone can agree that this is surprising, at the very least. Although it's not the compiler that's generating bad code here.
John L.
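For readers who want to see this concretely, here is a minimal sketch of a law-breaking instance; the type Skew and the action tick are invented for illustration:

    import Control.Monad (ap, liftM)

    -- A Writer-like type whose (>>=) deliberately breaks associativity:
    -- it doubles the count accumulated on the left before combining.
    newtype Skew a = Skew (Int, a) deriving Show

    instance Functor Skew where
      fmap = liftM

    instance Applicative Skew where
      pure x = Skew (0, x)
      (<*>)  = ap

    instance Monad Skew where
      Skew (n, x) >>= f = let Skew (m, y) = f x in Skew (2 * n + m, y)

    tick :: Skew ()
    tick = Skew (1, ())

    -- do { tick; tick; tick } desugars to  tick >> (tick >> tick) = Skew (5, ())
    -- but  tick >> tick >> tick  parses as (tick >> tick) >> tick  = Skew (7, ())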

On Jun 25, 2014, at 12:52 AM, John Lato wrote:
The compiler makes assumptions about associativity when de-sugaring do-notation. If the monad laws aren't followed, it's possible for these two blocks to show different behavior (given that a,b,c are all values of the misbehaved Monad instance):
do { a; b; c }
a >> b >> c
I think everyone can agree that this is surprising, at the very least. Although it's not the compiler that's generating bad code here.
As far as I know, GHC makes no assumptions about associativity, or any class-based laws. The effect John observes above is accurate, but it is a direct consequence of the design of Haskell in the Haskell 2010 Report, not any assumptions in the compiler.

Specifically, Section 3.14 (https://www.haskell.org/onlinereport/haskell2010/haskellch3.html#x8-470003.1...) says that `do { e; stmts }` desugars to `e >> do {stmts}`. In the case of `do { a; b; c }`, that means we get `a >> (b >> c)`. However, in Table 4.1 in Section 4.4.2 (under https://www.haskell.org/onlinereport/haskell2010/haskellch4.html#x10-800004....), we see that (>>) is *left*-associative, meaning that `a >> b >> c` means `(a >> b) >> c`.

Are the different meanings here an "assumption" of associativity? I suppose one could construe it that way, but I just see `do { a; b; c }` and `a >> b >> c` as different chunks of code with different meanings. If the monad in question upholds the associativity law, then the chunks evaluate to the same result, but they're still distinct.

Richard
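Richard's two citations can be restated as a small sketch (the helper names desugared and parsed are mine):

    -- Section 3.14: do-blocks nest to the right.
    desugared :: Monad m => m a -> m b -> m c -> m c
    desugared a b c = a >> (b >> c)   -- what  do { a; b; c }  means

    -- Section 4.4.2: infixl 1 >>, so bare chains nest to the left.
    parsed :: Monad m => m a -> m b -> m c -> m c
    parsed a b c = (a >> b) >> c      -- what  a >> b >> c  means

For a lawful Monad the two are interchangeable; for a law-breaking one they are observably different programs.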

On Wed, Jun 25, 2014 at 11:14 AM, Richard Eisenberg wrote:
If the monad in question upholds the associativity law, then the chunks evaluate to the same result, but they're still distinct.
Distinct with respect to what equivalence relation?
Also, an object isn't a monad if it isn't associative.

On Jun 25, 2014, at 3:31 PM, Alexander Solla wrote:
If the monad in question upholds the associativity law, then the chunks evaluate to the same result, but they're still distinct.
Distinct with respect to what equivalence relation?
I was thinking of syntactic equality (modulo renaming) on desugared programs. Another way of witnessing that the chunks are distinct is that the sequence of memory states they induce will be different -- even an associative member of the Monad class may have efficiency differences depending on the precise association.
Also, an object isn't a monad if it isn't associative.
You're right, of course. I meant a member of Haskell's Monad class, which may or may not be a monad.

Richard
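The efficiency point is real even for perfectly lawful monads; a small sketch using Writer and tell from the mtl package (the names leftNested and rightNested are mine):

    import Control.Monad.Writer (Writer, execWriter, tell)

    -- Both chains log [1,2,3,4]; the associativity law guarantees equal
    -- results. But the left-nested chain builds ((([1] ++ [2]) ++ [3]) ++ [4]),
    -- re-traversing the accumulated log at every step (quadratic in the length
    -- of the chain), while the right-nested one appends in linear time.
    leftNested, rightNested :: Writer [Int] ()
    leftNested  = ((tell [1] >> tell [2]) >> tell [3]) >> tell [4]
    rightNested = tell [1] >> (tell [2] >> (tell [3] >> tell [4]))

    -- execWriter leftNested == execWriter rightNested  =>  True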

On Wed, Jun 25, 2014 at 3:14 PM, Richard Eisenberg wrote:
Are the different meanings here an "assumption" of associativity? I suppose one could construe it that way, but I just see `do { a; b; c }` and `a >> b >> c` as different chunks of code with different meanings. If the monad in question upholds the associativity law, then the chunks evaluate to the same result, but they're still distinct.
Great explanation! Would you say that the main reason for implementing monads so that (>>) is associative is to make sure the do notation does the "right thing"?
Monad is a notion from math which was imported into Haskell. As far as I know, the reason behind it was to give Haskell the possibility of doing IO while keeping itself pure. Suppose there were a type class just like Monad, with >>=, >>, return and all, that behaved exactly the same, but where no one cared about identity or associativity. It wouldn't be a monad, but it would solve the computation-with-side-effects problem just the same. Why were associativity and identity brought into the language as well? What is there to be gained by such a law which can't be enforced by the compiler?
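One concrete answer to "what is there to be gained": the laws are exactly what make routine refactorings meaning-preserving. A minimal sketch (the name tidy is invented):

    -- By right identity (m >>= return = m), this wrapper is a no-op:
    tidy :: Monad m => m a -> m a
    tidy action = do
      x <- action
      return x

    -- Likewise, associativity is what lets you split one long do-block
    -- into helper actions without changing behaviour. For a law-breaking
    -- instance, either "refactoring" can change what the program does.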

On Sat, Jun 28, 2014 at 10:59 PM, Rafael Almeida wrote:
What is there to be gained by such a law which can't be enforced by the compiler?
There are lots of things that can't be enforced by the compiler; for example, it can't force you to write an Ord instance that implements a correct total ordering. Does this make Ord less useful to you? Or pointless?

On Sun, Jun 29, 2014 at 12:11 AM, Brandon Allbery wrote:
On Sat, Jun 28, 2014 at 10:59 PM, Rafael Almeida wrote:
What is there to be gained by such a law which can't be enforced by the compiler?
There are lots of things that can't be enforced by the compiler; for example, it can't force you to write an Ord instance that implements a correct total ordering. Does this make Ord less useful to you? Or pointless?
No. I'm not trying to say the Monad laws are useless or pointless; rather, I am looking to understand the usefulness of those laws. People are indeed going to expect the (<) operator to be transitive, even though the compiler can do nothing about it. I'm not sure how to explain why we need (<) to be transitive; if you can, please explain that to me as well. I always took it for granted that such a law must be obeyed.

On Sat, Jun 28, 2014 at 11:38 PM, Rafael Almeida wrote:
No. I'm not trying to say Monad laws are useless or pointless. Rather, I am looking to understand the usefulness of those laws.
My point was more that Ord could *also* be different: it could support partial ordering, for example, or it could support non-mathematical ordering, which would e.g. enable Data.Map to be used with things that don't have a mathematical total ordering, like Complex Double, or possibly fix its behavior with Double, whose Ord instance is actually not a proper total ordering. What matters there is not that there is a mathematical total ordering, but that there is *some* kind of reliable ordering that can be used to build a tree.

I imagine that, like Ord, a decision was made to implement the proper mathematical abstraction and not merely a convenient one. This seems to be the "Haskell way". (I'm not sure how it explains Double, though the numeric hierarchy has a lot of compromises in the name of convenience or expected behavior. Possibly Monad was in some sense a reaction to this, even: "we got that one wrong, let's do this one correctly".)
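Brandon's Double example is observable with the containers library; a hypothetical GHCi-style sketch:

    import qualified Data.Map as Map

    -- compare nan k never returns EQ (NaN is neither <, ==, nor > anything),
    -- so a NaN key can be inserted into a Map but never found again:
    nanMap :: Map.Map Double String
    nanMap = Map.fromList [(0 / 0, "nan"), (1, "one")]

    -- Map.lookup (0 / 0) nanMap  =>  Nothing
    -- Map.member (0 / 0) nanMap  =>  False, despite the key being present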

On Sat, Jun 28, 2014 at 10:52 PM, Brandon Allbery wrote:
I imagine that, like Ord, a decision was made to implement the proper mathematical abstraction and not merely a convenient one. This seems to be the "Haskell way". (I'm not sure how it explains Double, though the numeric hierarchy has a lot of compromises in the name of convenience or expected behavior. Possibly Monad was in some sense a reaction to this, even: "we got that one wrong, let's do this one correctly".)
Double, like Int, is a computer construct, not a mathematical one. They both map to hardware types that act like mathematical ones until you look closely, but have better performance than the software constructs that have better behavior.

On Sun, Jun 29, 2014 at 12:15 AM, Mike Meyer wrote:
On Sat, Jun 28, 2014 at 10:52 PM, Brandon Allbery wrote:
I imagine that, like Ord, a decision was made to implement the proper mathematical abstraction and not merely a convenient one. This seems to be the "Haskell way". (I'm not sure how it explains Double, though the numeric hierarchy has a lot of compromises in the name of convenience or expected behavior. Possibly Monad was in some sense a reaction to this, even: "we got that one wrong, let's do this one correctly".)
Double, like Int, is a computer construct, not a mathematical one. They both map to hardware types that act like mathematical ones until you look closely, but have better performance than the software constructs that have better behavior.
Sure, but that doesn't change the fact that IEEE754 NaN in particular means there is no total ordering.
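This is easy to check in GHCi; the results below are what GHC's Eq and Ord instances for Double produce:

    nan :: Double
    nan = 0 / 0

    -- nan == nan       =>  False  (Eq is not reflexive here)
    -- nan < 1          =>  False
    -- nan > 1          =>  False  (no trichotomy)
    -- compare nan nan  =>  GT, since neither (<) nor (==) holds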

On the subject of Double and laws, I imagine it would have been possible to do the kind of thing the SML Basis Library does for equality. SML spells equality '=' and it's not defined on the floating point types, which have a *different* operator, '=='. In the same way, the Prelude could have been structured so that integers and ratios belonged to Ord but floats did not. Floats could have had operators like #< and #= instead. It would have been surprising, but so is x == x being False.

On Sun, Jun 29, 2014 at 4:28 AM,
On the subject of Double and laws, I imagine it would have been possible to do the kind of thing the SML Basis Library does for equality. SML spells equality '=' and it's not defined on the floating point types, which have a *different* operator, '=='.
If you do that, where do you stop? Not all Ints have additive inverses, and most floats have multiple things they can be added to that don't change their value. Can we call both of those +? Or do we need a name for them, or two? Actually, I don't know if SML got it right or not, because I don't know what the "==" operator does. But you should *never* compare floats for equality. While there are exceptions, they are so rare that you're likely never to run into one, so just be safe, and don't do it. So if SML's "==" operator is an equality comparison, they at best got it half right. If it's some kind of programmer-controlled "fuzzy equality", well, maybe. But I think just leaving it out would be a better solution.
In the same way, the Prelude could have been structured so that integers and ratios belonged to Ord but floats did not. Floats could have had operators like #< and #= instead. It would have been surprising, but so is x == x being False.
Is x == x being false really any more surprising than x + 1 == x being true?
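Both phenomena are one-liners with Double; a quick sketch (1e16 is just any value past 2^53, where the gap between adjacent Doubles exceeds 1):

    big :: Double
    big = 1e16

    -- big + 1 == big   =>  True: the +1 is absorbed by rounding
    -- (0/0) == (0/0)   =>  False: NaN is not equal to itself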

Is x == x being false really any more surprising than x + 1 == x being true?
Yes, a lot more. Equality is a reflexive relation, so "x equals x" is true for every x. A logic/language in which that is not true is unsound.
x + 1 == x is just an equation. Coming up with a theory where it holds is straightforward: for example, addition followed by the "fractional part" operation, on the set [0,1).

On Sun, Jun 29, 2014 at 6:05 PM, Alexander Solla wrote:
Is x == x being false really any more surprising than x + 1 == x being true?
Yes, a lot more. Equality is a reflexive relation, so "x equals x" is true for every x. A logic/language in which that is not true is unsound.
x + 1 == x is just an equation. Coming up with a theory where it holds is straightforward: for example, addition followed by the "fractional part" operation, on the set [0,1).
Your theory is wrong. This is the same problem you run into with "x == x" being False: you create theories from assumptions about how these things behave based on your experience with abstract mathematical objects, and there are values for which those assumptions do not hold. Surprise results when you run into those cases. IEEE 754 floats under + and * are *not* ℝ. Or even ℚ. They just play them in the computer. There are simple large subsets of possible operands for which addition (and hence the other three operators) doesn't work like the mathematical realities they model.

On Sun, Jun 29, 2014 at 4:29 PM, Mike Meyer wrote:
On Sun, Jun 29, 2014 at 6:05 PM, Alexander Solla wrote:
Is x == x being false really any more surprising than x + 1 == x being true?
Yes, a lot more. Equality is a reflexive relation, so "x equals x" is true for every x. A logic/language in which that is not true is unsound.
x + 1 == x is just an equation. Coming up with a theory where it holds is straightforward: for example, addition followed by the "fractional part" operation, on the set [0,1).
Your theory is wrong. This is the same problem you run into with "x == x" being False: you create theories from assumptions about how these things behave based on your experience with abstract mathematical objects, and there are values for which those assumptions do not hold. Surprise results when you run into those cases.
You can call it "wrong", but it doesn't make it so. (==) is supposed to model (witness) equality. The laws for Eq are the theory.
IEEE 754 floats under + and * are *not* ℝ. Or even ℚ. They just play them in the computer. There are simple large subsets of possible operands for which addition (and hence the other three operators) doesn't work like the mathematical realities they model.
I didn't say they are. I had not committed myself to any position about floating point numbers. They are certainly degenerate objects, similar to bottom, and do not "model" numbers, specifically because they (floats)

I think the problem here is that you're confusing the theory with the model. You are definitely misusing the terminology, and unfortunately, projecting at us for not understanding it the way you want us to.

On Mon, Jun 30, 2014 at 2:48 AM, Alexander Solla wrote:
On Sun, Jun 29, 2014 at 4:29 PM, Mike Meyer wrote:
On Sun, Jun 29, 2014 at 6:05 PM, Alexander Solla wrote:
x + 1 == x is just an equation. Coming up with a theory where it holds is straightforward: for example, addition followed by the "fractional part" operation, on the set [0,1).
Your theory is wrong. This is the same problem you run into with "x == x" being False: you create theories from assumptions about how these things behave based on your experience with abstract mathematical objects, and there are values for which those assumptions do not hold. Surprise results when you run into those cases.
You can call it "wrong", but it doesn't make it so. (==) is supposed to model (witness) equality. The laws for Eq are the theory.
That is true, but what you said was "coming up with a theory where it [the equation x + 1 == x] holds is straightforward. For example, addition followed by ...". Ah, I see. You're not trying to provide a theory as to why "floats" don't behave like real numbers, but a mathematical operation under which some objects act like "floats". True, any sufficiently sophisticated developer won't have problems with x + 1 == x being true. Then again, that's also true for x == x being false.

On 29/06/2014, at 9:53 PM, Mike Meyer wrote:
On Sun, Jun 29, 2014 at 4:28 AM, wrote:
On the subject of Double and laws, I imagine it would have been possible to do the kind of thing the SML Basis Library does for equality. SML spells equality '=' and it's not defined on the floating point types, which have a *different* operator, '=='.
If you do that, where do you stop?
Well, SML stopped right there.
Not all Int's have additive inverses, and most floats have multiple things they can be added to that don't change their value. Can we call both of those +? Or do we need a name for them or two?
For what it's worth, O'CAML *does* have separate operators for integer addition and floating-point addition.
Actually, I don't know if SML got it right or not, because I don't know what the "==" operator does.
It's really very easy to find on the web.

val == : real * real -> bool
val != : real * real -> bool
val ?= : real * real -> bool

[x == y] returns true if and only if neither x nor y is a NaN and x and y are equal, ignoring signs on zeros. This function is equivalent to the IEEE = operator. The second function, !=, is equivalent to (not o op ==) and the IEEE ?<> operator. [?=] returns true if and only if either argument is a NaN or the arguments are bitwise equal, ignoring signs on zeros. It is equivalent to the IEEE ?= operator.
But you should *never* compare floats for equality.
You are mistaken.
While there are exceptions, they are so rare that you're likely never to run into one, so just be safe, and don't do it.
I have in fact *often* run into the so-called exceptions. Indeed, anyone using JavaScript and wanting to compare two "integers" will in fact be comparing floating point numbers, and they will be fully justified in doing so.
So if SML's "==" operator is an equality comparison, they at best got it half right.
It depends on what you want to call 'an equality comparison'. It is one of the comparison operators specified in the IEEE standard. There is no fuzziness or ambiguity about its semantics. They COULDN'T leave it out and claim IEEE conformance.
If it's some kind of programmer-controlled "fuzzy equality", well, maybe. But I think just leaving it out would be a better solution.
I have seldom seen any kind of programmer-controlled "fuzzy equality" that wasn't either dangerously wrong or dangerously misapplied, and I speak as an old APLer who knows what ⎕CT stands for, where to find it in the standard, and why it is risky. If you leave out ==, some irritated programmer will just define

fun op ==(x, y) = x >= y andalso x <= y

and you're back where you started. To prevent anyone doing that, you have to leave out *ALL* the floating-point comparison operators. That is not sensible.
Is x == x being false really any more surprising than x + 1 == x being true?
Yes. The idea of there being a "number" x such that x + 1 == x is familiar to anyone who has ever met the extended real line. Or who knows that ℵ₀ + 1 = ℵ₀.

On 14-06-28 11:38 PM, Rafael Almeida wrote:
People are indeed going to expect that the < operator is transitive. Even though the compiler can do nothing about it. I'm not sure how to explain why we need < to be transitive. If you do so, please explain that to me as well. I always took it for granted that such law must be obeyed.
Every sub-quadratic-time comparison-based sorting algorithm assumes that comparisons are transitive.
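A sketch of why: hand sortBy a deliberately non-transitive comparison (the names RPS and beats are invented) and the output is no longer meaningful:

    import Data.List (sortBy)

    -- Rock-paper-scissors: each constructor beats exactly one other,
    -- so the comparison cannot be transitive.
    data RPS = Rock | Paper | Scissors deriving (Show, Eq)

    beats :: RPS -> RPS -> Ordering
    beats Rock     Scissors = GT
    beats Scissors Rock     = LT
    beats Scissors Paper    = GT
    beats Paper    Scissors = LT
    beats Paper    Rock     = GT
    beats Rock     Paper    = LT
    beats _        _        = EQ

    -- sortBy assumes a transitive comparison. With this one, different
    -- permutations of the same elements can "sort" to different orders:
    --   sortBy beats [Rock, Paper, Scissors]
    --   sortBy beats [Scissors, Paper, Rock]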
participants (10)
- Albert Y. C. Lai
- Alexander Solla
- Brandon Allbery
- John Lato
- Mateusz Kowalczyk
- Mike Meyer
- ok@cs.otago.ac.nz
- Rafael Almeida
- Richard A. O'Keefe
- Richard Eisenberg