`zip` doesn't work with infinite `Seq`s

older
Re: [Haskell-cafe] type families...

Matteo Ferrando

7 Oct 2014 7 Oct '14

2:57 a.m.

Hello, We are writing a compiler[1] for a course and found that the `zip` function included in the `Data.Sequence` module, `zip :: Seq a -> Seq b -> Seq (a, b)` would hang on the following code:

...

-- using the `zip` from `Data.Sequence` zip ys (fomList $ repeat x)

We checked the implementation[2] in the source of `Data.Sequence` and found the following:

...

zip :: Seq a -> Seq b -> Seq (a, b) zip = zipWith (,)

...

-- Here `zipWith` assumes *non-infinite `Seq`s* zipWith :: (a -> b -> c) -> Seq a -> Seq b -> Seq c zipWith f xs ys | length xs <= length ys = zipWith' f xs ys | otherwise = zipWith' (flip f) ys xs

...

-- Function not exported by `Data.Seq`, assumes `length xs <= length ys` zipWith' :: (a -> b -> c) -> Seq a -> Seq b -> Seq c zipWith' f xs ys = snd (mapAccumL k ys xs) where k kys x = case viewl kys of (z :< zs) -> (zs, f x z) EmptyL -> error "zipWith': unexpected EmptyL"

In the lazy reading of the documentation we did, we didn't find any warning of using infinite `Seq`s for zips. (Maybe there are warings that we didn't see). But looking at the code of `zip` in the `Prelude`:

...

zip :: [a] -> [b] -> [(a,b)] zip (a:as) (b:bs) = (a,b) : zip as bs zip _ _ = []

We see that we could just *pattern-match* both heads, instead of making assumptions. Maybe this should be better explained in the documentation[3] of `zip` for `Seq`s:

...

zip :: Seq a -> Seq b -> Seq (a, b) O(min(n1,n2)). zip takes two sequences and returns a sequence of corresponding pairs. If one input is short, excess elements are discarded from the right end of the longer sequence.

Or just change the implementation for it to work with infinite `Seq`s. For those of you who are curious, we ended up using the following code to fix the *infinite `Seq`s problem*:

...

-- using the `zip` from `Prelude` zip (toList ys) (repeat x)

[1] https://github.com/chamini2/sapphire [2] http://hackage.haskell.org/package/containers-0.5.5.1/docs/src/Data-Sequence... [3] http://hackage.haskell.org/package/containers-0.5.5.1/docs/Data-Sequence.htm...

Attachments:

attachment.html (text/html — 3.1 KB)

Show replies by date

Chris Wong

7 Oct 7 Oct

3:27 a.m.

Hi Matteo, Data.Sequence provides a general-purpose *finite* sequence. There is no such thing as an infinite Seq! In fact, you'll find that while head $ repeat 'a' results in 'a', Seq.head . Seq.fromList $ repeat 'a' never returns. Chris On Tue, Oct 7, 2014 at 3:57 PM, Matteo Ferrando wrote:

...

Hello,

We are writing a compiler[1] for a course and found that the `zip` function included in the `Data.Sequence` module, `zip :: Seq a -> Seq b -> Seq (a, b)` would hang on the following code:

...
-- using the `zip` from `Data.Sequence` zip ys (fomList $ repeat x)

We checked the implementation[2] in the source of `Data.Sequence` and found the following:

...
zip :: Seq a -> Seq b -> Seq (a, b) zip = zipWith (,)

...
-- Here `zipWith` assumes *non-infinite `Seq`s* zipWith :: (a -> b -> c) -> Seq a -> Seq b -> Seq c zipWith f xs ys | length xs <= length ys = zipWith' f xs ys | otherwise = zipWith' (flip f) ys xs

...
-- Function not exported by `Data.Seq`, assumes `length xs <= length ys` zipWith' :: (a -> b -> c) -> Seq a -> Seq b -> Seq c zipWith' f xs ys = snd (mapAccumL k ys xs) where k kys x = case viewl kys of (z :< zs) -> (zs, f x z) EmptyL -> error "zipWith': unexpected EmptyL"

In the lazy reading of the documentation we did, we didn't find any warning of using infinite `Seq`s for zips. (Maybe there are warings that we didn't see). But looking at the code of `zip` in the `Prelude`:

...
zip :: [a] -> [b] -> [(a,b)] zip (a:as) (b:bs) = (a,b) : zip as bs zip _ _ = []

We see that we could just *pattern-match* both heads, instead of making assumptions.

Maybe this should be better explained in the documentation[3] of `zip` for `Seq`s:

...
zip :: Seq a -> Seq b -> Seq (a, b) O(min(n1,n2)). zip takes two sequences and returns a sequence of corresponding pairs. If one input is short, excess elements are discarded from the right end of the longer sequence.

Or just change the implementation for it to work with infinite `Seq`s.

For those of you who are curious, we ended up using the following code to fix the *infinite `Seq`s problem*:

...
-- using the `zip` from `Prelude` zip (toList ys) (repeat x)

[1] https://github.com/chamini2/sapphire [2] http://hackage.haskell.org/package/containers-0.5.5.1/docs/src/Data-Sequence... [3]http://hackage.haskell.org/package/containers-0.5.5.1/docs/Data-Sequence.htm...

_______________________________________________ Haskell-Cafe mailing list Haskell-Cafe@haskell.org http://www.haskell.org/mailman/listinfo/haskell-cafe

Matteo Ferrando

3:49 a.m.

Thanks to Chris and Sam for the explanation! That explains it. On Mon, Oct 6, 2014 at 10:57 PM, Chris Wong wrote:

...

Hi Matteo,

Data.Sequence provides a general-purpose *finite* sequence. There is no such thing as an infinite Seq!

In fact, you'll find that while

head $ repeat 'a'

results in 'a',

Seq.head . Seq.fromList $ repeat 'a'

never returns.

Chris

On Tue, Oct 7, 2014 at 3:57 PM, Matteo Ferrando wrote:

...
Hello,

We are writing a compiler[1] for a course and found that the `zip` function included in the `Data.Sequence` module, `zip :: Seq a -> Seq b -> Seq (a, b)` would hang on the following code:

...
-- using the `zip` from `Data.Sequence` zip ys (fomList $ repeat x)

We checked the implementation[2] in the source of `Data.Sequence` and found the following:

...
zip :: Seq a -> Seq b -> Seq (a, b) zip = zipWith (,)

...
-- Here `zipWith` assumes *non-infinite `Seq`s* zipWith :: (a -> b -> c) -> Seq a -> Seq b -> Seq c zipWith f xs ys | length xs <= length ys = zipWith' f xs ys | otherwise = zipWith' (flip f) ys xs

...
-- Function not exported by `Data.Seq`, assumes `length xs <= length ys` zipWith' :: (a -> b -> c) -> Seq a -> Seq b -> Seq c zipWith' f xs ys = snd (mapAccumL k ys xs) where k kys x = case viewl kys of (z :< zs) -> (zs, f x z) EmptyL -> error "zipWith': unexpected EmptyL"

In the lazy reading of the documentation we did, we didn't find any warning of using infinite `Seq`s for zips. (Maybe there are warings that we didn't see). But looking at the code of `zip` in the `Prelude`:

...
zip :: [a] -> [b] -> [(a,b)] zip (a:as) (b:bs) = (a,b) : zip as bs zip _ _ = []

We see that we could just *pattern-match* both heads, instead of making assumptions.

Maybe this should be better explained in the documentation[3] of `zip` for `Seq`s:

...
zip :: Seq a -> Seq b -> Seq (a, b) O(min(n1,n2)). zip takes two sequences and returns a sequence of corresponding pairs. If one input is short, excess elements are discarded from the right end of the longer sequence.

Or just change the implementation for it to work with infinite `Seq`s.

For those of you who are curious, we ended up using the following code to fix the *infinite `Seq`s problem*:

...
-- using the `zip` from `Prelude` zip (toList ys) (repeat x)

[1] https://github.com/chamini2/sapphire [2]

http://hackage.haskell.org/package/containers-0.5.5.1/docs/src/Data-Sequence...

...
[3] http://hackage.haskell.org/package/containers-0.5.5.1/docs/Data-Sequence.htm...

_______________________________________________ Haskell-Cafe mailing list Haskell-Cafe@haskell.org http://www.haskell.org/mailman/listinfo/haskell-cafe

Chris Wong

3:49 a.m.

...

Data.Sequence provides a general-purpose *finite* sequence. There is no such thing as an infinite Seq!

In fact, you'll find that while

head $ repeat 'a'

results in 'a',

Seq.head . Seq.fromList $ repeat 'a'

never returns.

To add to my previous comment: the key feature of Seq is constant-time access to both ends of the sequence. It does this by caching the the first and last few elements in the constructor. Given these constraints, the behavior you observe makes sense. To construct a Seq (as the call to fromList does), we must find the last element in the list so that we can cache it. But an infinite list doesn't have a last element (by definition). So fromList never terminates. I don't think there's a way to allow infinite sequences while also having efficient access to both ends. The Halting Problem probably comes into it somewhere. The solution you gave is likely the best one. Chris

Carter Schonwald

4:29 a.m.

the halting problem has nothing to do with it. Simply you can't cache the tail if no finite number of steps will get you there :) (halting problem style impossibility results tend to be wayyyy niftier) On Mon, Oct 6, 2014 at 11:49 PM, Chris Wong wrote:

...

...
Data.Sequence provides a general-purpose *finite* sequence. There is no such thing as an infinite Seq!

In fact, you'll find that while

head $ repeat 'a'

results in 'a',

Seq.head . Seq.fromList $ repeat 'a'

never returns.

To add to my previous comment: the key feature of Seq is constant-time access to both ends of the sequence. It does this by caching the the first and last few elements in the constructor.

Given these constraints, the behavior you observe makes sense. To construct a Seq (as the call to fromList does), we must find the last element in the list so that we can cache it. But an infinite list doesn't have a last element (by definition). So fromList never terminates.

I don't think there's a way to allow infinite sequences while also having efficient access to both ends. The Halting Problem probably comes into it somewhere. The solution you gave is likely the best one.

Chris _______________________________________________ Haskell-Cafe mailing list Haskell-Cafe@haskell.org http://www.haskell.org/mailman/listinfo/haskell-cafe

Brent Yorgey

9 Oct 9 Oct

1:23 p.m.

http://hackage.haskell.org/package/fmlist-0.8/docs/Data-FMList.html On Tue, Oct 7, 2014 at 12:29 AM, Carter Schonwald < carter.schonwald@gmail.com> wrote:

...

the halting problem has nothing to do with it. Simply you can't cache the tail if no finite number of steps will get you there :)

(halting problem style impossibility results tend to be wayyyy niftier)

On Mon, Oct 6, 2014 at 11:49 PM, Chris Wong wrote:

...
...
Data.Sequence provides a general-purpose *finite* sequence. There is no such thing as an infinite Seq!

In fact, you'll find that while

head $ repeat 'a'

results in 'a',

Seq.head . Seq.fromList $ repeat 'a'

never returns.

To add to my previous comment: the key feature of Seq is constant-time access to both ends of the sequence. It does this by caching the the first and last few elements in the constructor.

Given these constraints, the behavior you observe makes sense. To construct a Seq (as the call to fromList does), we must find the last element in the list so that we can cache it. But an infinite list doesn't have a last element (by definition). So fromList never terminates.

I don't think there's a way to allow infinite sequences while also having efficient access to both ends. The Halting Problem probably comes into it somewhere. The solution you gave is likely the best one.

Chris _______________________________________________ Haskell-Cafe mailing list Haskell-Cafe@haskell.org http://www.haskell.org/mailman/listinfo/haskell-cafe

_______________________________________________ Haskell-Cafe mailing list Haskell-Cafe@haskell.org http://www.haskell.org/mailman/listinfo/haskell-cafe

3923

Age (days ago)

3925

Last active (days ago)

List overview

Download

5 comments

4 participants

participants (4)

Brent Yorgey
Carter Schonwald
Chris Wong
Matteo Ferrando