Re: [Haskell-cafe] Something like optimistic evaluation

29 Apr 2008

      Brent Yorgey wrote:
...
On Mon, Apr 28, 2008 at 6:09 PM, Daniil Elovkov 
mailto:daniil.elovkov@googlemail.com> 
wrote:
Hello
Somewhat on the topic of optimistic evaluation, I've just thought of
    another way to evaluate thunks.
When the program is about to block on some IO, what if we start a
    thread to evaluate (any) unevaluated thunks. We'll have additional
    system thread, but the blocked one will not actually consume any
    processor time.
This would take place only when the program is compiled as threaded
    and run with -N k, k>1.
The RTS knows at least about some operations that will block, those
    which IO operations are implemented with. for example. It could
    merely start a process of evaluating any (or something more clever)
    outstanding thunks right before going into one of those operations
    and stop it when it's back.
Of course, it's not like optimistic evaluation because we don't
    avoid creating thunks. But in a sense it's similar. It could also be
    compared with incremental garbage collection :)
Has something like that been done, discussed?
This sounds like it could be helpful in certain circumstances, but in 
many cases it could probably lead to unpredictable (and uncontrollable!) 
memory usage.  I could imagine a situation where my program is running 
along just fine, and then one day it takes a long time to do a read from 
the network due to latency or whatever, and suddenly memory usage shoots 
through the roof, due to evaluation of some infinite (or even just very 
large) data structure.
Yes, well, optimistic evaluation itself, as I understand, already 
exploits some mechanisms for avoiding that kind of thing. For example it 
stops evaluating a thunk if it starts to take too long. In case of OE we 
have to care about time (and memory). In this 'behind IO' case we only 
have to care about memory usage.

There can be some rules, like have an upper bound of amount of memory 
taken by new thunks. Also, of course it would make sense to do a sort of 
'breadth first' evaluation.

After all, the research has already been done on OE, its trade-offs, 
what happens with infinite data structures, etc. This would be just a 
relatively minor tweak. And, in terms of real execution time it would be 
simply free, it seems.