Laziness and XML parsing

8 Nov 2011

I want to parse a large XML file (2 GB) without loading the whole thing into memory. That's pretty simple with a SAX parser in most languages: you just stream bytes to the SAX parser and wait for SAX events.

Here's what I think the equivalent is in Haskell - https://gist.github.com/1346854

Is the XML file being read lazily? It seems lazy, but it also seems like all the SAX events would be loaded into memory. If they are not all loaded, how is that possible? In order to be lazy, it seems like parse would have to be an impure function, so that it could go back to the disk to get more data.
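To make the question concrete, here is a minimal sketch of how a pure function can sit on top of lazy I/O. It uses only lazy ByteStrings from the bytestring package. It is not the code from the gist and not a real XML parser: Event, events, countOpens and countOpensInFile are hypothetical names made up for illustration, and the "parser" only looks at '<' characters.

```haskell
import qualified Data.ByteString.Lazy.Char8 as L
import Data.List (foldl')

-- Hypothetical stand-in for a SAX event. A real parser would also carry
-- tag names, attributes and text.
data Event = TagOpen | TagClose
  deriving (Eq, Show)

-- A pure toy "parser": turns a lazy ByteString into a lazy list of events.
-- It treats "</" as a closing tag and any other '<' as an opening tag,
-- so declarations and comments are miscounted. Illustration only.
events :: L.ByteString -> [Event]
events bs = case L.uncons (L.dropWhile (/= '<') bs) of
  Nothing -> []
  Just (_, rest) -> case L.uncons rest of
    Just ('/', rest') -> TagClose : events rest'
    _                 -> TagOpen : events rest

-- Count opening tags with a strict left fold. Events are consumed one at a
-- time, so events and chunks that are already processed can be collected.
countOpens :: [Event] -> Int
countOpens = foldl' step 0
  where
    step n TagOpen  = n + 1
    step n TagClose = n

-- L.readFile returns a lazy ByteString: chunks are read from disk only when
-- the pure code above demands them.
countOpensInFile :: FilePath -> IO Int
countOpensInFile path = countOpens . events <$> L.readFile path
```

In GHCi, countOpensInFile "big.xml" (a placeholder file name) should run in roughly constant memory, as long as nothing else holds on to the start of the event list. The impurity lives in the lazy read behind L.readFile, not in events, which stays an ordinary pure function.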

~sean

Sean Hess
