
22 Oct
2007
22 Oct
'07
9:52 a.m.
Hi Ketil,
I'm struggling to get my HXT-based parser to parse a largish file (<300MB), even after breaking into reasonably-sized chunks. The culprit appears to be parsing one element comprising 25K lines of text, which apparently requires more memory than the 2Gb my computer is equipped with.
You can try TagSoup (http://www-users.cs.york.ac.uk/~ndm/tagsoup/) which isn't really a complete XML parser, but may do what you want. The other option is HaXml which Malcolm has been adding lazy parsing to - I'm not sure if that is in a released variant or not. Thanks Neil