
27 Apr
2010
27 Apr
'10
10:26 a.m.
On 27 April 2010 16:22, John Creighton
Subject: Is XHT a good tool for parsing web pages? I looked a little bit at XHT and it seems very elegant for writing concise definitions of parsers by forms but I read that it fails if the XML isn't strict and I know a lot of web pages don't use strict XHTML. Therefore I wonder if it is an appropriate tool for web pages.
I don't know about XHT but tagsoup [1] does a pretty good job parsing web pages. Peter [1] http://hackage.haskell.org/package/tagsoup