
18 Jun 2006
8:30 a.m.
Has anyone explored destructuring HTML with Parsec? Any other ideas on how best to do this?

I'm looking to scrape bits of information from more or less unstructured HTML pages, and then to structure, tag and classify the content afterwards.

I think that developing HTML scrapers requires short tweak-compile-run cycles and is probably best done in dynamic languages such as Perl, Python or Ruby, but I wonder if someone has found otherwise.

Thanks, Joel

--
http://wagerlabs.com/