19 Apr
2011
19 Apr
'11
3:11 a.m.
Since the document claims it is HTML, you should be parsing it with an HTML parser. Try hxt-tagsoup -- specifically, the "parseHtmlTagSoup" arrow.