Parsec and destructuring HTML content

Has anyone explored destructuring HTML with Parsec? Any other ideas on how to best do this? I'm looking to scrape bits of information from more or less unstructured HTML pages. I'm looking to structure, tag and classify the content afterwards. I think that developing HTML scrapers requires short tweak-compile- run cycles and is probably best done in Perl, Python, Ruby, i.e. dynamic languages but I wonder if someone has found otherwise. Thanks, Joel -- http://wagerlabs.com/

Hello Joel, Sunday, June 18, 2006, 4:30:35 PM, you wrote:
I think that developing HTML scrapers requires short tweak-compile- run cycles and is probably best done in Perl, Python, Ruby, i.e. dynamic languages but I wonder if someone has found otherwise.
hugs (winhugs) is rather fast, loading several KLOCs per second, and finally you can compile program with ghc -- Best regards, Bulat mailto:Bulat.Ziganshin@gmail.com

Hi
hugs (winhugs) is rather fast, loading several KLOCs per second, and finally you can compile program with ghc
WinHugs (but not Hugs) also has a feature called auto-reload - if you change and save any of the source code files it will automatically detect and reload them - no more ":r" :) Another useful feature of WinHugs is hyperlinks for errors, it gives you a link which will quickly open the appropriate file at the appropriate line in your prefered editor. These two features, along with the underlying compile speed of Hugs, can make programming with WinHugs even quicker than Python etc. Thanks Neil

Joel Reymont
I think that developing HTML scrapers requires short tweak-compile- run cycles and is probably best done in Perl, Python, Ruby, i.e. dynamic languages but I wonder if someone has found otherwise.
I get very short tweak-run cycles from haskell-mode in emacs. I've bound my F12 key to compile, and I use Local Variables to set the compile-command to execute my unit tests. For more information, http://www.haskell.org/hawiki/HaskellMode -- I've tried to teach people autodidactism, | ScannedInAvian.com but it seems they always have to learn it for themselves.| Shae Matijs Erisson
participants (4)
-
Bulat Ziganshin
-
Joel Reymont
-
Neil Mitchell
-
Shae Matijs Erisson