Did you mean HaXml?
I'm sorry that I didn't explain what I want clearly.
I don't think that module can parse an HTML file like the one below.
I don't care about the JavaScript code itself.

I want to translate the following code

<html>
<p> hogehoge </p>
<script>if(window.mw){
mw.loader.state({"<script>":"</script>","user":"ready","user.groups":"ready"});
}
</script>
</html>


into something like this:

<html>
     <p>
         hogehoge
     <script>

In short, I want the structure of the HTML, excluding the JavaScript.
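
To make the target concrete, here is a minimal sketch of the kind of outline described above. It is written in Python with the standard library's html.parser rather than in Haskell, purely as an illustration; the names OutlineParser and outline are made up for this example. It assumes the script body does not contain the literal text "</script>" inside a string, because an HTML tokenizer ends the script element at the first such closing tag and would treat the rest as ordinary text.

```python
from html.parser import HTMLParser


class OutlineParser(HTMLParser):
    """Collect an indented outline of tags and text, skipping script bodies."""

    INDENT = "    "

    def __init__(self):
        super().__init__()
        self.stack = []   # names of the currently open elements
        self.lines = []   # outline lines collected so far

    def handle_starttag(self, tag, attrs):
        self.lines.append(self.INDENT * len(self.stack) + "<" + tag + ">")
        self.stack.append(tag)

    def handle_endtag(self, tag):
        # Close up to the nearest matching open element; ignore stray end tags.
        if tag in self.stack:
            while self.stack.pop() != tag:
                pass

    def handle_data(self, data):
        # html.parser hands over the body of <script> as raw text, so it is
        # enough to drop any text whose enclosing element is a script.
        if self.stack and self.stack[-1] == "script":
            return
        text = data.strip()
        if text:
            self.lines.append(self.INDENT * len(self.stack) + text)


def outline(html):
    """Return the tag/text outline of an HTML string, one item per line."""
    parser = OutlineParser()
    parser.feed(html)
    parser.close()
    return "\n".join(parser.lines)


# Hypothetical sample input, simplified from the example above.
sample = """<html>
<p> hogehoge </p>
<script>if (window.mw) { mw.loader.state({"user": "ready"}); }</script>
</html>"""

print(outline(sample))
```

The sketch never looks inside the script body, so tag-like text there (for example "<b>" in a string) does not show up in the outline. Elements that are never closed, such as <br>, are not handled specially.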

2013/12/25 Brandon Allbery <allbery.b@gmail.com>
On Tue, Dec 24, 2013 at 2:03 PM, akira kawata <a.kawashiro@gmail.com> wrote:
<html>
<script>
//<![CDATA[
<!-- -->
//]]>
</script>
</html>

An XML parser might help with CDATA blocks.

--
brandon s allbery kf8nh                               sine nomine associates
allbery.b@gmail.com                                  ballbery@sinenomine.net
unix, openafs, kerberos, infrastructure, xmonad        http://sinenomine.net