There is a bug in the HTML parser: Tags within quotes are interpreted:
<input type="text" name="test" value="<p>asdf</p>" />
I think it is line 3754f (src/html.cc) which evokes the unwanted
behaviour. I'd fix this bug by introducing two variables: The first one
states whether we're currently inside of a quote and the second one
stores its type (single or double quote). As long as the current
character does not equal the type, all characters in between the
starting and ending quote will be ignored. Of course we should be also
able to deal with escaped quotes allowing constructs similar to the
following one:
<input type="text" name="test" value="<p>\"asdf\"</p>" />
--Tim