HTML parser bug

Dec. 5, 2009

      There is a bug in the HTML parser: Tags within quotes are interpreted:

<input type="text" name="test" value="<p>asdf</p>" />

I  think  it  is line  3754f  (src/html.cc)  which  evokes  the unwanted
behaviour.  I'd fix this bug by introducing two variables: The first one
states whether  we're currently  inside of  a quote  and the  second one
stores  its type  (single or  double  quote).  As  long  as  the current
character  does  not equal  the  type,  all  characters  in  between the
starting and ending quote will be  ignored.  Of course we should be also
able  to deal  with escaped  quotes allowing  constructs similar  to the
following one:

<input type="text" name="test" value="<p>\"asdf\"</p>" />

--Tim

tim.nieradzik＠gmx.de

corvid＠lavabit.com

jcid＠dillo.org

tags

participants (3)