DOCTYPE and other fixes
Hi there, I just commited DOCTYPE sniffing (for the sake of bug-meter error-message accuracy). This is still in state of flux, but at least now the unescaped & inside URIs error messages that crowded the bug meter (aka. unrecognized entities) only appear for HTML4.01 and XHTML. The same code can serve for better checking XHTML (Matthias: this is what we talked about in Germany). In a nutshell, one of the nasty effects of SGML and XML is that formerly perfect URI-values now need to escape '&' as '&' (inside atrribute values). This breaks HTML compliance for lots of sites with script generated URL refrences. Webmasters usually workaround this by omitting DOCTYPE so it's parsed as in the old days. There's also an important fix in the new tag parser of dpip. Has anyone more feedback with the new FLTK2 GUI downloads dpi? -- Cheers Jorge.-
On Thu, Jan 19, 2006 at 04:07:13PM -0300, Jorge Arellano Cid wrote:
I just commited DOCTYPE sniffing (for the sake of bug-meter error-message accuracy). This is still in state of flux, but at least now the unescaped & inside URIs error messages that crowded the bug meter (aka. unrecognized entities) only appear for HTML4.01 and XHTML.
Hi Jorge, escaping special characters inside attribute values is required for older versions of HTML as well. Hence, I think Dillo should report these errors for all HTML version, maybe apart from "tag soup", i.e., HTML documents without doctype. (According to the HTML 2.0 spec, an HTML document without doctype shall be treated as HTML 2.0, but I don't know if anyone ever cared about this.) Maybe one could reduce the number of error messages, for example by reporting only the number of unescaped "&" inside each URI or by showing only the first such error etc.
The same code can serve for better checking XHTML.
Yes, this could be done, in particular if Dillo accepted XHTML served as such and not as text/html.
Has anyone more feedback with the new FLTK2 GUI downloads dpi?
not yet tested ... All the best, -- Matthias Franz
participants (2)
-
Jorge Arellano Cid
-
Matthias Franz