On Mon, May 05, 2014 at 02:25:01AM +0000, eocene wrote:
Jorge wrote:
On Mon, May 05, 2014 at 12:54:06AM +0000, eocene wrote:
Jorge wrote:
On Sun, May 04, 2014 at 09:09:48PM +0000, eocene wrote:
This msg says that it was changed because it wasn't required under certain conditions. HTML4 spec gives it as:
Note. In SGML, it is possible to eliminate the final ";" after a character reference in some cases (e.g., at a line break or immediately before a tag). In other circumstances it may not be eliminated (e.g., in the middle of a word). We strongly suggest using the ";" in all cases to avoid problems with user agents that require this character to be present.
...and there's an "IIRC" in the msg that XHTML requires it.
The HTML5 spec requires a terminating ';' in all cases.
Then, it looks like requiring it again in this case may be the way to go (I seem to recall there were lots of unterminated NBSP).
Are you saying always for html5, (probably) always for xhtml, and for attributes with html4?
I'm saying we should find a simple heuristic that copes with the current situation.
If you want simple, I can just require it unconditionally and find out what happens.
Your first suggestion looks quite reasonable. Please try it and make some field tests. I'm currently working on the double imgbuf problem... -- Cheers Jorge.-