dillo people: It turns out that dillo handles Chinese quite badly because the text doesn't contain whitespace (BUG 827). It looks like Japanese is that way as well. 1) People using East Asian languages: Does this do roughly the right thing? I've read that there are picky rules about certain individual characters, but I just mean in general here. 2) Thoughts on optimization are welcomed. littlebat: For some reason, I was thinking that whitespace was required between words in dillo, but I misremembered. littlebat wrote:
Hi, Thanks for the work on Dillo.
I think it isn't a good idea to resolve long chinese line wrapping problem by inserting space after punctuation mark("?" or "?" or "?", etc..).
Here is a "Dillo2 Wrap Long Chinese Line Test" page: http://www.learndiary.com/en/dillo2-wrap-long-chinese-line-test.htm . Open it with Firefox or IE, then open it with Dillo2, You will see the different of wrapping long chinese line between them.
Note, unlike english, there is no space between Chinese characters. So, it should be wrapped after any Chinese character which has reached the end of a line. I don't know the technique detail about this, but, I think, let the Chinese characters function as (english) commas or periods is a good idea, so, a new line can be started after any Chinese charcter which character has reached the end of a line.
littlebat
corvid wrote:
Would it make things better or worse than it is now if I inserted a space when encountering a "?" or "?" so that dillo would know that it can wrap there? If that doesn't show up properly, I mean the characters that function as commas and periods.
(I'm trying to think of something simple that will at least help you somewhat without having to make big changes right now :) I realize that it _should_ be able to wrap after most CJK characters, but...)
It could be worse because 1) Those spaces would still be there if you were copying and pasting the text to something else. 2) If you wanted to search a page for "text?text", you would have to enter "text? text" with a space in there.