BardStephenFox said:
I think the concerns on HTML are inline cross-linking/cross-scripting issues. It would be possible to practically hijack a board by aggressive cross-linking threads. This would be bad for a board like EN World if somebody inlined a porn site/board in the middle of a thread. Yeah, the mods could clean it up. But until somebody did, it would look like EN World was hosting that content.
Right. I wasn't asking for full HTML support though, just for HTML character entities. One of the web browsers I use doesn't render non-ASCII characters if they're literally inlined.
I'm not aware of a cross-site scripting vulnerability involving the use of character entities. You may be thinking of the issues raised in
Unicode Technical Report #36: Unicode Security Considerations (section 2, "Visual Security Issues"), but prohibiting the usage of HTML character entities while still permitting literal inlining of Unicode characters doesn't really do anything to guard against this kind of attack.
E.g., if you can read the following:
www.google.com
and if the following is visible, looks the same as the above, and is a hyperlink:
http://www.google.com/
Then neither EN World's forum software nor your browser is guarding you from this issue, and the lack of support for HTML character entities seems more like a vB design oversight than a deliberate security measure.
(For me, using Mozilla Firefox on Linux, the plain text Google lookalike displays normally, but my browser has replaced the fullwidth Latin characters in the hyperlinked text with decimal HTML character entities as recommeded in UTR #36, referenced above.)
I offer all of this for the sake of discussion and enlightenment, not because this is a problem I expect the EN World admins to fix. It's a limitation in the underlying software, so I reckon I'll just have to deal with it, or patch the other browser I use to pass through non-ASCII as is. Maybe there's a configuration option for it—I'll poke through the docs.
I think the two-word answer to my question, for now at least, is “you can't.”
