I know UTF-8 is one of those "features" that many people on slashdot have missed for a long time, but I thought most of that was for simple additions like the euro/pound/yen symbols and such.
Am I the only one that would prefer not to see non-english text mixed in with the comments?
Then again, I'd fully support an exception for Klingon. Maybe elvish too.
Yeah it may mean some unreadable posts, but as pointed out in the story text moderation should take care of that. There was even talk of a new mod category, something like "-1 Unintelligible" :P
I rather like that idea for a mod.
There will be stories about Finns with äs in their names. Swedes with Ås in their names, and maybe, just maybe, stores about icelandic volcanoes with ðs in their name. Whilst I like the idea of everyone agreeing to use English language, that doesn't mean every word they'll be typing will be English.
Note - this post isn't UTF-8, this is plain old ASCII, I used the &entity; syntax.
At present, there's nothing stopping someone from writing in German, French, Spanish, or many other languages using Latin letters (more or less). Since it doesn't happen, I don't think we need to worry.
(But if I post in a quote with an em-dash: â€” or some â€œproperâ€ â€˜quotes,â€™ maybe an arrow â†’ or temperature (0Â°C) it won't make a mess.)
(Nope, looks like it's going to make a mess.)
Well, why shouldn't we be able to post non-English comments?
And even if you don't, it would be nice to be able to spell some people's names correctly.
Lech WaÅ‚Ä™sa [wikipedia.org]FriÃ°rik ÃžÃ³r FriÃ°riksson [wikipedia.org]Jaroslav HaÅ¡ek [wikipedia.org]
How do you consider the use of â„ƒ or ãŽž units: are they proper English or part of the non-english text?
Oooopps. A copy/paste of these characters from the "Character map" (on Ubuntu/Firefox) straight into the reply text box results in them showing mangled in the preview (no mater if plain-old text or HTML).
To provide the context: I was enquiring about the use of these ℃ and ㎞ units.
The bytes are probably interpreted by slashcode as latin1 instead of utf8.