[Language] Help with (non-english) characters!

It might comes from the font you use to display text on your screen, I use occidental (ISO), I think that this is a european norm, that might be the reason why it looks funny on your computer.

Also, do you want to include the greek symbols?
 

log in or register to remove this ad

Blacksad said:
It might comes from the font you use to display text on your screen, I use occidental (ISO), I think that this is a european norm, that might be the reason why it looks funny on your computer.

Perhaps. Does the OE "œ" look different to you - less like "o" + "e" and more like "c" + "e"?

Blacksad said:
Also, do you want to include the greek symbols?

No, not really.

This is actually a collection of symbols that might be used to name characters (among other things).
 

The letter ß has no upper-case version. It never appears at the beginning of a word. When writing whole words in caps, ß is usually replaced with ss.
 



OK, I stopped laughing.

You missed Latin Letter Small Ezh With Tail.

OK, I've started again.

Unicode Code Charts

Sorry, it's kind of like having someone come up to you and ask you to tell them the rules to D&D. :D

Short answer - the Unicode people have been working on it for 10 years and they're still finding new ones. The "Proposed New Characters" page is particularly a must-see.

If you're doing character generation or something, I think the easy way is to use the Unicode support in the operating system and let the system programmers deal with it.
 

tarchon said:
OK, I stopped laughing.

You missed Latin Letter Small Ezh With Tail.

OK, I've started again.

Unicode Code Charts

Sorry, it's kind of like having someone come up to you and ask you to tell them the rules to D&D. :D

Short answer - the Unicode people have been working on it for 10 years and they're still finding new ones. The "Proposed New Characters" page is particularly a must-see.

If you're doing character generation or something, I think the easy way is to use the Unicode support in the operating system and let the system programmers deal with it.

Here's the problem - I want to disallow non-letter characters. No "-", no "&", no "€". I'm willing to live with a far-from-complete character set, but can't live with even a single nonletter.
 

Hi Charles,

Just to chime in, in German the: ö ü ä and Ö Ä Ü are very important. Also of course ß... where would I be if I couldn't order a Maß of beer for example ;)

Hope this is what you were looking for.

-Will
 

CRGreathouse said:


Here's the problem - I want to disallow non-letter characters. No "-", no "&", no "€". I'm willing to live with a far-from-complete character set, but can't live with even a single nonletter.

Usually, there will be an isalpha() function that does this reasonably well. It depends on what APIs you have access to though. In VC++ 5, you'll probably want to use iswalpha(), which supports Unicode characters. They have others for MS's "multibyte characters," which I believe are for their own encoding systems. Java has isLetter(). Unfortunately, some implementations of these include things like dollar signs, but Unicode does have a classification system that will identify those. In Java you can use Character.getType() to examine the type and make your own filter.
 


Remove ads

Top