[comp.emacs] foreign alphabets and case folding

nelson@sun.soe.clarkson.edu (Russ Nelson) (07/19/88)

How do non-english-speakers initialize downcase_table, given the
assumption in search.c that A-Z and a-z are the only letters?
-- 
nelson@clutx.bitnet, nelson@clutx.clarkson.edu, uunet!clutx.clarkson.edu!nelson

cjh@hpausla.HP.COM (Clifford Heath) (07/26/88)

> How do non-english-speakers initialize downcase_table, given the
> assumption in search.c that A-Z and a-z are the only letters?

It depends entirely on the character set and the language.  HP generally
uses the ROMAN8 character set for European languages, and it has
associated case shift tables that are different for different languages.
I.e.  the obvious shift isn't always the correct one; French generally
strips accents for upper case, unless it's Canadian French...  etc.

The emerging standard character set for European languages seems to be
Latin1, which is ISO standard 8859/1.  Grrr, once again HP is first, and
yet fails to become the standard.  We have a way of doing that :-(

Clifford Heath, Hewlett Packard Australian Software Operation.
(UUCP: hplabs!hpfcla!hpausla!cjh, ACSnet: cjh@hpausla.oz)