[comp.lang.pascal] Char sorting seq.

ZCCBJSB%EB0UB011.BITNET@cunyvm.cuny.edu (Josep Sau B.) (11/09/90)

Those can seem silly questions to someone, but they are very important
to any textual management program that pretend to be truly international:

Which is the correct lexicographic ordering of the following
characters in their languages:

- The letters 'Eth' and 'Thorn' of Icelandic
  In code page 850: Capital Eth  chr(209), lower case Eth chr(208),
  Capital Thorn chr(232), lower case thorn (231).
  May them be equated to 'D', 'T' or some other letters in ordering ?
  Any other alive language uses these letters?

- The 'Beta' of German (chr(225) in all code pages)
  May it be translated ALWAYS to 'SS' ?

- The 'Y with accent acute' of Czek.
  In code page 850: Capital chr(237), lower case chr(236),
  May them be equated always to simple 'Y' in ordering ?
  Any other alive language uses these letters?

- The Spanish double-letters 'CH' and 'LL' were considered as
  simple letters in ordering, and C < CH < D, L < LL < M.
  A friend told me Real Academia de la Lengua Espaola modified
  this about a month ago to make easier sorting for computers.
  Is that true ? Any Spanish speaker had heart anything about that ?

Many thanks for any pertinent answer...


--Josep Sau B. <ZCCBJSB@EB0UB011>
------------------------------------------------------------------
'Every science needs the right words to be expressed the right way'
           Raimundus Lulius (Catalan philosopher)
------------------------------------------------------------------