sommar@enea.UUCP (Erland Sommarskog) (07/04/87)
Below follows a draft to ISO-Latin/1 that was posted to this group a year and a half ago. Does anyone know if this draft has become a standard or has it been modified? For their VT200 series DEC uses something they call "DEC multi- national character set". It is a somewhat incomplete version of the ISO drat with some differences. Does these differences reflect changes from the draft or are they just DEC's own invention? Please send your responses by mail to sommar@enea.se. ====================================================================== The byte value is in the document represented by a notation xx/yy, where xx is the upper nibble (four bits), and yy is the lower nibble (in decimal!). The lower part of the table, i.e. positions 02/00 to 07/14 is exactly the same as ASCII. The upper part of the table contains the characters we can't live without in large parts of the world. Since I do not know how to send pictures in a standardised way (is macpaint documents OK?), I here include a table from ISO No.1: 10/00 NO-BREAK SPACE 10/01 INVERTED EXCLAMATION MARK 10/02 CENT SIGN 10/03 POUND SIGN 10/04 CURRENCY SIGN 10/05 YEN SIGN 10/06 BROKEN BAR 10/07 PARAGRAPH SIGN, SECTION SIGN 10/08 DIAERESIS 10/09 COPYRIGHT SIGN 10/10 FEMININE ORDINAL INDICATOR 10/11 LEFT ANGLE QUOTATION MARK 10/12 NOT SIGN 10/13 SOFT HYPHEN 10/14 REGISTERED TRADE MARK SIGN 10/15 MACRON 11/00 DEGREE SIGN 11/01 PLUS-MINUS SIGN 11/02 SUPERSCRIPT TWO 11/03 SUPERSCRIPT THREE 11/04 ACUTE ACCENT 11/05 SMALL GREEK LETTER MU, MICRO SIGN 11/06 PILCROW SIGN 11/07 MIDDLE DOT 11/08 CEDILLA 11/09 SUPERSCRIPT ONE 11/10 MASCULINE ORDINAL INDICATOR 11/11 RIGHT ANGLE QUOTATION MARK 11/12 VULGAR FRACTION ONE QUARTER 11/13 VULGAR FRACTION ONE HALF 11/14 VULGAR FRACTION THREE QUARTERS 11/15 INVERTED QUESTION MARK 12/00 CAPITAL LETTER A WITH GRAVE ACCENT 12/01 CAPITAL LETTER A WITH ACUTE ACCENT 12/02 CAPITAL LETTER A WITH CIRCUMFLEX ACCENT 12/03 CAPITAL LETTER A WITH TILDE 12/04 CAPITAL LETTER A DIAERESIS 12/05 CAPITAL LETTER A WITH RING ABOVE 12/06 CAPITAL DIPHTHONG A WITH E 12/07 CAPITAL LETTER C WITH CEDILLA 12/08 CAPITAL LETTER E WITH GRAVE ACCENT 12/09 CAPITAL LETTER E WITH ACUTE ACCENT 12/10 CAPITAL LETTER E WITH CIRCUMFLEX ACCENT 12/11 CAPITAL LETTER E WITH DIAERESIS 12/12 CAPITAL LETTER I WITH GRAVE ACCENT 12/13 CAPITAL LETTER I WITH ACUTE ACCENT 12/14 CAPITAL LETTER I WITH CIRCUMFLEX ACCENT 12/15 CAPITAL LETTER I WITH DIAERESIS 13/00 CAPITAL ICELANDIC LETTER ETH 13/01 CAPITAL LETTER N WITH TILDE 13/02 CAPITAL LETTER O WITH GRAVE ACCENT 13/03 CAPITAL LETTER O WITH ACUTE ACCENT 13/04 CAPITAL LETTER O WITH CIRCUMFLEX ACCENT 13/05 CAPITAL LETTER O WITH TILDE 13/06 CAPITAL LETTER O WITH DIAERESIS 13/07 (This position shall not be used) 13/08 CAPITAL LETTER O WITH OBLIQUE STROKE 13/09 CAPITAL LETTER U WITH GRAVE ACCENT 13/10 CAPITAL LETTER U WITH ACUTE ACCENT 13/11 CAPITAL LETTER U WITH CIRCUMFLEX ACCENT 13/12 CAPITAL LETTER U WITH DIAERESIS 13/13 CAPITAL LETTER Y WITH ACUTE ACCENT 13/14 CAPITAL ICELANDIC LETTER THORN 13/15 SMALL GERMAN LETTER SHARP s 14/00 SMALL LETTER a WITH GRAVE ACCENT 14/01 SMALL LETTER a WITH ACUTE ACCENT 14/02 SMALL LETTER a WITH CIRCUMFLEX ACCENT 14/03 SMALL LETTER a WITH TILDE 14/04 SMALL LETTER a WITH DIAERESIS 14/05 SMALL LETTER a WITH RING ABOVE 14/06 SMALL DIPHTHONG a WITH e 14/07 SMALL LETTER c WITH CEDILLA 14/08 SMALL LETTER e WITH GRAVE ACCENT 14/09 SMALL LETTER e WITH ACUTE ACCENT 14/10 SMALL LETTER e WITH CIRCUMFLEX ACCENT 14/11 SMALL LETTER e WITH DIAERESIS 14/12 SMALL LETTER i WITH GRAVE ACCENT 14/13 SMALL LETTER i WITH ACUTE ACCENT 14/14 SMALL LETTER i WITH CIRCUMFLEX ACCENT 14/15 SMALL LETTER i WITH DIAERESIS 15/00 SMALL ICELANDIC LETTER ETH 15/01 SMALL LETTER n WITH TILDE 15/02 SMALL LETTER o WITH GRAVE ACCENT 15/03 SMALL LETTER o WITH ACUTE ACCENT 15/04 SMALL LETTER o WITH CIRCUMFLEX ACCENT 15/05 SMALL LETTER o WITH TILDE 15/06 SMALL LETTER o WITH DIAERESIS 15/07 (This position shall not be used) 15/08 SMALL LETTER o WITH OBLIQUE STROKE 15/09 SMALL LETTER u WITH GRAVE ACCENT 15/10 SMALL LETTER u WITH ACUTE ACCENT 15/11 SMALL LETTER u WITH CIRCUMFLEX ACCENT 15/12 SMALL LETTER u WITH DIAERESIS 15/13 SMALL LETTER y WITH ACUTE ACCENT 15/14 SMALL ICELANDIC LETTER THORN 15/15 SMALL LETTER y WITH DIAERESIS End of table So, go ahead and start implementing the standard on Unix. I desperately need it! -- Dan Sahlin ..!mcvax!enea!ttds!dan ====================================================================== -- Erland Sommarskog ENEA Data, Stockholm sommar@enea.UUCP
lasko@video.dec.com (Tim Lasko[TBU Architecture]DTN 223-2186) (07/11/87)
ISO Latin-1, or more completely ISO Latin Alphabet No 1, is now an international standard as of February 1987 (IS 8859, Part 1). For those American USEnet'rs that care, the 8-bit ASCII standard, which is essentially the same code, is going through the final administrative processes prior to publication. The code table that was posted earlier by Mr. Sommarskog to the net is from an earlier draft of the standard, the following changes have been made: OLD DRAFT: 13/07 (This position shall not be used) 15/07 (This position shall not be used) FINAL STANDARD: 13/07 MULTIPLICATION SIGN 15/07 DIVISION SIGN Those two characters were added mainly out of the fear that individual vendors would use the positions for non-interchangeable and incompatible purposes, thus defeating the idea of the standard. The two symbols chosen were more or less a compromise from a large list of eligible characters. ISO Latin-1 (IS 8859/1) is actually one of an entire family of eight-bit one-byte character sets, all having ASCII on the left hand side, and with varying repertoires on the right hand side: Pt 1. Latin Alphabet No 1 (caters to Western Europe - now approved) Pt 2. Latin Alphabet No 2 (caters to Eastern Europe - now approved) Pt 3. Latin Alphabet No 3 (caters to SE Europe + others - in draft ballot) Pt 4. Latin Alphabet No 4 (caters to Northern Europe - in draft ballot) Pt 5. Latin-Cyrillic alphabet (right half all Cyrillic - processing currently suspended pending USSR input) Pt 6. Latin-Arabic alphabet (right half all Arabic - now approved) Pt 7. Latin-Greek alphabet (right half Greek + symbols - in draft ballot) Pt 8. Latin-Hebrew alphabet (right half Hebrew + symbols - proposed) I expect to update this list shortly, because next week I'm attending the meeting of the ISO Working Group concerned with these standards is being held. (ISO TC97/SC2/WG3 for those that can decipher that.) Regarding DEC Multinational: The development of DEC Multinational preceded the original ISO drafts, which largely explains the differences. It was actually ECMA, the European Computer Manufacturers Association, who first proposed a eight-bit, one-byte code table similar to what became ISO Latin-1 today. =================================== Tim Lasko Digital Equipment Corporation Maynard, MA (video!lasko@dec.COM <-- boy I hope that's right...)