[comp.sys.ibm.pc] VI fails on German language

egauss@houxf.UUCP (E.GAUSS) (08/13/87)

I have found a bug in VI that makes it quite difficult to process
German text which uses the IBM-2 character set.  This is the standard
screen character set for the AT&T PC 6300.  An umlauted U is mapped
by VI upon input into an end-of-file and the whole process stops.

The problem is that VI strips off the leading bit from eight bit
characters.  The umlauted characters requiring the eighth bit get
mapped into various control characters.  This can be coped with by
using a filter which un-maps them after VI but I have found no
way to process text that contains an umlauted capital U.

Does anyone have a fix?

Ed Gauss, 
  ...!ihnp4!houxf!egauss

bicker@hoqax.UUCP (The Resource, Poet of Quality) (08/14/87)

In article <1370@houxf.UUCP>, egauss@houxf.UUCP (E.GAUSS) writes:
| I have found a bug in VI that makes it quite difficult to process
| German text which uses the IBM-2 character set.  This is the standard
| screen character set for the AT&T PC 6300.  An umlauted U is mapped
| by VI upon input into an end-of-file and the whole process stops.
| 
| The problem is that VI strips off the leading bit from eight bit
| characters.  The umlauted characters requiring the eighth bit get
| mapped into various control characters.  This can be coped with by
| using a filter which un-maps them after VI but I have found no
| way to process text that contains an umlauted capital U.
| 
| Does anyone have a fix?
| 

Yes. Use emacs.

If your at AT&T (which, by your machine id, it seems you are), most
machines have, and all machines can get the orignial emacs. If not
there are numerous net versions: Micro Emacs and so forth.

BK

tj@mks.UUCP (T. J. Thompson) (08/17/87)

In article <1042@hoqax.UUCP>, bicker@hoqax.UUCP (The Resource, Poet of Quality) writes:
: In article <1370@houxf.UUCP>, egauss@houxf.UUCP (E.GAUSS) writes:
: | I have found a bug in VI that makes it quite difficult to process
: | German text which uses the IBM-2 character set.  This is the standard
: | screen character set for the AT&T PC 6300.  An umlauted U is mapped
: | by VI upon input into an end-of-file and the whole process stops.
: | ...
: | Does anyone have a fix?
: | 
: 
: Yes. Use emacs.
: 
Or use MKS vi, which properly handles eight-bit characters, including
the accented chars.
-- 
     ll  // // ,'/~~\'   T. J. Thompson {decvax,ihnp4,seismo}!watmath!mks!tj
    /ll/// //l' `\\\     Mortice Kern Systems Inc.         (519) 884-2251
   / l //_// ll\___/     43 Bridgeport Rd. E., Waterloo, ON, Can. N2J 2J4
O_/                         1e9 kph: It's not just a good idea, it's the law