[comp.sys.sun] Sun-Spots Digest, v5n31

Sun-Spots-Request@RICE.EDU (Vicky Riffle) (08/12/87)

SUN-SPOTS DIGEST         Tuesday, 11 August 1987       Volume 5 : Issue 31

Today's Topics:
      Ethernet problems induced by bogus ICMP Address Mask Reply (3)
                 rwhod not broadcasting on proper address
                             dsun on SunView
                     Adding a SCSI disk to a SUN-3/52
                      Re: VME - VME adapters for SUN
                         Re: File system problems
                           File server troubles
               TCP/IP (terminal server) printing from Suns?
                   3.0+ driver for the Interphase 2181?
                        SUN-3/50 memory expansion?
                      ICS-100 AD/DA board on a SUN?
                          PIC tool or TeX tool?
         MacPaint/PICT files --> Sun Raster/Icon formats: tools?

----------------------------------------------------------------------

Date: Fri, 07 Aug 87 22:04:44 EDT
From: Preston Mullen <mullen@nrl-css.arpa>
Subject: Ethernet problems induced by bogus ICMP Address Mask Reply

Here's a new one to add to the list of things that can induce broadcast
storms and other serious problems.  It involves an unpleasant interaction
between SunOS 3.3 and 3.4 and Wollongong's WIN/VX software version 3.0.
(Ironically, this problem was noticed when the Suns and VAXes were
upgraded to what is generally considered to be better networking
software.)

When a diskless Sun 3 workstation running SunOS 3.3 or 3.4 boots, at some
point it sends out a broadcast ICMP Address Mask Request.  This is in
accordance with RFC950; unfortunately, an incorrect reply from any machine
on the network can be accepted by the workstation, and some incorrect
masks can induce the workstation to start sending all packets as Ethernet
broadcasts, which instantly leads to a broadcast storm.

If this happens, the workstation will probably fail to finish booting
completely, usually stopping during the NFS mount with "RPC: not
registered" messages, but sometimes sooner.  Also, the workstation may
itself then generate an incorrect reply (sent as an Ethernet broadcast) to
a subsequent ICMP Address Mask Request from some other machine, thus
spreading the virus.  Other symptoms that may be observed include
extremely sluggish operation of diskless machines and "no carrier" and
"Ethernet jammed" notices from Suns (aside from those generated during
broadcast storms).

This happened here on a non-subnetted Class B network when the 3.0 release
of Wollongong's IP/TCP software was installed on two VAXes running VMS.
Both VAXes replied to a Sun's ICMP Address Mask Request with network masks
of 0000FFFF instead of the correct FFFF0000.  The diskless Sun gullibly
swallowed this and began to encapsulate every IP packet in an Ethernet
broadcast packet.  Even after we isolated our network from the offending
VAXes, any Sun in this state would respond with the bogus netmask to a
broadcast from another Sun and thus continue the problem.

The solution was to disconnect the offending VAXes from the network, halt
ALL the diskless Suns, then reboot them.  After that, we could let the
VAXes back on the network, but it was not really safe since any diskless
Sun reboot would start the cycle over again.

The people who own the VAXes that started the problem have told me that
the problem is really in the Wollongong software (i.e., not a
configuration error) and is a holdover from some "broken 4.3bsd code".
Evidently there are also problems with setting the address mask.  They
report that Wollongong has sent them a fix.

==> Potential users of Wollongong 3.0 software should get a fix from
Wollongong before running 3.0 on a network where a machine might broadcast
an ICMP address mask request.

==> Sun should make their networking code smarter.  There is no way a
netmask of 0000FFFF could ever be valid, since it must include the normal
network part of the address as well as the subnet part.  The Suns should
have ignored the faulty replies instead of being driven berserk by them.
I've reported this to Sun.  The problem doesn't affect Suns running 3.2 or
earlier releases since those releases have no subnetting support.

It would probably be better if the diskless Sun did not broadcast the
Address Mask Request but instead sent it directly to the server.  I think
that the request is broadcast after the ifconfig of the diskless Sun's
Ethernet interface (anyway, an "address mask set to FFFF0000" report
appears on the console quite a while before the problem shows itself).  If
that is so, then it's not clear why an ICMP Address Mask Request needs to
be sent at all, since the network mask can be specified in the ifconfig in
the rc.boot file.

By the way, our class B network is partitioned by level 2 bridges (Digital
DEBETs).  Needless to say, they passed every bad packet and broadcast
right through.  (The VAXes were on the other side of a bridge from my
Suns.)  Yep, I'll be moving my stuff behind an IP gateway now.

Many, many thanks to Van Jacobson for 'tcpdump', which proved instrumental
in tracking this down.  I hope Sun will let Van release the source code
for tcpdump.

	Preston Mullen
	Computer Science and Systems Branch
	Information Technology Division
	Naval Research Laboratory
	Washington DC 20375-5000

P.S.    Why do diskless Sun workstations running SunOS 3.4 broadcast an
	ARP request for IP address 0.0.60.216 very early in the boot
	sequence?  (The ARP packet asks that replies go to 0.0.60.216.)
	This appears to be wired into the Sun networking software.
	It's harmless enough, but it should not be there.  Maybe
	someone forgot to take out some debugging code.

------------------------------

Date: Sun, 9 Aug 87 08:02:44 EDT
From: hedrick@topaz.rutgers.edu (Charles Hedrick)
Subject: Re: Ethernet problems induced by bogus ICMP Address Mask Reply

I believe your ARP request for 0.0.60.216 is used during the boot
sequence, at a point when the Suns don't know their Internet address.
They seem to use their serial number as an address.  You'll also find such
addressing in the routing tables in certain versions of SunOS.  I suggest
that you take 60 * 256 + 216, and see if you don't have a Sun with that
serial number.

------------------------------

Date: Mon, 10 Aug 87 08:31:12 CDT
From: timk@newton.ncsa.uiuc.edu (Tim Krauskopf)
Subject: Re:  Ethernet problems induced by bogus ICMP Address Mask Reply

We encountered the exact same problem (diagnosed with tcpdump) between our
diskless Sun 3/50 (SunOS 3.3) and a Vaxstation (TWG under VMS).  The
icmpmask response from TWG is backwards.

Easy work-around while waiting for bug fixes from TWG:
  Install the netmask parameter in the rc.boot files (ifconfig lines) of as
  many diskless Suns as you feel like.
  Each of the Suns with the mask installed will operate correctly.
  For the rest of the machines, the russian roulette has much better odds:
  When the ICMP request goes out, there are several Suns waiting to respond
  and they are faster at responding than the VAXen.  Whoever gets there first
  wins.

Isn't this fun?

Tim Krauskopf
National Center for Supercomputing Applications
University of Illinois
timk%newton@uxc.cso.uiuc.edu

------------------------------

Date: Fri, 7 Aug 87 15:21:41 MDT
From: hi!kurt@hc.dspo.gov (Kurt Zeilenga)
Subject: rwhod not broadcasting on proper address

We are configuring our hosts (Release 3.4) like:

/etc/ifconfig ie0 sunhost netmask 255.255.248.0 broadcast 129.24.15.255 \
    -trailers up

[[ That last bit was one line in the original message.  --wnl ]]

where the address of the hostname would be something like:

129.24.13.3     sunhosts

We broadcast on the subnet address 129.24.8 on host all '1 (ie:
129.24.15.255) as configured above.  This is to conform with Internet
Standards.  On our Ultrix and 4.3BSD systems this works fine.

On our SUNs, in.routed seems to understand this, yet in.rwhod doesn't.
Rwho seems to ignore the broadcast address ifconfiged in.  Do you have a
fix for this or have any suggests? 

Also, if for some reason you want to have two logic IP networks on the
same ethernet cable and want your SUN (with two ie boards) to gateway you
have to muck with the ifconfig lines.  The boards get their ethernet
address from the SUN ID prom, hence have the same address, and you get two
controllers at the same hardware address.  Not good.  To get around this
we did:

ifconfig ie1 02:20:20:20:20:20 gateway -trailer up

Works great.

- Kurt (zeilenga@hc.dspo.gov)

------------------------------

Date: Fri, 7 Aug 87 14:16:16 PDT
From: eggert@grand.sm.unisys.com (Paul Eggert)
Subject: dsun on SunView

In Sun-Spots v5n29 tamir@cs.ucla.edu (Yuval Tamir) asks whether anybody
has gotten the old dsun program to work under SunView.  The following
fixes should get dsun.tool to work under Sun UNIX Release 3.4.  We've
changed dsun.tool in other ways, so your line numbers may differ.  Be sure
your dev.h file matches ditroff's; there seem to be different ones
floating around.

Surely something better than this should be available these days!

-- Paul Eggert (sdcrdcf!eggert@cam.unisys.com)

[[ The diff was too large to include in the digest.  It is available via
anonymous FTP from "titan.rice.edu" as "sun-source/dsun.diff".  We will
try to honor requests for individual mailings to those without direct
ARPANet access.  --wnl ]]

------------------------------

Date: Wed, 8 Jul 87 15:51:15 EDT
From: Mark Mendell <mendell@turing.toronto.edu>
Subject: Adding a SCSI disk to a SUN-3/52

We wanted to add some disk space to our SUN-3/52, but found that Sun's
price of $5000 (CDN) for a 71 Meg disk to be a bit high.  We decided to
put together our own disk.  We bought a Priam 519 190 Meg disk ($3249
CDN), and an Adaptec 4000A disk controller ($181 CDN).  A box and power
supply was purchased independently.   We had to buy the 50 pin connectors
(one for the back of the shoebox, one for the controller) for the cost of
$50 CDN.  We wired the cable ourselves. Total cost about $4000 CDN for 190
Meg disk (157 Meg formatted).

Caveats:
    We originally bought an Adaptec 5500 disk controller, because it
supports four disks.  It also supports the SCSI disconnect-reconnect
protocol.  Unfortunately, Sun OS 3.3 doesn't support disconnect-reconnect
except on the SCSI tape drive.  The disk couldn't be used from Unix with
that controller.  The system would hang when accessing the disk as a block
device.  Sun couldn't tell us if this would be fixed in Sun OS 4.0.

    We used diag to format the new disk, but we were guessing on the
number of sectors per track.  We originally tried 18 sectors/track, but it
didn't work, and the diag failed when trying to reformat at 17 sectors/
track.  We had to take the disk to another system, and format with another
controller for a couple of tracks.  Only then would diag let us re-format
at 17 sectors/track.

    The new disk controller was set to SCSI address 1, and the new
drive was drive 0.  The config line is:
	disk		sd1 at si0 drive 8 flags 0

------------------------------

Date: Wed, 5 Aug 87 10:37:39 +0200
From: sunuk!sunfra!rich@sun.com (Richard Saunders - Sun France Consulting)
Subject: Re: VME - VME adapters for SUN

A Swiss company makes a sophisticated VME multiple cage adapter called the
Vertical Bus System. Here is the address:

	Creative Electronic Systems S.A.
	Route du Pont-Butin 70
	CH-1213 Petit-Lancy
	Switzerland

Telephone (41) 2292 5745; ask for Francois-Henri Worm.
Good luck! Richard Saunders (Sun France Consulting).

------------------------------

Date: 6 Aug 87 02:33:37 GMT
From: hedrick@topaz.rutgers.edu (Charles Hedrick)
Subject: Re: File system problems

In response to kfl's problem with a trashed file system.  It is of course
possible that 3.3 of SunOS has a set of bugs that causes all of the things
you heard to be true.  We have experience only with 3.2.  However it is
far more likely that the Sun rep you were talking to is very confused
about a number of issues.  We have a number of Sun systems, and run out of
disk space from time to time.  We have had no trouble with file systems
being messed up, except when there is an actual hardware problem with the
disk.  (Of course that's not to say there are no other possible causes.  I
trust that you have the partitions set up properly, and they aren't
overlapping or something?) What happens when the file system is 90% full
(this shows as 100% on df) is that new files can't be written.  In some
cases, new files may be truncated.  But this should not affect existing
files, nor the integrity of the file system.  One could imagine situations
where every last disk block was used, and it was impossible to update some
directory.  (Does anybody know whether this is possible?)  But normally
Unix is set up so you can't go over 90% unless you are root, and this
should prevent any such thing from happening even if it were possible.

I've never heard of the file system being trashed by using too much memory
in a user program.  I'm reluctant to say that no such bug is possible,
since I've seen many strange bugs in my time, but this doesn't sound like
a very likely one (unless perhaps your swap space overlaps one of your
file systems).  

After you restore a level 0 dump, you should do a new level 0 dump, in
order to have a dump whose inode numbers agree with those actually on
disk.  This is discussed in the man page for restore.  But there is no way
that failure to do a dump will trash the file system.  You certainly don't
want to do it to /dev/null, since the whole point of doing it is to get a
new dump tape.  There is no rush.  You should simply do the new level 0 at
the time you would normally do your next incremental.

------------------------------

Date: Thu 6 Aug 87 17:55:54 PDT
From: Doug Bryan <Bryan@sierra.stanford.edu>
Subject: File server troubles

We are having some serious problems with our file server and the Sun
techies seem to be of little help.  Thus, I thought I would appeal to the
collective wisdom of net-land...

We are running a 3/180 file server with 3 Eagles (under OS 3.2).  The
following clips from /usr/adm/messages show the problem:

Aug  6 03:06
panic: bad rmfree
syncing disks... 9 9 7 3 done
dumping to dev 301, offset 25880
dump succeeded
Rebooting Unix...

Aug  6 11:37
panic: bad rmfree
syncing disks... xy1e: read reset (lost interrupt) -- 
		blk #85070, abs blk #85070
13 13 11 7 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 done
dumping to dev 301, offset 25880
dump succeeded
Rebooting Unix...

Aug  6 12:53
panic: bad rmfree
syncing disks... mbrelse: MR == 0!!!
xy2c: read reset (lost interrupt) -- blk #132544, abs blk #132544
6 6 6 3 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 done
dumping to dev 301, offset 25880
dump succeeded
Rebooting Unix...

Aug  6 13:06
panic: bad rmfree
syncing disks... 10 10 9 6 6 6 6 6 6 6 6 6 6 6 6 6 xy0h: write reset
	 (lost interrupt) -- blk #29648, abs blk #692048
3 3 3 3 done
dumping to dev 301, offset 25880
dump succeeded
Rebooting Unix...

Aug  6 13:13
panic: bad rmfree
syncing disks... xy1e: read reset (lost interrupt) --
		 blk #102536, abs blk #102536
16 16 14 8 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 done
dumping to dev 301, offset 25880
dump succeeded
Rebooting Unix...

When all 3 disks are being accesses, the file server crashes.  Because
of our user load, it has been crashing about once every hour.  And, as
you would expect, this often results in fsck being run and files lost.

All Sun has to say about the problem is that it is a "known" problem
and will be fixed by vapor-ware (OS 3.4).  My questions are:

	- Does anyone else have a similar file server configuration?
	  If so, are you experiencing this kind of thing?
	- What can I do until the vapor-ware materializes?

Tonight I'm going to remove one of the disks; then we will be running 2
Eagles, one per controller.  Sun seems to think the problem only occurs
when there are 2 Eagles on one controller.  Any other ideas?  We plan to
buy another server, but that will take a while.  An Eagle is a terrible
thing to waste.

doug

------------------------------

Date: Tue, 4 Aug 87 13:12 EDT
From: SIMON%M_SCRVX2%sdr.slb.com@relay.cs.net
Subject: TCP/IP (terminal server) printing from Suns?

How does one go about driving a printer through a terminal server running
TCP/IP?   The boxes concerned are a Sun 3/180S file server, and a Bridge
CS/100 terminal server configured with TCP/IP software.  I don't mind
whether a permanent TCP connection is maintained or whether a new
connection is formed for each print job.

Thanks,

Simon Barnes
simon%m_scr%sdr.slb.com@relay.cs.net
(I'll summarise any responses received directly).

[[ Please mail responses directly to him.  I will send out a summary when
he provides me with one.  --wnl ]]

------------------------------

Date: Tue, 4 Aug 87 13:11:21 MDT
From: donn@cs.utah.edu (Donn Seeley)
Subject: 3.0+ driver for the Interphase 2181?

At some time in the past, an Interphase 2181 disk controller was purchased
for our Sun-2s.  We'd like to use this disk controller to create a Sun-2
disk server and thus produce a homogeneous Sun-2 cluster (which we can
safely ignore thereafter).  Unfortunately no one verified at the time of
the purchase that Sun supported the 2181...  It seems that the ip driver
in SunOS 3.2 only supports the older 2180 controller, although there are
indications in the source that someone did at least attempt to produce a
working 2181 driver.  Interphase won't take the 2181 as a trade-in on
something better and doesn't have any suggestions about using it; no one
at Interphase whom we talked to appeared to remember that there was ever a
2181 product, which tells you something about turnover in the field...

If you have a driver for the 2181, or any constructive comments about what
to do with it, please send me mail...

Donn Seeley    University of Utah CS Dept    donn@cs.utah.edu
40 46' 6"N 111 50' 34"W    (801) 581-5668    utah-cs!donn

------------------------------

Date: 7 Aug 87 14:23:55 GMT
From: marantz@aramis.rutgers.edu (Roy Marantz)
Subject: SUN-3/50 memory expansion?

Does anyone know of a way to put more memory on a Sun-3/50.  We have a
bunch of them and feel they could use more memory when running lisp (and
the like).  Unfortunately we don't have enough money to trash them and buy
replacement machines (sun-3/60 or something).  We'd like to add at least
4Mb and can probably afford around $2K per machine.  Oh, I'm talking about
35, or so, units.  I've talked with everyone at Sun that I can get hold of
and they don't seem to believe it can be done reliably.  Anyone have any
ideas?  I'm willing to build/buy piggy back boards or almost anything
else.  Thanks for any suggestions.

Roy

------------------------------

Date: Fri, 7 Aug 87 08:43:48 EDT
From: "Craig F. Reese" <cfreese%super.org@relay.cs.net>
Subject: ICS-100 AD/DA board on a SUN?

I am posting this for a newsless friend: (please reply to cfreese)

Does anyone have software for or experience with the ICS-100 AD/DA
board with a SUN workstation.  I believe the board is made
in Canada but sold in the U.S. by SKY computers.  Any info 
greatly appreciated.

Craig F. Reese
Supercomputing Research Center
USENET: cfreese@super.org
CSNET: cfreese%super.org@csnet-relay

------------------------------

Date: Thu 6 Aug 87 15:07:03 PDT
From: CONTR14@nosc-tecr.arpa
Subject: PIC tool or TeX tool?

Does anyone know if there is a public domain copy of the following tools:

	a.) PIC like tool

	b.) LaTex or Tex like tool

We have a network of Sun 3/50 machines and would like to use PIC with
troff instead of purchasing ditroff.  LaTex or Tex may be alternatives if
the above is not possible.  Please communicate directly with me at the
following address:

	CONTR14@NOSC-TECR.ARPA

Thanks in advance,
Jim Baldo

[[ Editor's comments:

The VorTeX project at Berkeley distributes a package that includes a
DVI previewing tool (a DVI file is what a run of TeX produces).  This is
what I use to preview TeX documents.  Send a US Mail address to
"Vortex@ucbarpa.berkeley.edu" and they will (theoretically) send you a
packet of information.

I have written a tool that makes previewing PIC and Ideal figures easy.
Unfortunately, the tool runs a modified version of PIC to get the output
in an easy-to-read format.  Although the tool itself is distributable, the
modified version of PIC is not.

You CANNOT use PIC with vanilla troff (unless you set up PIC to draw its
pictures with zillions of periods) because it does not have the drawing
primitives that ditroff has.   --wnl ]]

------------------------------

Date: Thu, 6 Aug 87 12:58:51 PDT
From: moriarty@tc.fluke.com (Jeff Meyer)
Subject: MacPaint/PICT files --> Sun Raster/Icon formats: tools?

I have been using the utility posted a while ago, mp2sun, to transform
MacPaint files into Sun Raster files (for use in the -background option in
suntools, etc).  It works fine, but there are two tasks which I have still
been unable to perform on MacPaint images, and I was wondering if any of
you more informed folks had information on tools to use:

1)  Is there any tool for tranforming a *portion* of a MacPaint file into a
    Sun raster file?  Currently, you only get a Raster file the size of a
    MacPaint file.  I think I can alter mp2sun.c myself, but I was wondering
    if anyone else had done it already (I'd also need something to count
    pixel width and height in paint files -- andy Mac utilities to do this?).

2)  Something to change portions of MacPaint images/bitmaps/icons into
    Sun Icon format.

Finally, is there anything out there which transforms PICT format into Sun
Raster format (much like the "Drawing to Paint" menu item in SuperPaint).

Thanks for your time -- I'll be happy to post anything I get.

Moriarty, aka Jeff Meyer
INTERNET:     moriarty@tc.fluke.COM
Manual UUCP:  {uw-beaver, sun, allegra, hplsla, lbl-csam}!fluke!moriarty
<*> DISCLAIMER: Do what you want with me, but leave my employers alone! <*>

------------------------------