[comp.sys.sun] Some good press on SunOS 4.0

stpeters@uunet.uu.net (01/28/89)

Sun-Spots (comp.sys.sun) is a lot like TV news: if something bad happens,
it's news, but the good side doesn't appear.  Given all the bad press that
SunOS 4.0 has gotten of late, some of the good side needs to be told.

According to 'uptime', my group's server has been up 113 days now.  A
3/260 with 4 disks, 9 clients (3/60's, 3/50's, 3/110's), and a
LaserWriter, this machine serves an active imaging and software
development group.  (It is, by the way, on a large and complicated
[IP-subnetted] Ethernet, with machines from almost any vendor you can
think of.)  It just sits there and hums along.

Sure, we found a gotcha or two in going to 4.0 - you always do with any
major OS upgrade, but compared with, say, going from 1.0 to 2.0, the
upgrade to 4.0 was a piece of cake.  4.0 is loaded with goodies for users
and makes system management much easier.

If you're still running 3.X, you're in the dark ages (well, maybe 3.5.1 is
just the grey ages).

Dick St.Peters                        
GE Corporate R&D, Schenectady, NY
stpeters@ge-crd.arpa              
uunet!steinmetz!stpeters

lalonde@uunet.uu.net (Terry Lalonde - iccad) (02/09/89)

Yes! we had a couple of 3/260s up for several weeks (45 & 60) as well.
However we rebooted them to run filesystem checks.  These checks did
reveal/fix some problems.  I would recommend you run these checks as well
on some kind of regular basis, if you aren't already doing so.

We had some some strange behaviour (i.e. files being written over with
garbage or other files), which went away with this fsck/reboot.

Terry Lalonde
Usenet:  ...!uunet!mitel!lalonde

cudcv%warwick.ac.uk@nss.cs.ucl.ac.uk (Rob McMahon) (02/10/89)

steinmetz!dawn!stpeters@uunet.uu.net writes:
>According to 'uptime', my group's server has been up 113 days now.

Hmm, here's the uptime from our two multi-user machines, this is Saturday
lunchtime, things are really quiet:

orchid:
 12:12pm  up 1 day,  4:43,  12 users,  load average: 0.10, 0.11, 0.00

Sun 4/280, OS 4.0.1, 2 disks, only 8M memory (more was supposed to arrive
last Thursday (not from Sun).  Had to be rebooted when all the
in.telnetd's and in.rlogind's decided to eat all the processor and the
load average went up to 35.  Happened three times now, but I've only just
reported it to Sun, I never report things the first time they happen.

poppy:
 12:16pm  up 3 days, 20 hrs,  3 users,  load average: 1.18, 1.34, 1.20

Sun 3/160, OS 4.0.1, 3 disks, 16M memory, serving 3 Sun3/50s.  Last had to be
rebooted when it jammed up after:

Jan 31 16:09:55 poppy vmunix: mti0: DMA output error
Jan 31 16:09:55 poppy last message repeated 2 times
Jan 31 16:09:55 poppy vmunix: mti0: read error code <21>. Probable hardware fault
Jan 31 16:09:55 poppy vmunix: output_echo_char: out of blocks
Jan 31 16:09:55 poppy last message repeated 13 times

First time it's happened, I haven't reported it.  Current load average is
offset by one by a process which is stuck, reported by ps as in `D' "short
term" wait, and has been for two days.  Reported end of October, Sun
"can't reproduce the problem", although it sometimes happens to use once a
day, on both of our multi-user machines.  Last time it happended, earlier
in the week, all the nfsd's got stuck in this state, and the clients
froze.

Three of our four Sun 3/50s (OS 4.0.1, 4M) are currently spouting:

clnttcp_create: RPC: Program not registered
rpc.statd: cannot talk to statd at sol

once every 15 seconds.  Sol is a Gould PN6031 running a version of NFS
that does not have statd/lockd.  Rebooting doesn't cure this, I have to cd
/etc/sm.bak and remove a file and then reboot.  Reported to Sun about a
week ago, no answer yet.

>Sure, we found a gotcha or two in going to 4.0 - you always do with any
>major OS upgrade, but compared with, say, going from 1.0 to 2.0, the
>upgrade to 4.0 was a piece of cake.  4.0 is loaded with goodies for users
>and makes system management much easier.
>
>If you're still running 3.X, you're in the dark ages (well, maybe 3.5.1 is
>just the grey ages).

If you're still running 3.X, think long and hard before going to 4.0, and
be prepared to back out.

[[ This next section was sent in later:  --wnl ]]

As of 17/01/89 we had 17 problem reports outstanding with Sun, dating back
as far as September/October:
	Closing mailtool sometimes gives panic: Bus error
	Windows `pop' after period of inactivity
	restore gives spurious `resync restore' when file straddles dump tapes
	GNUemacs on an ALM-2 freezes the terminal on suspend/quit
	Losing characters on multiple rlogins/telnets to a single host
	pwd leaves thousands of /tmp/.getwd files around
	ctrace doesn't work on Sun4s
	Fileservers give itrunc: new size = 0, blocks = -16
	Processes stuck in ps `D' state
	screenblank dies leaving screen blank
	root can read files via NFS sometimes without remote root access
	ps dumps core
	NFS inconsistencies, updates not noticed sometimes
	finger dumps core
	panic: mclput
	Writes to /dev/rst8 hang near end of tape, tape unusable until reboot
	panic: spec_getapage pvn_kluster

We are looking to buy a new machine over the Summer, and Sun has lost a
lot of Brownie points over their software reliability and support, despite
the convenience of having another machine with the same architecture.

Rob
-- 
UUCP:   ...!mcvax!ukc!warwick!cudcv	PHONE:  +44 203 523037
JANET:  cudcv@uk.ac.warwick             ARPA:   cudcv@warwick.ac.uk
Rob McMahon, Computing Services, Warwick University, Coventry CV4 7AL, England

weiser.pa@xerox.com (02/14/89)

I'll second the good news.  We very heavily use some of the more unusual
features of 4.0 (VM features, mostly), and they work reliably as
advertised.

-mark