[comp.sys.sun] Stuck procs on 4/330 os 4.0.3

dove@uunet.uu.net (Webster Dove) (05/30/90)

This machine shows a load average of 3 even though only one job seems to be
running.  I believe that update (PID 176) and biod (PID 111) are stuck in
disk wait (state D, priority -15), but I have no idea why.  I tried
'kill -9 111 176', which I presume fails because their priority is too low.

Can anyone tell me what causes this?  It looks like an NFS problem.  Is it
possible for an NFS request to fail and lock up the process?

uptime
 10:20pm  up 19 days,  6:19,  3 users,  load average: 3.19, 3.04, 3.02

/etc/mount
/dev/sd6a on / type 4.2 (rw)
/dev/sd6g on /usr type 4.2 (rw)
/dev/sd6h on /home type 4.2 (rw)
/dev/sd6e on /export/swap type 4.2 (rw)
/dev/sd6d on /export/root type 4.2 (rw)
/dev/sd6f on /export/exec type 4.2 (rw)
asp_2:/nfs/asp_2/disk_c/ on /nfs/asp_2/disk_c type nfs (rw,noquota)
asp_2:/nfs/asp_2/disk_a/ on /nfs/asp_2/disk_a type nfs (rw,noquota)
asp_1:/nfs/asp_1/disk_b/ on /nfs/asp_1/disk_b type nfs (rw,noquota)
asp_1:/nfs/asp_1/disk_a/ on /nfs/asp_1/disk_a type nfs (rw,noquota)
asp_3:/nfs/asp_3/disk_a/ on /nfs/asp_3/disk_a type nfs (rw,noquota)
vci_2:/nfs/vci_2/disk_a/ on /nfs/vci_2/disk_a type nfs (rw,noquota)
mailhost:/var/spool/mail on /var/spool/mail type nfs (rw,noquota,bg,secure)
vci_1:/nfs/vci_1/disk_a/ on /nfs/vci_1/disk_a type nfs (rw,noquota)
local_sun4:/local.sun4 on /local type nfs (rw,noquota)
asp_4:/nfs/asp_4/disk_a/ on /nfs/asp_4/disk_a type nfs (rw,noquota)
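
One thing that may be worth checking in the mount list above is which NFS
mounts carry neither the `soft` nor the `intr` option, since a request on
such a mount is normally retried until the server answers.  The following is
only a rough sketch in Python: it assumes that options not printed by
/etc/mount are at their defaults, and the function names and the three
sample lines copied from the listing are purely illustrative.

```python
# Sketch: list NFS mounts that do not carry the `soft` option, and note
# whether `intr` is set.  Assumes the "dev on dir type fstype (opts)" layout
# shown above; parse_mount and hard_nfs_mounts are hypothetical helper names.

SAMPLE = [
    "/dev/sd6a on / type 4.2 (rw)",
    "asp_2:/nfs/asp_2/disk_c/ on /nfs/asp_2/disk_c type nfs (rw,noquota)",
    "mailhost:/var/spool/mail on /var/spool/mail type nfs (rw,noquota,bg,secure)",
]


def parse_mount(line):
    """Split one /etc/mount output line into its four parts."""
    device, rest = line.split(" on ", 1)
    mount_point, rest = rest.split(" type ", 1)
    fstype, opts = rest.split(" ", 1)
    options = opts.strip().strip("()").split(",")
    return device, mount_point, fstype, options


def hard_nfs_mounts(lines):
    """Return (mount_point, has_intr) for every NFS mount without `soft`."""
    result = []
    for line in lines:
        _device, mount_point, fstype, options = parse_mount(line)
        if fstype == "nfs" and "soft" not in options:
            result.append((mount_point, "intr" in options))
    return result


if __name__ == "__main__":
    for mount_point, has_intr in hard_nfs_mounts(SAMPLE):
        flag = "intr" if has_intr else "no intr"
        print(mount_point, flag)
```

Run against the full listing, this would flag every NFS mount shown, since
none of them lists `soft` or `intr`.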

ps -aglx
      F UID   PID  PPID CP PRI NI  SZ  RSS WCHAN    STAT TT  TIME COMMAND
      3   0     0     0  0 -25  0   0    0 runout   D    ?   0:04 swapper
   8001   0     1     0  0   5  0  72   32 child    I    ?   0:01 /sbin/init -
      3   0     2     0  0 -24  0   0    0 child    D    ?   0:09 pagedaemon
   8001   0    48     1  0   1  0  56  120 select   I    ?   0:46 portmap
   8001   3    51     1  0   1  0  40   96 select   I    ?   0:40 ypbind
   8001   0    66     1  0   1  0  48  128 select   S    ?   3:04 in.routed
   8001   0   109     1  0   1  0  24    0 nfs_dnlc I    ?   0:38  (biod)
   8001   0   110     1  0   1  0  24    0 nfs_dnlc I    ?   0:39  (biod)
   8001   0   111     1  0 -15  0  24    0 Sysbase  D    ?   0:17  (biod)
   8001   0   112     1  0   1  0  24    0 nfs_dnlc I    ?   0:39  (biod)
   8001   0   125     1  0   1  0  72  152 select   I    ?   0:13 syslogd
   8001   0   135     1  0   1  0  72  160 select   I    ?   0:12 rpc.mountd -n
 408000   0   152     1  1   3  0  72    0 Sysbase  IW   ?   0:00 rarpd le0 bat
   8000   0   153   152  0   1  0  40    0 socket   IW   ?   0:00 rarpd le0 bat
   8000   0   155     1  0   1  0  64    0 select   IW   ?   0:00 rpc.bootparam
   8201   0   167     1  0  15  0  40  120 pause    S    ?   0:31 /usr/local/bi
      0   0   176     1  2 -15  0  24    0 Sysbase  DW   ?  34:23 update
 408001   0   180     1  0   1  0  96  192 Sysbase  I    ?   0:27 cron
   8001   0   194     1  0   1  0  72  112 select   I    ?   0:27 inetd
   8000   0   199     1  0   1  0  72    0 select   IW   ?   0:03 /usr/lib/lpd
   8001   0 26714     1  1   1  0 200  616 select   I    ?   0:03 /usr/bin/X11/
   8001   0 26983   167  0   1  0 104  768 select   S    ?   1:05 lockscreen_de
   8001   0    54     1  0   1  0  72  184 select   I    co  0:03 keyserv
   8000   0   160     1  0   1  0  72    0 select   IW   co  0:00 rpc.statd
   8000   0   162     1  0   1  0  80    0 select   IW   co  0:00 rpc.lockd
   80011001   465     1  0   1  0  48  184 select   I    co  0:01 selection_svc
   8001   0 26981     1  3   3  0  48   72 Sysbase  I    co  0:00 - std.9600 co
   8201 464 23574     1  0  15  0  96  312 pause    I    p0  0:36 /bin/csh -f e
   8001   0 27162   194  6   1  0  40  272 select   S    p0  0:04 in.rlogind
 408201 281 27163 27162  1  15  0 120  520 pause    I    p0  0:00 -tcsh-5.18 (t
   8001 464 27190 23574255 102 192272 2744          R>N  p0  3:24 sppeed6.fun 3
   8001 281 27194 27163  1   5  0  72  352 child    I    p0  0:00 Mail sunspots
   8001 281 27195 27194 12   1  0 360 1048 select   S    p0  0:00 emacs /tmp/Re
   8000   0  3990   194  0   1  0  40    0 select   IW   p1  0:01 in.rlogind
 408000 510  3991  3990  2   3  0 128    0 Sysbase  IW   p1  0:02 -tcsh-5.18 (t
      1 281 27201 27195 65  41  0 160  416          R    p2  0:00 ps -aglx
 408001 281 26722 26714  6   3  0 120  408 Sysbase  I    p6  0:00 -csh -i (tcsh
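
To pick the disk-wait processes out of a listing like the one above, the
columns can be sliced by position rather than split on whitespace, because
some fields run together (for example the PPID and CP of PID 27190).  This
is a minimal sketch in Python; it assumes the column positions implied by
the header line shown above, and the helper names are made up for
illustration.

```python
# Sketch: find processes whose STAT begins with "D" in SunOS-style
# `ps -aglx` output, using fixed-width slices derived from the header.

HEADER = "      F UID   PID  PPID CP PRI NI  SZ  RSS WCHAN    STAT TT  TIME COMMAND"

# A few lines copied verbatim from the listing above.
SAMPLE = [
    "   8001   0   110     1  0   1  0  24    0 nfs_dnlc I    ?   0:39  (biod)",
    "   8001   0   111     1  0 -15  0  24    0 Sysbase  D    ?   0:17  (biod)",
    "      0   0   176     1  2 -15  0  24    0 Sysbase  DW   ?  34:23 update",
    "   8001 464 27190 23574255 102 192272 2744          R>N  p0  3:24 sppeed6.fun 3",
]


def field_end(label):
    """Index just past a header label; numeric fields are right-aligned to it."""
    return HEADER.index(label) + len(label)


def parse(line):
    """Return (pid, pri, stat, command) for one ps output line."""
    pid = int(line[field_end("UID"):field_end("PID")])
    pri = int(line[field_end("CP"):field_end("PRI")])
    stat = line[HEADER.index("STAT"):HEADER.index("TT")].strip()
    command = line[HEADER.index("COMMAND"):].strip()
    return pid, pri, stat, command


def disk_wait(lines):
    """Keep only the processes whose state letter is D."""
    return [p for p in map(parse, lines) if p[2].startswith("D")]


if __name__ == "__main__":
    for pid, pri, stat, command in disk_wait(SAMPLE):
        print(pid, pri, stat, command)
```

On the sample lines this reports PIDs 111 and 176, both at priority -15.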


/usr/etc/nfsstat

Server rpc:
calls      badcalls   nullrecv   badlen     xdrcall
0          0          0          0          0          

Server nfs:
calls      badcalls
0          0          
null       getattr    setattr    root       lookup     readlink   read       
0 0%       0 0%       0 0%       0 0%       0 0%       0 0%       0 0%       
wrcache    write      create     remove     rename     link       symlink    
0 0%       0 0%       0 0%       0 0%       0 0%       0 0%       0 0%       
mkdir      rmdir      readdir    fsstat     
0 0%       0 0%       0 0%       0 0%       

Client rpc:
calls      badcalls   retrans    badxid     timeout    wait       newcred
959248     49         691        43         705        0          26         

Client nfs:
calls      badcalls   nclget     nclsleep
958683     0          958683     0          
null       getattr    setattr    root       lookup     readlink   read       
0  0%      312718 32% 2625  0%   0  0%      249534 26% 200722 20% 43060  4%  
wrcache    write      create     remove     rename     link       symlink    
0  0%      64248  6%  36760  3%  6046  0%   406  0%    132  0%    597  0%    
mkdir      rmdir      readdir    fsstat     
170  0%    46  0%     41330  4%  289  0%
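
For what it is worth, the Client rpc counters above can be turned into
per-call rates to see how often requests were retransmitted or timed out.
This is only a sketch in Python with hypothetical function names; it says
nothing about what rate would count as a problem, which the listing alone
does not settle.

```python
# Sketch: express each "Client rpc" counter from nfsstat as a fraction of
# the total number of calls.  The two strings are copied from the output above.

HEADER = "calls      badcalls   retrans    badxid     timeout    wait       newcred"
VALUES = "959248     49         691        43         705        0          26"


def parse_counters(header, values):
    """Pair each column name with its integer value."""
    return dict(zip(header.split(), (int(v) for v in values.split())))


def per_call_rates(counters):
    """Divide every counter except `calls` by the number of calls."""
    calls = counters["calls"]
    return {name: count / calls
            for name, count in counters.items() if name != "calls"}


if __name__ == "__main__":
    rates = per_call_rates(parse_counters(HEADER, VALUES))
    for name, rate in rates.items():
        print("%-9s %.4f%%" % (name, rate * 100))
```

With the numbers above, both retrans and timeout come out a little under
0.08% of all calls.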