[net.unix-wizards] kernal panic

boyce@daemen.UUCP (Doug Boyce) (11/21/85)

We recently had the system panic twice within two hours with a "exit: m_getclr"
panic.  Could anyone explain what causes this and how to prevent it.


-- 

Doug Boyce   Daemen College, Amherst NY

UUCP : {decvax,dual,rocksanne,watmath,rocksvax}!sunybcs!daemen!boyce
       ihnp4!kitty!daemen!boyce
ARPA : boyce@buffalo@csnet-relay

	"And you to can go to Daemen College and enter the
		high tech field of floor polishing."

guy@sun.uucp (Guy Harris) (11/28/85)

> We recently had the system panic twice within two hours with a
> "exit: m_getclr" panic.  Could anyone explain what causes this and how
> to prevent it.

When a process exits, its user area gets discarded; only its process table
entry remains behind (until its parent does a "wait" and reclaims its
corpse).  The process' exit status and user/system CPU time (and other
resource usage figures, in 4.xBSD) for it and its children have to remain
around.  In most versions of UNIX, this is squirreled away in the process
table entry (some fields in the process table entry which aren't used for
the corpse of a process are recycled for this purpose).  In 4.2BSD, however,
they allocate an "mbuf" to hold the resource usage.  "mbuf"s are memory
buffers also used by the networking code.  If the networking code has eaten
them all up for its own purposes, "exit" can't get an "mbuf" and,
annoyingly, panics.

The simplest fix is to have it just note the fact that it couldn't get an
"mbuf" and not bother remembering the resource usage information (and make
sure that if "p->p_ru", which is the pointer to the "mbuf" in question, is
NULL, you *don't* try to use the pointer, since it doesn't point to
anything!).  You might also want to see why you're runing out of "mbuf"s;
there may be some problem in your kernel or on your network.

Anybody know what 4.1BSD did, and why they would have used an "mbuf" in 4.2?

	Guy Harris