[comp.dcom.lans] NW386 3.1 on PS/2-80 = ABENDS. HELP!

medici@elbereth.rutgers.edu (Mark Medici) (11/01/90)

I am having some continuing problems with a NetWare/386 3.1a (6/13/90)
server.  The problem is reported drive failure and shutdown while
performing massive write operations (such as backing-up one server to
another).  I have two other NetWare/386 3.1a servers running is
similar configurations without problems.  Here are the specifics:

Hardware:

IBM PS/2-80 (8580-041), with
	16MHz 80386
	16MB RAM
	3.5" diskette drive A:
	IBM PS/2 ESDI fixed disk controller
	Core HC-310 300MB ESDI fixed drive 0
	IBM 70MB ESDI fixed drive 1
	IBM Token Ring Adapter II/a (4mbps)
	Standard IBM VGA display circuit and 8512 monitor

System Software:

Novell NetWare/386 3.1a of 6/13/90
   AUTOEXEC.BAT contains:
	Prompt Server IML-Proto^G$P$G 
	SERVER

   STARTUP.NCF contains:
	load PS2ESDI slot=8

   AUTOEXEC.NCF contains:
	file server name IML-PROTO
	ipx internal net 80061106
	load TOKEN
	bind IPX to TOKEN net=80061100
	mount all
	load REMOTE <password>
	load RSPX
	load PATCHMAN
	load DELDIRFX	
	load FIXOPEN
	track on
	load MONITOR

    DRIVE ALLOCATION:
	Drive 0 (300MB) has a 2MB DOS bootable partition, and a 298MB
	NetWare/386 partition.  The NetWare partition is devoted to
	volume SYS: and hot-fix is enabled.

	Drive 1 (70MB) is devoted as a single NetWare/386 partition of
	70MB.  This partition is devoted to volume USER: and hot fix
	is enabled.

PROBLEM:

When performing massive writes to the server, server will disconnect
target drive (usually SYS:, a 300MB Core ESDI on an IBM ESDI
controller) complaining about drive failure.  Client sees Error
writing to Fat and other write failures.  Errors on server scroll by
too fast to see the beginning of the problem.  No messages are
recorded in any error logs, because SYS: gets shutdown.

If I try to run VREPAIR or DOWN and EXIT the server, I get the
following ABEND message:

>System halted Wednesday  October 31, 1990    5:48:35pm
>Abend: Nonmaskable Interrupt (Parity Error) Processor Exception
>Parity Error was generated by the system board
>    OS version Novell Netware 386 V3.10 Rev. A 6/13/90
>    Running Process: Polling Process
>
>    Stack  E4 DB 00 00 88 D6 F1 FF 00 00 00 00 0E 00 00 00
>           02 00 00 00 05 00 00 00 00 00 00 00 04 00 00 00
>           45 72 72 6F 72 20 77 72 69 74 69 6E 67 20 46 41

Attempts to copy diagnostic image to diskette failed, regardless of
capacity diskette used.

A previously recorded ABEND of the same nature produced only a
slightly different stack listing:

>    Stack  D7 DB 00 00 00 00 00 00 00 00 00 00 0E 00 00 00
>           02 00 00 00 05 00 00 00 00 00 00 00 04 00 00 00
>           45 72 72 6F 72 20 77 72 69 74 69 6E 67 20 46 41


What I've tried:

I have tried two completely different boxes, different controllers,
different token-ring cards, different drives, keyboards, etc.  The
only non-IBM part in the box is the Core HC-310 fixed drive, which is
NetWare/386 certified.  This system initially had an 80387 math chip
installed.  Removing the math chip (which was done between the first
and second stack listing) seemed to eliminated the problem.  However,
the the system just abended again while I was backing up another
server on the same ring using NBACKUP.  I have also reformatted the
drives, tried using them with and without running Surface Analysis
(they all passed), and have re-installed the complete NW386 3.1a
package at least 6 times.  Nothing has eliminated the problem.


Plea for help:

What is really puzzling is that I have two other servers similarly
configured that are experiencing no such problems.  Maybe this is just
because they don't normally get the amount of writes I've been doing
on this box.

I have tried contacting my Novell reseller, but they are slow to
respond because we did not purchase the hardware from them (2 - 3 year
old systems from an IBM grant).  I have been hesitant to contact
Novell because of their outrageous technical support costs ($200/hr!).

If anyone can offer any advice or suggestions I will be most grateful.

------------------------------------------------------------------------------
Mark Medici/SysProg3 * Rutgers University/CS-USG * medici@elbereth.rutgers.edu
------------------------------------------------------------------------------
-- 
------------------------------------------------------------------------------
Mark Medici/SysProg3 * Rutgers University/CS-USG * medici@elbereth.rutgers.edu
------------------------------------------------------------------------------