[comp.unix.ultrix] MVAX III memory errors

dan@rna.UUCP (Dan Ts'o) (10/05/89)

	We seem to be getting ECC memory errors on our MVAX III (VAXstation
3200, running Ultrix 2.2), which might also explain the numerous crashes
we have been having. Can someone quickly interpret this error log and tell
me which of the two 8 MB memory boards is at fault?
	Thanks. (Please respond by email.)

				Cheers,
				Dan Ts'o		212-570-7671
				Dept. Neurobiology	dan@rna.rockefeller.edu
				Rockefeller Univ.	...cmcl2!rna!dan
				1230 York Ave.		rna!dan@nyu.edu
				NY, NY 10021		tso@rockefeller.arpa
							tso@rockvax.bitnet

						  uerf version 2.2-005 

********************************* ENTRY     1. *********************************

----- EVENT INFORMATION SEGMENT -----

EVENT CLASS                             OPERATIONAL EVENT 
OS EVENT TYPE                  250.     ASCII MSG 
SEQUENCE NUMBER                337.
OPERATING SYSTEM                        ULTRIX 32 
OCCURRED/LOGGED ON                      Wed Oct  4 22:00:46 1989 EDT
SYSTEM ID                 x0A000003
OCCURRED ON SYSTEM                      rnh 
MESSAGE                                 Hi-Rate CRD log 

********************************* ENTRY     2. *********************************

----- EVENT INFORMATION SEGMENT -----

EVENT CLASS                             ERROR EVENT 
OS EVENT TYPE                  101.     MEMORY ERROR 
SEQUENCE NUMBER                336.
OPERATING SYSTEM                        ULTRIX 32 
OCCURRED/LOGGED ON                      Wed Oct  4 22:00:46 1989 EDT
SYSTEM ID                 x0A000003
OCCURRED ON SYSTEM                      rnh 

----- UNIT INFORMATION -----

UNIT CLASS                              MEMORY 
UNIT TYPE                               MS650 
ERROR SYNDROME                          MEMORY CRD ERROR 

----- KA650 MEMORY ERROR REGS -----

CNTRLR NO                        1.
MEMCSR16                  x200B963D     ECC ERR SYNDROME      CORRECTED DATA 
                                                                BIT = 30 
                                        MAIN MEM: PHY PG ADR  x5CB
                                        MAIN MEM: CORRECTABLE ECC ERROR 
MEMCSR17                  x00000062     CHECK BITS =          x62
                                        ECC ENABLED 
                                        CRD INTERRUPT DISABLED 
                                        MAIN MEMORY CYCLE SEL = 5/3 
MEMCSRN                   x80000016
MEMCON                    x00020003     BANK ENABLES =        x3
                                        MS650-AA - MEMORY MODULE #1 
                                        SYSTEM BANK =         0.

UNKNOWN ERROR ENTRY - DUMP FOLLOWS

0000:     01500044  252AB5CE  0A000003  00010301                    
0010:     00686E72  00000000  00000000  0AFF0065                    
0020:     FFFFFFFF  FFFFFFFF  00000001  01010001                    
0030:     200B963D  00000062  80000016  00020003                    
0040:     5E3C7E25                                                  
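
The PHY PG ADR values that uerf prints can be recomputed from the raw MEMCSR16 words, which helps when comparing many entries. The sketch below is only that: the field position (bits 28:9) is an assumption inferred from the two MEMCSR16 values in this log, not taken from DEC documentation, and the names `page_address` and `byte_address` are made up for illustration. The 512-byte page size is the standard VAX page size.

```python
# Assumption (inferred from this log, not from DEC docs): bits 28:9 of
# MEMCSR16 hold the physical page number of the failing location.
PAGE_SHIFT = 9        # low 9 bits are not part of the page number
PAGE_MASK = 0xFFFFF   # keep 20 bits, dropping the status bits above bit 28
PAGE_BYTES = 512      # VAX page size in bytes


def page_address(memcsr16):
    """Return the physical page number encoded in a MEMCSR16 value."""
    return (memcsr16 >> PAGE_SHIFT) & PAGE_MASK


def byte_address(memcsr16):
    """Return the byte address of the start of the failing page."""
    return page_address(memcsr16) * PAGE_BYTES


# The two MEMCSR16 values reported in this log.
for raw in (0x200B963D, 0x200B923D):
    print(hex(raw), "page", hex(page_address(raw)), "byte", hex(byte_address(raw)))
```

Under these assumptions both errors fall in pages x5CB and x5C9, matching uerf, which puts them below 1 MB of physical memory. That would be consistent with the MEMORY MODULE #1 / SYSTEM BANK = 0 decoding, provided the banks are mapped in ascending address order.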


********************************* ENTRY     3. *********************************

----- EVENT INFORMATION SEGMENT -----

EVENT CLASS                             ERROR EVENT 
OS EVENT TYPE                  113.     KA650 ERROR & STATUS REGS 
SEQUENCE NUMBER                335.
OPERATING SYSTEM                        ULTRIX 32 
OCCURRED/LOGGED ON                      Wed Oct  4 22:00:46 1989 EDT
SYSTEM ID                 x0A000003
OCCURRED ON SYSTEM                      rnh 

----- KA650 ERROR & STATUS REGS -----

CACR                      x00900D90
                                        2ND LEVEL CACHE ENABLED 
                                        CVAX CYCLE SPEED =    10 
DSER                      x00000000
QBEAR                     x0000000F     Q-22 BUS PAGE ADR =   xF
DEAR                      x00000000     MAIN MEM PAGE ADR =   x0
CBTCR                     xC0000004     CDAL BUS T/O INTRVAL  x4
                                        TIMEOUT DURING CPU/DMA READ OR WRITE 
IPCR0                         x0020
                                        LOCAL MEMORY EXT ACCESS ENABLED 
CADR                      x000000FC
                                        D STREAM MODE ENABLED 
                                        I STREAM MODE ENABLED 
                                        SET 1 ENABLED 
                                        SET 2 ENABLED 
MSER                      x00000000
                                        1ST LEVEL CACHE HIT 

********************************* ENTRY     4. *********************************

----- EVENT INFORMATION SEGMENT -----

EVENT CLASS                             OPERATIONAL EVENT 
OS EVENT TYPE                  250.     ASCII MSG 
SEQUENCE NUMBER                334.
OPERATING SYSTEM                        ULTRIX 32 
OCCURRED/LOGGED ON                      Wed Oct  4 22:00:46 1989 EDT
SYSTEM ID                 x0A000003
OCCURRED ON SYSTEM                      rnh 
MESSAGE                                 CRD interrupt 

********************************* ENTRY     5. *********************************

----- EVENT INFORMATION SEGMENT -----

EVENT CLASS                             OPERATIONAL EVENT 
OS EVENT TYPE                  250.     ASCII MSG 
SEQUENCE NUMBER                333.
OPERATING SYSTEM                        ULTRIX 32 
OCCURRED/LOGGED ON                      Wed Oct  4 22:00:46 1989 EDT
SYSTEM ID                 x0A000003
OCCURRED ON SYSTEM                      rnh 
MESSAGE                                 Hi-Rate CRD log 

********************************* ENTRY     6. *********************************

----- EVENT INFORMATION SEGMENT -----

EVENT CLASS                             ERROR EVENT 
OS EVENT TYPE                  101.     MEMORY ERROR 
SEQUENCE NUMBER                332.
OPERATING SYSTEM                        ULTRIX 32 
OCCURRED/LOGGED ON                      Wed Oct  4 22:00:46 1989 EDT
SYSTEM ID                 x0A000003
OCCURRED ON SYSTEM                      rnh 

----- UNIT INFORMATION -----

UNIT CLASS                              MEMORY 
UNIT TYPE                               MS650 
ERROR SYNDROME                          MEMORY CRD ERROR 

----- KA650 MEMORY ERROR REGS -----

CNTRLR NO                        1.
MEMCSR16                  x200B923D     ECC ERR SYNDROME      CORRECTED DATA 
                                                                BIT = 30 
                                        MAIN MEM: PHY PG ADR  x5C9
                                        MAIN MEM: CORRECTABLE ECC ERROR 
MEMCSR17                  x00001062     CHECK BITS =          x62
                                        ECC ENABLED 
                                        CRD INTERRUPT ENABLED 
                                        MAIN MEMORY CYCLE SEL = 5/3 
MEMCSRN                   x80000016
MEMCON                    x00020003     BANK ENABLES =        x3
                                        MS650-AA - MEMORY MODULE #1 
                                        SYSTEM BANK =         0.

UNKNOWN ERROR ENTRY - DUMP FOLLOWS

0000:     014C0044  252AB5CE  0A000003  00010301                    
0010:     00686E72  00000000  00000000  0AFF0065                    
0020:     FFFFFFFF  FFFFFFFF  00000001  01010001                    
0030:     200B923D  00001062  80000016  00020003                    
0040:     5E3C7E25                                                  
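
Although uerf labels the dump as an unknown entry, the raw words line up with the fields decoded above it. The following sketch parses the dump text and checks two things visible in this log: the four words at offset 0x30 equal MEMCSR16, MEMCSR17, MEMCSRN and MEMCON, and the word at offset 0x10 spells the host name when read as little-endian bytes. The helper name `parse_dump` is hypothetical, and the meaning of the remaining words is not assumed.

```python
import struct

# Dump text copied from this entry.
DUMP = """\
0000:     014C0044  252AB5CE  0A000003  00010301
0010:     00686E72  00000000  00000000  0AFF0065
0020:     FFFFFFFF  FFFFFFFF  00000001  01010001
0030:     200B923D  00001062  80000016  00020003
0040:     5E3C7E25
"""


def parse_dump(text):
    """Map byte offset -> 32-bit word for each longword in the dump."""
    words = {}
    for line in text.splitlines():
        offset_str, _, rest = line.partition(":")
        if not rest.strip():
            continue  # skip blank or malformed lines
        offset = int(offset_str, 16)
        for i, token in enumerate(rest.split()):
            words[offset + 4 * i] = int(token, 16)
    return words


words = parse_dump(DUMP)

# Four consecutive longwords starting at offset 0x30.
regs = [words[0x30 + 4 * i] for i in range(4)]

# Read the longword at 0x10 as little-endian bytes and drop the NUL padding.
host = struct.pack("<I", words[0x10]).rstrip(b"\0").decode("ascii")

print([hex(r) for r in regs], host)
```

The same check works on the dump in ENTRY 2, where the first two register words differ (x200B963D and x00000062).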


********************************* ENTRY     7. *********************************

----- EVENT INFORMATION SEGMENT -----

EVENT CLASS                             ERROR EVENT 
OS EVENT TYPE                  113.     KA650 ERROR & STATUS REGS 
SEQUENCE NUMBER                331.
OPERATING SYSTEM                        ULTRIX 32 
OCCURRED/LOGGED ON                      Wed Oct  4 22:00:46 1989 EDT
SYSTEM ID                 x0A000003
OCCURRED ON SYSTEM                      rnh 

----- KA650 ERROR & STATUS REGS -----

CACR                      x00900D90
                                        2ND LEVEL CACHE ENABLED 
                                        CVAX CYCLE SPEED =    10 
DSER                      x00000000
QBEAR                     x0000000F     Q-22 BUS PAGE ADR =   xF
DEAR                      x00000000     MAIN MEM PAGE ADR =   x0
CBTCR                     xC0000004     CDAL BUS T/O INTRVAL  x4
                                        TIMEOUT DURING CPU/DMA READ OR WRITE 
IPCR0                         x0020
                                        LOCAL MEMORY EXT ACCESS ENABLED 
CADR                      x000000FC
                                        D STREAM MODE ENABLED 
                                        I STREAM MODE ENABLED 
                                        SET 1 ENABLED