[net.micro] Is parity *really* worth it?

David.Anderson@CMU-CS-G.ARPA (08/01/84)

The Macintosh designers decided that parity wasn't worth it, because of
the board space it would consume, and it would be just another thing to
go wrong.  Maybe someone else knows how carefully they measured the
tradeoffs.

--david

phil@amd.UUCP (Phil Ngai) (08/02/84)

There are two arguments here:

1) Parity decreases your MTBF and increases your cost. It decreases
your MTBF because it adds more parts which can break. (please, no
flames yet) It increases your cost by more than just a couple of DRAM
chips. You also need 2 parity generators, the logic which detects
a parity fault at an appropriate time for sampling and can remember
the fault until told to forget, a free interrupt line, and logic
to force a parity error so you can verify the parity detectors work.
This is typically about 8 chips worth. I should know, I designed
parity into my last product. At the time I did not feel it was worth
it but the customer demanded it.

2) What if you get a soft error in your RAM and your data is corrupted.
How much is that worth? Although parity decreases the MTBF it increases
your confidence factor. One can argue that soft errors only happen
a few times a year but in many applications that is a big problem.

I find all this very interesting because I am at a stage where I can
implement or not implement parity in my current project. I haven't
decided yet.

-- 
 I'm going to keep boring until I strike oil.
 Phil Ngai (408) 982-6554
 UUCPnet: {ucbvax,decwrl,ihnp4,allegra,intelca}!amd!phil
 ARPAnet: amd!phil@decwrl.ARPA

jones@fortune.UUCP (08/03/84)

#R:sri-arpa:-58600:fortune:28000047:000:1324
fortune!jones    Aug  2 18:03:00 1984

The question of Parity or Error Detection and Correction (EDC) for
memory utimately boils down to the market for your machine.  If it
is intended for business applications or runs multiusers, I believe
it should have EDC.  My opinion is aligned with the vanishing
minority and is not marketable unless you are talking about Minis
or larger.  The serious user who values his data should at least
have parity.  I liken it to the power outage light on some freezers.
It's a bother and may not work but, if it does, it could save you
a barrel of money...or at least a small pocketbook.

For the average personal computer user (careful, Jones, move your
thumb away from that lighter) it's caveat emptor.  Hey, you know,
like errors are very rare and, like,  the memory is probably not
full anyway, and, like, if it was serious the system would probably
crash, so like, you'd know, right?  Confidence counts for a lot,
with me anyway.  I still remember the first TRS-80 manual I saw
which, to my wondering eyes, stated that the only way a user could
be *confident* that his program loaded correctly off of tape was to
load again and compare loads...Thanks awfully.

Dan Jones

UUCP:	{ihnp4,ucbvax!amd,hpda,sri-unix,harpo}!fortune!jones
DDD:	(415)594-2440
USPS:	Fortune Systems Corp, 101 Twin Dolphin Drive, Redwood City, CA 94065

BILLW@SRI-KL.ARPA (08/04/84)

Although parity increases your confidence that data in memory is
valid, it isnt clear to me (from a software point of veiw) that
this necessarrilly does the user any good.  In a critical application,
it could be a very bad thing for a word of memory to suddenly go
bad, but it could be worse for the system to crash with a "parity
error" message, and lose ALL your data.  Most software is not
equiped to handle parity errors in any reasonable manner.

Parity errors are pretty rare anyway.  How many people have
actually sen a parity error on their IBMPCs?

I get the impression that with the fancy ECC chips on the market
these days, it is easier to implement ECC than parity (although
more expensive, of course).  Is this impression correct?

BillW

cdh@BBNCCX.ARPA (08/04/84)

From:  "Carl D. Howe" <cdh@BBNCCX.ARPA>

Concerning ECC:  be careful.  Some of the fancy VLSI ECC chips have such
large die sizes and have been pushed to be so fast, that THEIR soft failure
rate is higher than the RAMs they protect!!

ECC also costs a lot more.  On the other hand, it does allow you to continue
running with completely nonfunctional chips, a feature if you are looking
at a nonstop type of application.

I guess my point is that all types of error detection/correction techniques
have their places.  The hard part is choosing the right point in the design
space to run your application.  Doing some rudimentary
failure calculations at least gives you a point where you can start making tradeoffs
and know whether you are winning or losing.

Carl

brownell@harvard.ARPA (Dave Brownell) (08/06/84)

Two points:

    1)	Designs that only have hardware error detection (parity, ECC,
	or what-have-you) are SERIOUS COPOUTS.  Get that lazy programmer
	to write some code that invalidates that block of RAM, logs
	the error, kills the process using it, and then lets the
	rest of the system continue!!!

    2)	Apparently not many here are aware, but there are a large number
	of market projections that say that "fault tolerant" systems
	are the hottest growth area ($$$) in computers over the next
	decade.  NOT second to micros, note.  There are a lot of people
	with lots of money out there who want their computers to be
	reliable.  I wouldn't mind a single bit error adding $100K to
	my bank account, but the bank sure would.

In short, YES, parity is worth it.  But only as part of a whole system
design, cut these half-a**ed efforts before I get violent.  You have to
be able to recover from the errors, not just detect them.





Dave Brownell
Sequoia Systems Inc.
{allegra,floyd,ihnp4,seismo}!harvard!sequoia!brownell

seifert@ihuxl.UUCP (D.A. Seifert) (08/07/84)

On a mini-based ATE system which I won't embarrass by giving
the name of here, parity was quite handy.  Not for the RAM, mind you,
for the hard disks.  Anytime I saw a parity error from a disk I
jumped on the phone to maintainance and suggested that they clean
the heads before we had a crash.  Parity made a nice early
warning system.  Would have been nice if they had had some decent
software. And some decent hardware.  Maybe someday I'll manage to forget.

-- 
	_____
       /_____\	how in blue blazes do they expect a EE to cook on a gas stove?
      /_______\	
	|___|			    Snoopy
    ____|___|_____	       ihnp4!ihuxl!seifert