[comp.arch] WANTED: Research papers on memory/DRAM access tweaks

bryce@cbmvax.commodore.com (Bryce Nesbitt) (05/03/90)

I'm looking for hard data on the performance tradeoffs of various tweaks
to get higher performance from memory subsystems.  Techniques such as
interleaving banks, fast page mode rams, static column, prefetching, etc.

Typically a penalty is associated with doing the "wrong" thing, while "correct"
memory access are faster.  There is a tradeoff point, and I wish to explore
the available data.



Here is a real-life example: a Motorola 68030 system.  With normal DRAM it
takes 4 cycles for a memory access.  With static column DRAM (SCRAM), page
misses are 5 cycles, hits with the same column address are 3 cycles.  On top
of SCRAM, 68030 burst cycles can be added at 2 cycles each.

		normal DRAM		  4

		SCRAM (page miss)         5
		SCRAM (page hit)          3
		SCRAM burst		  X + 2 + 2 + 2

Would you enable SCRAM mode?  Would you enable burst mode for instructions?
For data?  I have an answer to the above questions, but would like to compare
notes with published data.  What papers are available?

-- 
|\_/|  . ACK!, NAK!, EOT!, SOH!
{o O} .     Bryce Nesbitt, Commodore-Amiga, Inc.
 (")        BIX: bnesbitt
  U	    USENET: bryce@commodore.COM -or- uunet!cbmvax!bryce
Lawyers: America's untapped export market.

scott@labtam.oz (Scott Colwell) (05/04/90)

From article <11275@cbmvax.commodore.com>, by bryce@cbmvax.commodore.com (Bryce Nesbitt):
> I'm looking for hard data on the performance tradeoffs of various tweaks
> to get higher performance from memory subsystems.  Techniques such as
> interleaving banks, fast page mode rams, static column, prefetching, etc.
> 
> Typically a penalty is associated with doing the "wrong" thing, while "correct"
> memory access are faster.  There is a tradeoff point, and I wish to explore
> the available data.

James Goodman and Men-chow Chiang presented a paper at the '84 Symposium
on Computer Architecture titled "The Use of Static Column RAM as a
Memory Hierarchy" which presented some trace driven simulations for
a VAX and a PDP-11. It covers using page mode on multiple banks as a means
of 'cacheing' data.  While the simulations are for rather dated cpu
architectures, the information does provide some idea on what hit rates
need to be achieved to break-even for various numbers banks in page/static
column mode.

The data is predominantly for large numbers of banks which tends to
be unrealistic with 1M and 4M drams however.

(I used this paper as input to the design of a memory controller for
a 20MHz 80386 and achieved very satisfactory results.)


-- 
Scott Colwell
Labtam Information Systems P/L	net:	scott@labtam.oz.au
Melbourne, Australia 		phone:	+61-3-587-1444