[comp.arch] multiarchitecture chips

thornburg@mtsg.ubc.ca (Jonathan Thornburg) (12/11/90)

A number of recent comp.arch postings have discussed the notion of a
multiarchitecture CPU chip, one which has on the same chip one of each
of the currently popular CPU architectures, say AMD29000, i386, i860,
M68000, M88000, R3000, Sparc, and perhaps a few others I don't remember
right now.
 
I think such a chip would be considerably slower than a single
architecture CPU chip with the same number of transistors.  The reason
is that the single architecture chip can use the "extra" transistors to
boost performance by things like more cache, multiport register files,
superscalar execution units, fully parallel floating point arithmetic
units, etc.
 
For example, more cache almost always boosts performance.  Deeper
pipelining sometimes helps, but costs more chip area for the pipelining
registers, the extra control logic, and the extra register file ports.
VLIW or superscalar techniques (eg Multiflow or IBM RS6000 respectively)
can give factors of 2-4 improvements in performance, at a cost of 4-8
times more chip area.  (See Hennessey & Patterson sections 6.7 and 6.8
for details.)
 
A rereading of Hennessy & Patterson appendix A reveals *many* ways to
speed up floating point arithmetic if more transistors are available.
For example, p.A-45 states that 1990 chip technology can't fit a fully
parallel double precision FP multiplier on the same chip as the rest of
FP arithmetic, so today's FP chips use narrower multipliers and hence
are slower at DP FP multiply.
 
I could go on, but I think the point is clear:  Dividing 1 million
transistors into 7 140K-transistor CPUs will yeild 7 CPUs, each slower
than a 1M-transistor CPU.
 
- Jonathan Thornburg

  Dept of Geophysics & Astronomy
  The University of British Columbia        thornburg@mtsg.ubc.ca
  Vancouver     BC     V6T 1W5            userbkis@ubcmtsg.bitnet
  Canada                      ...!ubc-cs!ubcmtsg.bitnet!thornburg
--
      panon@ucs.ubc.ca             or          USERPAP1@UBCMTSG
                                   or          USERPAP1@mtsg.ubc.ca
Looking for a .signature? "We've already got one. It is ver-ry ni-sce!"

mlord@bwdls58.bnr.ca (Mark Lord) (12/12/90)

In article <1990Dec10.230226.13217@unixg.ubc.ca> thornburg@mtsg.ubc.ca (Jonathan Thornburg) writes:
< 
<I could go on, but I think the point is clear:  Dividing 1 million
<transistors into 7 140K-transistor CPUs will yeild 7 CPUs, each slower
<than a 1M-transistor CPU.

Quite possibly true, but speed is not the only design goal.
If it were, then we would only build Fugitsu [sic] supercomputers
(or maybe Crays), rather than wasting our time on 80486's and R3000's,
etc..

The idea with the multiple brains is that beyond a quickly approaching point,
it is difficult to provide increased  performance without investing much more
effort in to the rest of the design (off-chip).  So for THOSE applications,
what can one do with the extra half million transistors?  Aha.. why not make
the cpu more veratile:  with multiple brains, it is still of reasonable speed,
and can still be easy to interface/design around, AND we might be able to gain
a LARGE base of already-compatible software designed for other cpus.
-- 
 ___Mark S. Lord__________________________________________
| ..uunet!bnrgate!mlord%bmerh724 | Climb Free Or Die (NH) |
| MLORD@BNR.CA   Ottawa, Ontario | Personal views only.   |
|________________________________|________________________|