[comp.parallel] CLIPS benchmarks

moskowit@paul.rutgers.edu (Len Moskowitz) (10/10/90)

		   Sun 3/80 and Tadpole TP881V (88000 RISC) 
		    Benchmarks for CLIPS Production Systems
 
 
DISCUSSION
 
Back in June and July I posted a few queries about small scale parallel
architectures (less than 64 computing nodes) for hosting production system
languages (e.g., CMU's OPS5 and NASA's CLIPS).  Since that time we've narrowed
our choice of microprocessor to the Inmos Transputers (the T800 and T801) and
the Motorola 88000 RISC.  One of our constraints is that the final system must
occupy no more than three 6U VME slots.  Both Inmos and Transtech have 6U
boards (the B014 and the MCP500 respectively) that accommodate eight size-1
transputer modules ("TRAMS").  Tadpole Technologies has a three board set based
on Motorola's Hypermodule that hosts eight 88100 CPUs and sixteen 88200
cache/MMUs.
 
We decided to initially benchmark single processor configurations of the
Transputer and 88000 versus our Sun development system and embedded target
hardware (the Motorola MVME-147).  I reported on the results of the Transputer
benchmarks in a previous post.  This report presents the results of the 88000
benchmarks and compares them with the performance of the Transputer, Sun 3/80,
and Motorola MVME-147 board.
 
Tadpole Technology loaned us a single processor version of their TP881V 88000
VME development system for evaluation.  It consisted of three boards: a
Processor module holding a single 20 megaHertz 88100 CPU, 2 88200 Cache Memory
Management Units, and 8 megabytes of memory (it can hold up to 4 25 megaHertz
88100s, 8 88200s, and up to 32 megabytes of memory); a Base module providing a
VME interface, disk i/o, network i/o, and console i/o; and a VSB module that in
our system held no memory at all and was unused.  The three boards resided in a
stand-alone VME chassis.  A simple CRT terminal acted as console.  The Tadpole
system connected to our Sun network via NFS.
 
It took less than an hour to set up the Tadpole system and to bring it to our
Sun network.  Porting of CLIPS 4.3 and our application to the TP881V using
Tadpole's TP-CDS C compiler took less than 6 hours.  We were able to use the
benchmarks used for the Transputer, the Sun, and the Motorola board with no
modifications.  We required no technical assistance to get the benchmark
running.  This is a tribute to both CLIPS' portability and the efficiency of
the Tadpole tools.
 
Our application programs are typically developed on Suns and ported to a
Motorola MVME-147 single board computer that is embedded in our products.  The
-147 uses a 25 megaHertz 68030 and 68882.  The Sun 3/80 uses a 20 megaHertz
68030 and 68882 floating point coprocessor.  The TP881V we evaluated uses a 20
megaHertz 88100.  The -147 runs VxWorks 4.0, a real time UNIX kernel.  The Suns
run SunOS 4.1.  The Tadpole TP881V runs TP-IX/88K, a multi-processor UNIX
operating system for the 88000, based on kernel technology licensed from Data
General.  The -147 uses 70 nanosecond, 1 wait state memory.  We don't know the
memory parameters for the Sun or the Tadpole.  Both the Sun and the -147 run
code generated by the same Sun C compiler, albeit linked with different
operating system service libraries.  The Tadpole used code generated by the
Tadpole TP-CDS C compiler.  All processes on the -147 other than the benchmark
were suspended (these suspended processes occupied no more than one percent of
the processing capacity of the -147).  The Sun's usual processes and their
priorities weren't interfered with in any way except that remote logins were
disabled.  The Tadpole system was in multi-user mode but ran only the benchmark
program; no other processes were active.
 
The benchmark, an expert system for waveform analysis consisting of 71 rules
averaging 4 condition elements and 4 actions, performed no output to the
display and required no disk access.  All data required for execution (the
working memory elements) were preloaded into working memory.  The rules were
pre-compiled.
 
RESULTS
 
The timing results reported below are the average ratios for 20 different data
sets.  Timing ratios (TP881V versus Sun 3/80) were very constant for the 20
data sets, varying less than 9 percent from the mean.  Timings were measured
from the invocation of the production system until rule firing ceased.
 
The 20 rule sets took between 83.98 and 201.09 seconds to run on the Sun 3/80.
Timing ratios varied from a low of 9.15 to a high of 10.18.
 
The Tadpole TP881V with a single 20 megaHertz 88100 CPU was on average 9.93
times faster than the Sun 3/80.  As reported in the previous benchmark, the
B404-5A Transputer was 2.24 times faster than the Sun 3/80 making the Tadpole
TP881V roughly 5 times faster than the B404-5A Transputer.  However, recent
timings performed by INMOS using our benchmarks on the fastest transputer
module available to them today (a B428-5) show that it is 4 times faster than
the Sun 3/80.  This leads us to believe that the 20 megaHertz TP881V is 2.45
times faster than the B428-5 Transputer.
 
In previous benchmarks the -147 board was uniformly about fifty percent faster
than the Sun 3/80, making the Tadpole TP881V roughly 7 times faster than the
-147.
 
We expect that the readily available 25 megaHertz version of the TP881V will
significantly better these figures.
 
CREDITS
 
The benchmarks were designed and run by Dr. Elliott Greene at Bendix Test
Systems Division, Teterboro, New Jersey, 07608, USA.
 
Mike Roche at Tadpole's Long Island (New York) office, and Alice Stifter at the
Austin, Texas sales office were both very helpful.  We thank them both for the
loan of the evaluation system and their support.