[comp.windows.x] xfroot timing update

ejk@ux1.cso.uiuc.edu (Ed Kubaitis) (10/06/89)

Here is an updated list of xfroot fractal-points/processor_second 
measured on various clients. The number, a count of trips/second
through the 8 line "hopalong" loop in xfroot, is a rough index of scalar 
double-precision floating point uniprocessor speed.

       Cray X-MP             157,000 to 194,000*     (*=per processor)
       Cray 2                129,000 to 183,000*     
       DEC DS5800             95,000 to 115,000
       DEC DS5400             77,000 to  91,000    
       DEC DS3100             58,000 to  75,000
       Gould NP1              44,000 to  60,000* 
       DEC Vax 6400 (vcc)     50,000 to  57,000
       Convex C200 (vc -O2)   49,000 to  55,000*
       Convex C200            41,000 to  47,000*
       Sun Sparcstation 1    ~25,000 to        
       DEC MV3900 (vcc)       22,900 to  26,100
       Sun 4/380             ~21,000 to ~23,000
       DEC Vax 6220           16,800 to  19,200
       DEC Vaxserver 3500     13,200 to  15,200
       Sun 3/50 (gcc 68881)             ~11,200
       Sequent Symmetry        9,900 to  10,500*
       Sun 3/60 (-f 68881)     8,000 to   8,750
       DEC Vaxstation 2000/vcc 5,530 to   6,330
       Sun 3/50 (-f 68881)     5,480 to   6,080
       DEC Vaxstation 2000     4,670 to   5,530
       Sun 3/60                1,960 to   2,060
       Sun 3/50                1,270 to   1,330
       Sun 3/160 (no fpa)                  ~950

Thanks to:  bryan%kewill@uunet.uu.net, eric@geology.tn.cornell,
evans@decvax.dec.com, glenn@mathcs.emory.edu, harrison@decwrl.dec.com, 
hleroy@erisa.fr, howard@aic.hrl.hac.com, kline@ux1.cso.uiuc.edu,  
for sharing their results.

I would appreciate hearing about measurements on other clients, or results
differing significantly from those above.  To perform your own 
xfroot micro-benchmark:

	1. Get xfroot/part01 (V5-I3) and xfroot/patch1(V5-I7) from
	   comp.sources.x.
	2. Install xfroot on the client to be tested, taking care
	   that you have verified the definition of HZ in xfroot.c.
	   (See the README.)
	3. Make the following two runs:

	      xfroot -a 0.1 -b 0.1 -c 0.1    (lower bound)
	      xfroot -a 3000 -b 3000 -c 3000 (upper bound)
-------------------------
Ed Kubaitis (ejk@ux1.cso.uiuc.edu)
Computing Services Office - University of Illinois, Urbana

ejk@ux1.cso.uiuc.edu (Ed Kubaitis) (10/15/89)

Here is the 2nd updated list of xfroot fractal-points/processor_second 
measured on various clients. The number, a count of trips/second
through the 8 line "hopalong" loop in xfroot, is a rough index of scalar 
double-precision floating point uniprocessor speed. New items since the
last posting are marked with ">".

       Cray X-MP                157,000 to 194,000*     (*=per processor)
       Cray 2                   129,000 to 183,000*     
    >  Convex C2 (gcc)          117,000 to 151,000*
    >  Convex C2 (vc3/fastmath) 108,000 to 138,000*
    >  Convex C2 (vc3)           99,000 to 118,000*
       DEC DS5800                95,000 to 115,000
       DEC DS5400                77,000 to  91,000    
       DEC DS3100                58,000 to  75,000
       Gould NP1                 44,000 to  60,000* 
       DEC Vax 6400 (vcc)        50,000 to  57,000
       Convex C2 (vc2)           49,000 to  55,000*
       Convex C2 (cc)            41,000 to  47,000*
    >  Dec Vax 8650              28,000 to  33,000 
       Sun Sparcstation 1       ~25,000 to        
       DEC MV3900 (vcc)          22,900 to  26,100
    >  DG AViiON (88k 16.7 MHz)  17,200 to  24,200
       Sun 4/380                ~21,000 to ~23,000
    >  Dec Vax 8530              19,700 to  23,200   
    >  Dec Vax 8600              19,700 to  22,400
       DEC Vax 6220              16,800 to  19,200
    >  DEC MV3200 (vcc)          15,400 to  17,500
    >  IBM RT 135 (-f2 -lfm)     15,200 to  17,400
    >  DEC MV3600 (vcc)          14,500 to  17,400
    >  HP9000/360                13,700 to  15,200
       DEC Vaxserver 3500        13,200 to  15,200
    >  Dec Vaxstation 3100       13,000 to  15,100
    >  IBM RT 135                10,600 to  11,500
    >  HP9000/350                10,500 to  11,500
       Sun 3/50 (gcc 68881)                ~11,200
       Sequent Symmetry           9,900 to  10,500*
       Sun 3/60 (-f 68881)        8,000 to   8,750
    >  IBM RT 125                 7,200 to   7,600
       DEC Vaxstation 2000/vcc    5,530 to   6,330
    >  HP9000/330                 5,730 to   6,230
    >  DEC Vax 780                5,410 to   6,170
    >  HP9000/320                 5,580 to   6,150
       Sun 3/50 (-f 68881)        5,480 to   6,080
       DEC Vaxstation 2000        4,670 to   5,530
    >  DEC MVII   (cc)            4,160 to   5,210
    >  DEC MVII   (vcc)           4,080 to   5,070
       Sun 3/60                   1,960 to   2,060
       Sun 3/50                   1,270 to   1,330
       Sun 3/160 (no fpa)                     ~950
    >  DEC Vax 730                  340 to     360

A few notes on the results: gcc enhancements are due to inline code for 
sqrt and fabs. The three top Convex C2 measurements use compilers/libraries
that exploit the C2 hardware sqrt. It pays to shop around for the best 
compiler/options/libraries available for your floating point intensive code.

Thanks to:  bryan%kewill@uunet.uu.net, csmith@convex.com, 
eric@geology.tn.cornell, evans@decvax.dec.com, glenn@mathcs.emory.edu, 
harrison@decwrl.dec.com, hleroy@erisa.fr, howard@aic.hrl.hac.com, 
idallen@watgcl.waterloo.edu, jw@pan.uucp, kline@ux1.cso.uiuc.edu,  
markw@airgun.wg.waii.com, rauletta@gmuvax2.gmu.edu,
steved@longs.lance.colostate.edu,
for sharing their results.

I would appreciate hearing about measurements on other clients, or results
differing significantly from those above.  To perform your own 

	1. Get xfroot/part01 (V5-I3) and xfroot/patch1(V5-I7) from
	   comp.sources.x.
	2. Install xfroot on the client to be tested, taking care
	   that you have verified the definition of HZ in xfroot.c.
	   (See the README.)
	3. Make the following two runs:

	      xfroot -a 0.1 -b 0.1 -c 0.1    (lower bound)
	      xfroot -a 3000 -b 3000 -c 3000 (upper bound)


-------------------------
Ed Kubaitis (ejk@ux1.cso.uiuc.edu)
Computing Services Office - University of Illinois, Urbana

ejk@ux1.cso.uiuc.edu (Ed Kubaitis) (10/22/89)

Here is the 3rd updated list of xfroot fractal-points/processor_second 
measured on various clients. The number, a count of trips/second
through the 8 line "hopalong" loop in xfroot, is a rough index of scalar 
double-precision floating point uniprocessor speed. New items since the
last posting are marked with ">".

    >  Cray 2    (scc)          304,000 to 619,000*     (*=per processor)
    >  Cray Y-MP (scc)          316,000 to 476,000*
    >  Cray X-MP (scc)          283,000 to 415,000*
       Cray X-MP (cc)           157,000 to 194,000*
       Cray 2    (cc)           129,000 to 183,000*     
       Convex C2 (gcc)          117,000 to 151,000*
       Convex C2 (vc3/fastmath) 108,000 to 138,000*
       Convex C2 (vc3)           99,000 to 118,000*
       DEC DS5800                95,000 to 115,000
    >  HP9000/853CHX             66,000 to  92,000
       DEC DS5400                77,000 to  91,000    
       DEC DS3100                58,000 to  75,000
       Gould NP1                 44,000 to  60,000* 
       DEC Vax 6400 (vcc)        50,000 to  57,000
       Convex C2 (vc2)           49,000 to  55,000*
       Convex C2 (cc)            41,000 to  47,000*
       Dec Vax 8650              28,000 to  33,000 
       Sun Sparcstation 1       ~25,000 to        
    >  HP9000/370 (ffpa)         24,000 to  28,000
    >  Titan                     22,800 to  27,100
       DEC MV3900 (vcc)          22,900 to  26,100
       DG AViiON (88k 16.7 MHz)  17,200 to  24,200
    >  Sun 4/260                 21,100 to  23,600
       Sun 4/280                ~21,000 to ~23,000
       Dec Vax 8530              19,700 to  23,200   
       Dec Vax 8600              19,700 to  22,400
       DEC Vax 6220              16,800 to  19,200
       DEC MV3200 (vcc)          15,400 to  17,500
       IBM RT 135 (-f2 -lfm)     15,200 to  17,400
       DEC MV3600 (vcc)          14,500 to  17,400
    >  HP9000/370                15,900 to  17,300
    >  IBM RT125 (afpa)          13,900 to  16,000
       HP9000/360                13,700 to  15,200
       DEC Vaxserver 3500        13,200 to  15,200
       Dec Vaxstation 3100       13,000 to  15,100
    >  Sun 3/60 (-O4 lib/f68881) 12,900 to  14,000
       IBM RT 135                10,600 to  11,500
       HP9000/350                10,500 to  11,500
       Sun 3/50 (gcc 68881)                ~11,200
       Sequent Symmetry           9,900 to  10,500*
       Sun 3/60 (-f 68881)        8,000 to   8,750
    >  386(25MHz)/387 (cc 386/ix) 7,000 to   8,200
    >  HP9000/330                 7,280 to   7,910
       IBM RT 125                 7,200 to   7,600
       DEC Vaxstation 2000/vcc    5,530 to   6,330
       HP9000/330                 5,730 to   6,230
    >  386(25MHz)/387 (gcc)       6,000 to   6,200
       DEC Vax 780                5,410 to   6,170
       HP9000/320                 5,580 to   6,150
       Sun 3/50 (-f 68881)        5,480 to   6,080
       DEC Vaxstation 2000        4,670 to   5,530
       DEC MVII   (cc)            4,160 to   5,210
       DEC MVII   (vcc)           4,080 to   5,070
       Sun 3/60                   1,960 to   2,060
       Sun 3/50                   1,270 to   1,330
       Sun 3/160 (no fpa)                     ~950
       DEC Vax 730                  340 to     360

A few notes on the results: the Cray scc compiler uses the same backend
as their Fortran compiler.  gcc enhancements are due to inline code for 
sqrt and fabs. The three top Convex C2 measurements use compilers/libraries
that exploit the C2 hardware sqrt. It pays to shop around for the best 
compiler/options/libraries available for your floating point intensive code.

Thanks to:  bav@hobbes.ksu.ksu.edu, bryan%kewill@uunet.uu.net, bt@irfu.se, 
csmith@convex.com, eric@geology.tn.cornell, evans@decvax.dec.com, 
glenn@mathcs.emory.edu, harrison@decwrl.dec.com, hleroy@erisa.fr, 
howard@aic.hrl.hac.com, hrp@boring,cray.com, idallen@watgcl.waterloo.edu, 
jpb@sn2024.cray.com, jw@pan.uucp, kline@ux1.cso.uiuc.edu,  
markw@airgun.wg.waii.com, rauletta@gmuvax2.gmu.edu, 
steved@longs.lance.colostate.edu, tac@csl.ncsu.edu, tpf@jdyx.uucp,
for sharing their results.

I would appreciate hearing about measurements on other clients, or results
differing significantly from those above.  To perform your own:

	1. Get xfroot/part01 (V5-I3) and xfroot/patch1(V5-I7) from
	   comp.sources.x.
	2. Install xfroot on the client to be tested, taking care
	   that you have verified the definition of HZ in xfroot.c.
	   (See the README.)
	3. Make the following two runs:

	      xfroot -a 0.1 -b 0.1 -c 0.1    (lower bound)
	      xfroot -a 3000 -b 3000 -c 3000 (upper bound)


-------------------------
Ed Kubaitis (ejk@ux1.cso.uiuc.edu)
Computing Services Office - University of Illinois, Urbana

mayer@hplabsz.HPL.HP.COM (Niels Mayer) (10/23/89)

In article <1989Oct22.161408.27784@ux1.cso.uiuc.edu> ejk@ux1.cso.uiuc.edu (Ed Kubaitis) writes:
>Here is the 3rd updated list of xfroot fractal-points/processor_second 
>measured on various clients.
> ...
>    >  HP9000/853CHX             66,000 to  92,000
        ^^^^^^^^^^^^^

No such machine exists, but there is a HP9000/835CHX (14vMIPS/PA-RISC CPU).
I'm assuming the "853" was a typo.

-------------------------------------------------------------------------------
	    Niels Mayer -- hplabs!mayer -- mayer@hplabs.hp.com
		  Human-Computer Interaction Department
		       Hewlett-Packard Laboratories
			      Palo Alto, CA.
				   *

ejk@ux1.cso.uiuc.edu (Ed Kubaitis) (10/23/89)

The third posting indeed transposed two digits in the machine name for
xfroot timings reported for HP9000/835CHX. Also, a note attributing improved
                                    ^^
timings reported for the HP9000/330 to an improved compiler with
HP-UX 6.5 was inadvertently omitted from this posting.

-------------------------
Ed Kubaitis (ejk@ux1.cso.uiuc.edu)
Computing Services Office - University of Illinois, Urbana

ejk@ux1.cso.uiuc.edu (Ed Kubaitis) (10/29/89)

Here is the 4th updated list of xfroot fractal-points/processor_second 
measured on various clients. The number, a count of trips/second
through the 9 line "hopalong" loop in xfroot, is a rough index of scalar 
double-precision floating point uniprocessor speed. The lower number 
represents a case where nearly all points are in-range and thus require
additional integer arithmetic, bit manipulation, and memory accesses to
record the point. The higher number reflects a case when most points are
out of range and most time is spent in floating point arithmetic.  The
numbers in parentheses are VAX 780 equivalents. "*" indicates values for 
a single processor.  New items since the last posting are marked with ">".

       Cray 2    (scc)                 304000 (56.2)    619000(100.3)*
       Cray Y-MP (scc)                 316000 (58.4)    476000 (77.1)*
       Cray X-MP (scc)                 283000 (52.3)    415000 (67.3)*
       Cray X-MP (cc)                  157000 (29.0)    194000 (31.4)*
       Cray 2    (cc)                  129000 (23.8)    183000 (29.7)*
       Convex C2 (gcc)                 117000 (21.6)    151000 (24.5)*
       Convex C2 (vc3/fastmath)        108000 (20.0)    138000 (22.4)*
       Convex C2 (vc3)                  99000 (18.3)    118000 (19.1)*
       DEC DS5800                       95000 (17.6)    115000 (18.6)*
       HP9000/835CHX                    66000 (12.2)     92000 (14.9) 
       DEC DS5400                       77000 (14.2)     91000 (14.7) 
       DEC DS3100                       58000 (10.7)     75000 (12.2) 
    >  Solbourne Series5 Cypress        58000 (10.7)     67000 (10.9) 
       Gould NP1                        44000  (8.1)     60000  (9.7)*
       DEC Vax 6400 (vcc)               50000  (9.2)     57000  (9.2) 
       Convex C2 (vc2)                  49000  (9.1)     55000  (8.9)*
       Convex C2 (cc)                   41000  (7.6)     47000  (7.6)*
       Dec Vax 8650                     28000  (5.2)     33000  (5.3) 
       Sun Sparcstation 1              ~25000  (4.6)       ???  (???) 
       HP9000/370 (ffpa)                24000  (4.4)     28000  (4.5) 
       Titan                            22800  (4.2)     27100  (4.4) 
       DEC MV3900 (vcc)                 22900  (4.2)     26100  (4.2) 
       DG AViiON (88k 16.7 MHz)         17200  (3.2)     24200  (3.9) 
       Sun 4/260                        21100  (3.9)     23600  (3.8) 
       Dec Vax 8530                     19700  (3.6)     23200  (3.8) 
       Sun 4/280                       ~21000  (3.9)    ~23000  (3.7) 
       Dec Vax 8600                     19700  (3.6)     22400  (3.6) 
       DEC Vax 6220                     16800  (3.1)     19200  (3.1) 
       DEC MV3200 (vcc)                 15400  (2.8)     17500  (2.8) 
       IBM RT 135 (-f2 -lfm)            15200  (2.8)     17400  (2.8) 
       DEC MV3600 (vcc)                 14500  (2.7)     17400  (2.8) 
       HP9000/370                       15900  (2.9)     17300  (2.8) 
       IBM RT125 (afpa)                 13900  (2.6)     16000  (2.6) 
       HP9000/360                       13700  (2.5)     15200  (2.5) 
       DEC Vaxserver 3500               13200  (2.4)     15200  (2.5) 
       Dec Vaxstation 3100              13000  (2.4)     15100  (2.4) 
    >  Sun 386i/250 Weitek (cc)         14000  (2.6)     14800  (2.4) 
       Sun 3/60 (-O4 lib/f68881)        12900  (2.4)     14000  (2.3) 
    >  Sun 3/50 (gcc 68881)             10500  (1.9)     12700  (2.1) 
       IBM RT 135                       10600  (2.0)     11500  (1.9) 
       HP9000/350                       10500  (1.9)     11500  (1.9) 
       Sequent Symmetry                  9900  (1.8)     10500  (1.7)*
       Sun 3/60 (-f 68881)               8000  (1.5)      8750  (1.4) 
       386/25 + 387   (cc 386/ix)        7000  (1.3)      8200  (1.3) 
       HP9000/330 (HP-UX 6.5 cc)         7280  (1.3)      7910  (1.3) 
       IBM RT 125                        7200  (1.3)      7600  (1.2) 
       DEC Vaxstation 2000/vcc           5530  (1.0)      6330  (1.0) 
       HP9000/330                        5730  (1.1)      6230  (1.0) 
       386/25 + 387   (gcc)              6000  (1.1)      6200  (1.0) 
       DEC Vax 780                       5410  (1.0)      6170  (1.0) 
       HP9000/320                        5580  (1.0)      6150  (1.0) 
       Sun 3/50 (-f 68881)               5480  (1.0)      6080  (1.0) 
       DEC Vaxstation 2000               4670  (0.9)      5530  (0.9) 
       DEC MVII   (cc)                   4160  (0.8)      5210  (0.8) 
       DEC MVII   (vcc)                  4080  (0.8)      5070  (0.8) 
       Sun 3/60                          1960  (0.4)      2060  (0.3) 
       Sun 3/50                          1270  (0.2)      1330  (0.2) 
       Sun 3/160 (no fpa)                 ???  (???)      ~950  (0.2) 
    >  Sun 2/120 (no fpu - cc)            530  (0.1)       560  (0.1) 
       DEC Vax 730                            340  (0.1)       360  (0.1) 
       386/25 (386/ix - no 387)               259  (0.0)       260  (0.0) 

A few notes on the results: the Cray scc compiler uses the same backend
as their Fortran compiler.  gcc enhancements are due to inline code for 
sqrt and fabs. The three top Convex C2 measurements use compilers/libraries
that exploit the C2 hardware sqrt. It pays to shop around for the best 
compiler/options/libraries available for your floating point intensive code.

Thanks to:  bav@hobbes.ksu.ksu.edu, bryan%kewill@uunet.uu.net, bt@irfu.se, 
csmith@convex.com, csu@alembic.acs.com, eric@geology.tn.cornell, 
dave@rutgers.edu, evans@decvax.dec.com, glenn@mathcs.emory.edu, 
harrison@decwrl.dec.com, hleroy@erisa.fr, howard@aic.hrl.hac.com, 
hrp@boring.cray.com, idallen@watgcl.waterloo.edu, jpb@sn2024.cray.com, 
jw@pan.uucp, kline@ux1.cso.uiuc.edu,  markw@airgun.wg.waii.com, 
paul@db0tui66.bitnet, rauletta@gmuvax2.gmu.edu, skam@solbourne.com,
steved@longs.lance.colostate.edu, tac@csl.ncsu.edu, tpf@jdyx.uucp,
for sharing their results.

I would appreciate hearing about measurements on other clients, or results
differing significantly from those above.  To perform your own:

	1. Get xfroot/part01 (V5-I3) and xfroot/patch1(V5-I7) from
	   comp.sources.x. These are available via anonymous ftp from
	   uunet.uu.net. While they will eventually be found there in
	   comp.sources.x/volume5, as of this writing they are in
	   comp.sources.x/new/890924.0.Z and 890929.0. If you don't
	   have ftp access to uunet.uu.net, I will be happy to mail
	   a copy (~700 lines.)
	2. Install xfroot on the client to be tested, taking care
	   that you have verified the definition of HZ in xfroot.c.
	   (See the README.)
	3. Make the following two runs:

	      xfroot -a 0.1 -b 0.1 -c 0.1    (lower bound)
	      xfroot -a 3000 -b 3000 -c 3000 (upper bound)


-------------------------
Ed Kubaitis (ejk@ux1.cso.uiuc.edu)
Computing Services Office - University of Illinois, Urbana

-------------------------
Ed Kubaitis (ejk@ux1.cso.uiuc.edu)
Computing Services Office - University of Illinois, Urbana

ejk@ux1.cso.uiuc.edu (Ed Kubaitis) (11/06/89)

Here is the 5th updated list of xfroot fractal-points/processor_second 
measured on various clients. The number, a count of trips/second
through the 9 line "hopalong" loop in xfroot, is a rough index of scalar 
double-precision floating point uniprocessor speed. The lower number 
represents a case where nearly all points are in-range and thus require
additional integer arithmetic, bit manipulation, and memory accesses to
record the point. The higher number reflects a case when most points are
out of range and most time is spent in floating point arithmetic.  
   
      Key:   () : Vax 780 equivalents
             *  : For a single processor
             +  : Using hardware square root
	     >  : New since last posting

      304000 (56.2)   619000(100.3)*     Cray 2    (scc)         
      316000 (58.4)   476000 (77.1)*     Cray Y-MP (scc)         
      283000 (52.3)   415000 (67.3)*     Cray X-MP (scc)         
      143000 (26.4)   195000 (31.6)*+  > ETA-10 G                
      157000 (29.0)   194000 (31.4)*     Cray X-MP (cc)          
      129000 (23.8)   183000 (29.7)*     Cray 2    (cc)          
      117000 (21.6)   151000 (24.5)*+    Convex C2 (gcc)         
      108000 (20.0)   144000 (23.3)    > SGI Iris 4D/240 (-lfastm)
      108000 (20.0)   138000 (22.4)*+    Convex C2 (vc3/fastmath)
       99000 (18.3)   118000 (19.1)*+    Convex C2 (vc3)         
       95000 (17.6)   115000 (18.6)      DEC DS5800              
       89000 (16.5)   111000 (18.0)    > SGI Iris 4D/240         
       73000 (13.5)    94000 (15.2)+   > Sun 4/370 (f77/libm.i1) 
       66000 (12.2)    92000 (14.9)      HP9000/835CHX           
       77000 (14.2)    91000 (14.7)      DEC DS5400              
       58000 (10.7)    75000 (12.2)      DEC DS3100              
       61000 (11.3)    70000 (11.3)    > Tektronix XD88/30       
       52000  (9.6)    69000 (11.2)+   > Sun 4/280               
       58000 (10.7)    67000 (10.9)      Solbourne Series5 Cypress  
       49000  (9.1)    60000  (9.7)*   > Gould NP1               
       50000  (9.2)    57000  (9.2)      DEC Vax 6400 (vcc)      
       49000  (9.1)    55000  (8.9)*     Convex C2 (vc2)         
       45000  (8.3)    54000  (8.8)    > SGI Iris 4D/70-GT       
       42000  (7.8)    48000  (7.8)    > Sun 4/370 (libm.i1)     
       41000  (7.6)    47000  (7.6)*     Convex C2 (cc)          
       41000  (7.6)    47000  (7.6)    > Sun 4/370               
       28000  (5.2)    33000  (5.3)      Dec Vax 8650            
       20800  (3.8)    28900  (4.7)    > Sun SPARCstation 1 (sqrt.i1/libm.i1)
       24000  (4.4)    28000  (4.5)      HP9000/370 (ffpa)       
       24700  (4.6)    27800  (4.5)    > Sun SPARCstation 1 (gcc)
       22800  (4.2)    27100  (4.4)      Titan                   
       22900  (4.2)    26100  (4.2)      DEC MV3900 (vcc)        
       19900  (3.7)    25200  (4.1)    > Sun SPARCstation 1          
       17200  (3.2)    24200  (3.9)      DG AViiON (88k 16.7 MHz)
       22300  (4.1)    23700  (3.8)    > 386/33 + 387   (cc 386/ix)
       21100  (3.9)    23600  (3.8)      Sun 4/260               
       20100  (3.7)    23400  (3.8)*   > Sequent Symmetry (fpa)  
       19700  (3.6)    23200  (3.8)      Dec Vax 8530            
       21000  (3.9)    23000  (3.7)      Sun 4/280               
       19700  (3.6)    22400  (3.6)      Dec Vax 8600            
       16800  (3.1)    19200  (3.1)      DEC Vax 6220            
       16800  (3.1)    17600  (2.9)    > 386/33 + 387 (gcc 1.35) 
       15400  (2.8)    17500  (2.8)      DEC MV3200 (vcc)        
       15200  (2.8)    17400  (2.8)      IBM RT 135 (-f2 -lfm)   
       14500  (2.7)    17400  (2.8)      DEC MV3600 (vcc)        
       15900  (2.9)    17300  (2.8)      HP9000/370              
       13900  (2.6)    16000  (2.6)      IBM RT125 (afpa)        
       13700  (2.5)    15200  (2.5)      HP9000/360              
       13200  (2.4)    15200  (2.5)      DEC Vaxserver 3500      
       13000  (2.4)    15100  (2.4)      Dec Vaxstation 3100     
       14000  (2.6)    14800  (2.4)      Sun 386i/250 Weitek (cc)
       12900  (2.4)    14000  (2.3)      Sun 3/60 (-O4 lib/f68881)
       10500  (1.9)    12700  (2.1)      Sun 3/50 (gcc 68881)     
       10600  (2.0)    11500  (1.9)      IBM RT 135               
       10500  (1.9)    11500  (1.9)      HP9000/350               
        9900  (1.8)    10500  (1.7)*     Sequent Symmetry         
        9200  (1.7)     9700  (1.6)    > IBM RT 115 (4.3BSD High C 2.1)
        8000  (1.5)     8750  (1.4)      Sun 3/60 (-f 68881)      
        7000  (1.3)     8200  (1.3)      386/25 + 387   (cc 386/ix)
        7300  (1.3)     8000  (1.3)    > IBM RT 115 (4.3BSD High C 1.4)
        7280  (1.3)     7910  (1.3)      HP9000/330 (HP-UX 6.5 cc) 
        7200  (1.3)     7600  (1.2)      IBM RT 125                
        5530  (1.0)     6330  (1.0)      DEC Vaxstation 2000/vcc   
        5730  (1.1)     6230  (1.0)      HP9000/330                
        6000  (1.1)     6200  (1.0)      386/25 + 387   (gcc)      
        5410  (1.0)     6170  (1.0)      DEC Vax 780               
        5580  (1.0)     6150  (1.0)      HP9000/320                
        5480  (1.0)     6080  (1.0)      Sun 3/50 (-f 68881)       
        4670  (0.9)     5530  (0.9)      DEC Vaxstation 2000       
        4160  (0.8)     5210  (0.8)      DEC MVII   (cc)           
        4080  (0.8)     5070  (0.8)      DEC MVII   (vcc)          
        1960  (0.4)     2060  (0.3)      Sun 3/60                  
        1270  (0.2)     1330  (0.2)      Sun 3/50                  
         ???  (???)      950  (0.2)      Sun 3/160 (no fpa)        
         530  (0.1)      560  (0.1)      Sun 2/120 (no fpu - cc) 
         340  (0.1)      360  (0.1)      DEC Vax 730               
         259  (0.0)      260  (0.0)      386/25 (386/ix - no 387)  

A few notes on the results: the Cray scc compiler uses the same backend
as their Fortran compiler.  gcc enhancements are due to inline code for 
sqrt and fabs. Strikingly different results for the same system show
that it pays to shop around for the best compiler/options/libraries 
available for your floating point intensive code.

Thanks to:  archer@sgi.com, bauer@loligo.cc.fsu.edu, bav@hobbes.ksu.ksu.edu, 
bryan%kewill@uunet.uu.net, bt@irfu.se, csmith@convex.com, csu@alembic.acs.com,
eric@geology.tn.cornell, dave@rutgers.edu, david@torsqnt.uucp,
evans@decvax.dec.com, garyc@quasi.wv.tek.com, glenn@mathcs.emory.edu, 
harrison@decwrl.dec.com, hleroy@erisa.fr, howard@aic.hrl.hac.com, 
hrp@boring.cray.com, idallen@watgcl.waterloo.edu, jpb@sn2024.cray.com, 
jw@pan.uucp, ken@cs.toronto.edu, kline@ux1.cso.uiuc.edu, 
ksp@maxwell.nde.swri.edu, mark@zok.uucp, markw@airgun.wg.waii.com, 
moraes@csri.toronto.edu, paul@db0tui66.bitnet, rauletta@gmuvax2.gmu.edu, 
skam@solbourne.com, steved@longs.lance.colostate.edu, tac@csl.ncsu.edu, 
thp@westhawk.uucp, tpf@jdyx.uucp, wesommer@athena.mit.edu,
zimet@sequoia.berkeley.edu, for sharing their results.  (Please assume the 
standard disclaimers for all.)

I would appreciate hearing about measurements on other clients, or results
differing significantly from those above.  To perform your own:

	1. Get xfroot/part01 (V5-I3) and xfroot/patch1(V5-I7) from
	   comp.sources.x. These are available via anonymous ftp from
	   uunet.uu.net. While they will eventually be found there in
	   comp.sources.x/volume5, as of this writing they are in
	   comp.sources.x/new/890924.0.Z and 890929.0. If you don't
	   have ftp access to uunet.uu.net, I will be happy to mail
	   a copy (~700 lines.)
	2. Install xfroot on the client to be tested, taking care
	   that you have verified the definition of HZ in xfroot.c.
	   (See the README.)
	3. Make the following two runs:

	      xfroot -a 0.1 -b 0.1 -c 0.1    (lower bound)
	      xfroot -a 3000 -b 3000 -c 3000 (upper bound)

Please mention any details (compilers/libraries/options) you think are
relevant.

-------------------------
Ed Kubaitis (ejk@ux1.cso.uiuc.edu)
Computing Services Office - University of Illinois, Urbana

ejk@ux1.cso.uiuc.edu (Ed Kubaitis) (11/12/89)

Here is the 6th updated list of xfroot fractal-points/processor_second 
measured on various clients. The number, a count of trips/second
through the 9 line "hopalong" loop in xfroot, is a rough index of scalar 
double-precision floating point uniprocessor speed. The lower number 
represents a case where nearly all points are in-range and thus require
additional integer arithmetic, bit manipulation, and memory accesses to
record the point. The higher number reflects a case when most points are
out of range and most time is spent in floating point arithmetic.  
   
      Key:   () : Vax 780 equivalents
             *  : For a single processor
             +  : Using hardware square root
	     >  : New since last posting

      304000 (56.2)   619000(100.3)*     Cray 2    (scc)         
      316000 (58.4)   476000 (77.1)*     Cray Y-MP (scc)         
      283000 (52.3)   415000 (67.3)*     Cray X-MP (scc)         
      185000 (34.2)   263000 (42.6)*+  > Apollo DN10000 (See note below)
      143000 (26.4)   195000 (31.6)*+    ETA-10 G                
      157000 (29.0)   194000 (31.4)*     Cray X-MP (cc)          
      129000 (23.8)   183000 (29.7)*     Cray 2    (cc)          
      174000 (32.2)   182000 (29.5)*   > Amdahl 5990             
      115000 (21.3)   170000 (27.6)*   > Apollo DN10000 (-D_BUILTINS)
      117000 (21.6)   151000 (24.5)*+    Convex C2 (gcc)         
      108000 (20.0)   144000 (23.3)      SGI Iris 4D/240 (-lfastm)
      108000 (20.0)   138000 (22.4)*+    Convex C2 (vc3/fastmath)
       99000 (18.3)   118000 (19.1)*+    Convex C2 (vc3)         
       95000 (17.6)   115000 (18.6)      DEC DS5800              
       89000 (16.5)   111000 (18.0)      SGI Iris 4D/240         
       73000 (13.5)    94000 (15.2)+     Sun 4/370 (f77/libm.i1) 
       66000 (12.2)    92000 (14.9)      HP9000/835CHX           
       78000 (14.4)    92000 (14.9)    > Sony NWS-3860           
       77000 (14.2)    91000 (14.7)      DEC DS5400              
       58000 (10.7)    75000 (12.2)      DEC DS3100              
       61000 (11.3)    70000 (11.3)      Tektronix XD88/30       
       52000  (9.6)    69000 (11.2)+     Sun 4/280               
       58000 (10.7)    67000 (10.9)      Solbourne Series5 Cypress  
       49000  (9.1)    60000  (9.7)*     Gould NP1               
       50000  (9.2)    57000  (9.2)      DEC Vax 6400 (vcc)      
       49000  (9.1)    55000  (8.9)*     Convex C2 (vc2)         
       45000  (8.3)    54000  (8.8)      SGI Iris 4D/70-GT       
       43000  (7.9)    53000  (8.6)    > Sun SPARCstation 1 (see note below)
       42000  (7.8)    48000  (7.8)      Sun 4/370 (libm.i1)     
       41000  (7.6)    47000  (7.6)*     Convex C2 (cc)          
       41000  (7.6)    47000  (7.6)      Sun 4/370               
       28000  (5.2)    33000  (5.3)      Dec Vax 8650            
       28000  (5.2)    33000  (5.3)    > Stellar GS 2000 (-O2)   
       26500  (4.9)    30300  (4.9)    > Mac II (w/ Siclone 3033)
       20800  (3.8)    28900  (4.7)      Sun SPARCstation 1 (see note below)
       24000  (4.4)    28000  (4.5)      HP9000/370 (ffpa)       
       24700  (4.6)    27800  (4.5)      Sun SPARCstation 1 (gcc)
       22800  (4.2)    27100  (4.4)      Titan                   
       22900  (4.2)    26100  (4.2)      DEC MV3900 (vcc)        
       19900  (3.7)    25200  (4.1)      Sun SPARCstation 1          
       17200  (3.2)    24200  (3.9)      DG AViiON (88k 16.7 MHz)
       22300  (4.1)    23700  (3.8)      386/33 + 387   (cc 386/ix)
       21100  (3.9)    23600  (3.8)      Sun 4/260               
       20100  (3.7)    23400  (3.8)*     Sequent Symmetry (fpa)  
       19700  (3.6)    23200  (3.8)      Dec Vax 8530            
       21000  (3.9)    23000  (3.7)      Sun 4/280               
       19700  (3.6)    22400  (3.6)      Dec Vax 8600            
       19500  (3.6)    21600  (3.5)    > Apollo DN4500 (-D_BUILTINS)
       16800  (3.1)    19200  (3.1)      DEC Vax 6220            
       16800  (3.1)    17600  (2.9)      386/33 + 387 (gcc 1.35) 
       15400  (2.8)    17500  (2.8)      DEC MV3200 (vcc)        
       15200  (2.8)    17400  (2.8)      IBM RT 135 (-f2 -lfm)   
       14500  (2.7)    17400  (2.8)      DEC MV3600 (vcc)        
       15900  (2.9)    17300  (2.8)      HP9000/370              
       13800  (2.6)    16100  (2.6)    > Apollo DN3550 (-D_BUILTINS)
       13900  (2.6)    16000  (2.6)      IBM RT125 (afpa)        
       13800  (2.6)    15900  (2.6)    > Apollo DN3500 (-D_BUILTINS)
       13700  (2.5)    15200  (2.5)      HP9000/360              
       13200  (2.4)    15200  (2.5)      DEC Vaxserver 3500      
       13000  (2.4)    15100  (2.4)      Dec Vaxstation 3100     
       14000  (2.6)    14800  (2.4)      Sun 386i/250 Weitek (cc)
       12900  (2.4)    14000  (2.3)      Sun 3/60 (-O4 lib/f68881)
       11900  (2.2)    13200  (2.1)    > Apollo DN4000 (-D_BUILTINS)
       11000  (2.0)    12900  (2.1)    > Apollo DN2500 (-D_BUILTINS)
       10500  (1.9)    12700  (2.1)      Sun 3/50 (gcc 68881)     
        9700  (1.8)    12100  (2.0)    > Mac II                  
       10600  (2.0)    11500  (1.9)      IBM RT 135               
       10500  (1.9)    11500  (1.9)      HP9000/350               
        9900  (1.8)    10500  (1.7)*     Sequent Symmetry         
        9200  (1.7)     9700  (1.6)      IBM RT 115 (4.3BSD High C 2.1)
        8000  (1.5)     8750  (1.4)      Sun 3/60 (-f 68881)      
        7930  (1.5)     8670  (1.4)    > HP9000/340                
        7000  (1.3)     8200  (1.3)      386/25 + 387   (cc 386/ix)
        7300  (1.3)     8000  (1.3)      IBM RT 115 (4.3BSD High C 1.4)
        7280  (1.3)     7910  (1.3)      HP9000/330 (HP-UX 6.5 cc) 
        7200  (1.3)     7600  (1.2)      IBM RT 125                
        5530  (1.0)     6330  (1.0)      DEC Vaxstation 2000/vcc   
        5730  (1.1)     6230  (1.0)      HP9000/330                
        6000  (1.1)     6200  (1.0)      386/25 + 387   (gcc)      
        5410  (1.0)     6170  (1.0)      DEC Vax 780               
        5580  (1.0)     6150  (1.0)      HP9000/320                
        5560  (1.0)     6120  (1.0)    > Apollo DN3000 (-D_BUILTINS)
        5480  (1.0)     6080  (1.0)      Sun 3/50 (-f 68881)       
        4670  (0.9)     5530  (0.9)      DEC Vaxstation 2000       
        4160  (0.8)     5210  (0.8)      DEC MVII   (cc)           
        4080  (0.8)     5070  (0.8)      DEC MVII   (vcc)          
        1960  (0.4)     2060  (0.3)      Sun 3/60                  
        1270  (0.2)     1330  (0.2)      Sun 3/50                  
         ???  (???)      950  (0.2)      Sun 3/160 (no fpa)        
         530  (0.1)      560  (0.1)      Sun 2/120 (no fpu - cc) 
         340  (0.1)      360  (0.1)      DEC Vax 730               
         259  (0.0)      260  (0.0)      386/25 (386/ix - no 387)  

A few notes on the results: 
     
     o The top DN10000 timings used the PRISM 6.7(359) compiler with
       the following options: -opt 4 -cpu a88k -def sqrt=_builtin_sqrt
       -def fabs=_builtin_fabs.

     o Two SPARCstation results using sqrt.i1 and libm.i1 were reported.
       The only difference appeared to be that the faster one was compiled
       and linked in one step. Can anyone enlighten us on this?

     o The Cray scc compiler uses the same backend as their Fortran.

     o gcc enhancements are due to inline code for sqrt & fabs.

     o Strikingly different results for the same system show that it pays to 
       shop around for the best compiler/options/libraries available.

Thanks to:  archer@sgi.com, bauer@loligo.cc.fsu.edu, bav@hobbes.ksu.ksu.edu, 
bryan%kewill@uunet.uu.net, bt@irfu.se, casey@gauss.llnl.gov, csmith@convex.com,
csu@alembic.acs.com, eric@geology.tn.cornell, dave@rutgers.edu, 
david@torsqnt.uucp, evans@decvax.dec.com, garyc@quasi.wv.tek.com, 
glenn@mathcs.emory.edu, harrison@decwrl.dec.com, hleroy@erisa.fr, 
howard@aic.hrl.hac.com, hrp@boring.cray.com, idallen@watgcl.waterloo.edu, 
jpb@sn2024.cray.com, jw@pan.uucp, ken@cs.toronto.edu, kline@ux1.cso.uiuc.edu, 
ksp@maxwell.nde.swri.edu, kucharsk@uts.amdahl.com, lnz@lucid.com,
mark@zok.uucp,
markw@airgun.wg.waii.com, michael@ws.sony.co.jp, moraes@csri.toronto.edu, 
paul@db0tui66.bitnet, rauletta@gmuvax2.gmu.edu, skam@solbourne.com, 
sommerfeld@apollo.com, steved@longs.lance.colostate.edu, tac@csl.ncsu.edu, 
thp@westhawk.uucp, tony@popserver.stanford.edu, tpf@jdyx.uucp,
wesommer@athena.mit.edu, zimet@sequoia.berkeley.edu, for sharing their results.
(Please assume the standard disclaimers for all.)

I would appreciate hearing about measurements on other clients, or results
differing significantly from those above.  To perform your own:

	1. Get xfroot/part01 (V5-I3) and xfroot/patch1(V5-I7) from
	   comp.sources.x. These are available via anonymous ftp from
	   uunet.uu.net. While they will eventually be found there in
	   comp.sources.x/volume5, as of this writing they are in
	   comp.sources.x/new/890924.0.Z and 890929.0. If you don't
	   have ftp access to uunet.uu.net, I will be happy to mail
	   a copy (~700 lines.)
	2. Install xfroot on the client to be tested, taking care
	   that you have verified the definition of HZ in xfroot.c.
	   (See the README.)
	3. Make the following two runs:

	      xfroot -a 0.1 -b 0.1 -c 0.1    (lower bound)
	      xfroot -a 3000 -b 3000 -c 3000 (upper bound)

Please mention any details (compilers/libraries/options) you think are
relevant.

-------------------------
Ed Kubaitis (ejk@ux1.cso.uiuc.edu)
Computing Services Office - University of Illinois, Urbana

ejk@ux1.cso.uiuc.edu (Ed Kubaitis) (11/20/89)

The 7th update of xfroot timings is available. These timings provide a rough 
index of scalar double-precision floating point uniprocessor speed on dozens
of X clients spanning three orders of magnitude in processor speed. In addition
to many new and revised timings, this update includes information on under-
estimates of faster systems identified in previous xfroot timings. The update
is available:

   1. Via anonymous ftp to uxc.cso.uiuc.edu in pub/xfroot/timings 

   2. Via email request to ejk@ux1.cso.uiuc.edu. Indicate if you wish to 
      be on a mailing list for future updates.

-------------------------
Ed Kubaitis (ejk@ux1.cso.uiuc.edu)
Computing Services Office - University of Illinois, Urbana

ejk@ux1.cso.uiuc.edu (Ed Kubaitis) (11/27/89)

The 8th update of xfroot timings is available. These timings provide a rough 
index of scalar double-precision floating point uniprocessor speed on dozens
of X clients spanning three orders of magnitude in processor speed. This update
has new or revised timings for the Cray Y-MP, Stardent 3000, Titan, Gould NP1,
Stellar GS2000 and Tektronix XD88/10. The timings are available:

   1. Via anonymous ftp to uxc.cso.uiuc.edu in pub/xfroot/timings 

   2. Via email request to ejk@ux1.cso.uiuc.edu. Indicate if you wish to 
      be on a mailing list for future updates.

-------------------------
Ed Kubaitis (ejk@ux1.cso.uiuc.edu)
Computing Services Office - University of Illinois, Urbana

ejk@ux1.cso.uiuc.edu (Ed Kubaitis) (12/11/89)

The 9th update of xfroot timings is available. These timings provide a rough 
index of scalar double-precision floating point uniprocessor speed on dozens
of X clients spanning three orders of magnitude in processor speed. This update
has new or revised timings for the Stardent 3000, Titan, and GS2000, Convex C2
with "ESP" feature, MIPS R2000, IBM RT135/EAFPA, ISI V24K, and Vax 750.  The 
timings are available:

   1. Via anonymous ftp to uxc.cso.uiuc.edu in pub/xfroot/timings 

   2. Via email request to ejk@ux1.cso.uiuc.edu. Indicate if you wish to 
      be on a mailing list for future updates.

ejk@ux1.cso.uiuc.edu (Ed Kubaitis) (12/18/89)

The 10th update of xfroot timings is available. These timings (by over 50
contributors in 8 countries) provide a rough index of scalar double-precision 
floating point uniprocessor speed on dozens of X clients spanning three orders
of magnitude in processor speed. New or revised in this update: MIPS M/120, 
M/2000, M/180; DEC 5810, 3100, 8650, 3900, & 6280; CADMUS 68020, R2000 and
80860 based systems.  The timings are available:

   1. Via anonymous ftp to uxc.cso.uiuc.edu in pub/xfroot/timings 

   2. Via email request to ejk@ux1.cso.uiuc.edu. Indicate if you wish to 
      be on a mailing list for future updates.

ejk@ux1.cso.uiuc.edu (Ed Kubaitis) (01/07/90)

The 11th update of xfroot timings is available. These timings provide a rough
index of scalar double-precision floating point uniprocessor speed. The timings
have been reported by over 50 contributors in 8 countries on dozens of systems 
(most of them X clients) spanning three orders of magnitude in processor speed.
New or revised in this update: Cray Y-MP; Convex C2 ESP; DEC 5810; Solbourne 
Cypress; MIPS RC2030; Alliant FX/80; NeXT; Pyramid 98x, 90x.  The timings 
are available:

   1. Via anonymous ftp to uxc.cso.uiuc.edu in pub/xfroot/timings 

   2. Via email request to ejk@ux1.cso.uiuc.edu. Indicate if you wish to 
      be on a mailing list for future updates.

ejk@ux1.cso.uiuc.edu (Ed Kubaitis) (02/23/90)

The 12th update of xfroot timings is available. These timings provide a rough
index of scalar double-precision floating point uniprocessor speed. The timings
have been reported by over 50 contributors in 8 countries on dozens of systems 
(most of them X clients) spanning three orders of magnitude in processor speed.
New or revised in this update: IBM RS/6000 320 and 530, Solbourne Cypress,
Sun 4/390, Sun 4/60, Cadmus 9933/RC, MIPS M-120.  The timings are available:

   1. Via anonymous ftp to uxc.cso.uiuc.edu in pub/xfroot/timings 

   2. Via email request to ejk@ux1.cso.uiuc.edu. Indicate if you wish to 
      be on a mailing list for future updates.

ejk@ux1.cso.uiuc.edu (Ed Kubaitis) (03/04/90)

The 13th update of xfroot timings is available. These timings provide a rough
index of scalar double-precision floating point uniprocessor speed. The timings
have been reported by over 60 contributors in 8 countries on dozens of systems 
(most of them X clients) spanning three orders of magnitude in processor speed.

New in this update: Multiflow Trace 14/300, IBM PS/2 (386 & 486), Sun 3/460, 
Sequent Balance (NS32000). The timings are available:

   1. Via anonymous ftp to uxc.cso.uiuc.edu in pub/xfroot/timings 

   2. Via email request to ejk@uxh.cso.uiuc.edu. Indicate if you wish to 
      be on a mailing list for future updates.

ejk@ux1.cso.uiuc.edu (Ed Kubaitis) (03/18/90)

The 14th update of xfroot timings is available. These timings provide a rough
index of scalar double-precision floating point uniprocessor speed. The timings
have been reported by over 60 contributors in 8 countries on dozens of systems 
(most of them X clients) spanning three orders of magnitude in processor speed.

New or revised in this update: Sun 3/470, 3/80, 3/260; Bull DPS 9000, DPS 8/49;
IBM RS/6000 320. The timings are available:

   1. Via anonymous ftp to uxc.cso.uiuc.edu in pub/xfroot/timings 

   2. Via email request to ejk@uxh.cso.uiuc.edu. Indicate if you wish to 
      be on a mailing list for future updates.