[comp.lang.c] f2c experience

bglenden@mandrill.cv.nrao.edu (Brian Glendenning) (12/07/90)
I'm the fellow who started the most recent reincarnation of the
Fortran vs C discussion - I'm afraid that discussion digressed more
than I wanted it to and I'm sorry about that.

Anyway, I decided to run our package through f2c. Here's a little note
I wrote up about my experience.

Brian
===========================================================================

I have compiled the subset of AIPS (Astronomical Image Processing
System) required to run our benchmakrking/validation suite with the
Fortran-to-C converter (f2c) from AT&T (this code is publicly
redistributable).

The compiled subset (about 7% by number of programs) consisted of
165,000 lines of Fortran and 6700 lines of C (to handle the OS
interface, signals, etc). I ran it on a Sun/4 110
(mandrill.cv.nrao.edu) running SunOS 4.1 and the 15OCT90 version of
AIPS (256kword = 1MB "core" size).

The only changes required to get AIPS to compile were to change some
variable names from "REAL" to something else in about a dozen
routines. Although legal f77, this is probably a bad practice in any
event. Since AIPS has been known to break many compilers in the past I
think this speaks highly of the quality of f2c.

The bulk of the code was compiled with no optimization, while the most
numerically intensive portions of the code (the so-called Q routines)
were compiled with Sun's cc -fsingle -O4.  For the Fortran comparison
the compilation was the same aside from -fsingle which is
meaningless for Fortran.

The resulting f2c code passed the verification suite with flying
colours. This surprised me a bit since I thought that we might run
into parentheses grouping problems since Sun cc is a K&R compiler and
I didn't specify the flags to force f2c to follow Fortran evaluation.

		    Small (256^2) DDT f2c Results

Task     What                           Correct bits RMS       Correct bits worst pixel
UVMAP    gridded FFT imaging            I=19, B=17*            I=10, B=10
APCLN    "clean" deconvolution          21                     14
APRES    deconvolution residuals        22                     17
MXMAP    gridded FFT imaging            I=19, B=20             I=14,B=14
MXCLN    "clean" deconvolution          18                     14
VTESS    Maximum entropy deconvolution  27                     20
                                        *I,B = Image, Beam

UVSRT    Disk Sort of ungridded data    Pass
ASCAL    Self calibration of "closure"  Pass
         errors

These numbers are typical of what we find when bringing up AIPS on any
new system.

The timings were very interesting (CPU times only - although the system
was fairly unloaded it wasn't completely so):

Task    f2c(s)    f77(s)     f2c/f77
UVSRT   12        10         1.20
UVMAP   30        28         1.07
APCLN   408       350        1.17
APRES   20        17         1.18
ASCAL   285       176        1.62
MXMAP   48        40         1.20
MXCLN   688       463        1.49
VTESS   119       90         1.32
TOTAL   1610      1174       1.37

I believe that ASCAL's speed may be ascribable to the fact that with
the options I have chosen sin/cos are probably not inlined (this is
correctable).

More interesting is why MXCLN and APCLN, which do fundamentally the
same thing, run at different rates under f2c. I have no answer to this
now.

What can we conclude from this? Well, the obvious thing is that it
works and we can make AIPS run on machines without Fortran compilers.
Next, I believe that the above performance numbers could be increased
with modest amounts of effort, even with changes as trivial as
compiler command line options. If this is true it may have important
consequences in how we direct "new AIPS." I think we should consider
pursuing this experiment on more interesting machines such as the
Convex and the IBM workstation (or even with gcc on Suns).

Brian Glendenning, 12/6/90.
--
       Brian Glendenning - National Radio Astronomy Observatory
bglenden@nrao.edu          bglenden@nrao.bitnet          (804) 296-0286