rlr@utastro.UUCP (Randy Ricklefs) (06/06/88)
We will soon start porting software from a number of different sources to a
micro-processor-based UNIX system.  Most of the code is in various dialects
of FORTRAN.  The question that arises is whether to convert all this stuff
to a more modern language, particularly c.  This suggests several questions
which I hope the NET readers will help answer (without starting a 100-years
war on languages ... please! :-) ):
1) Are people actually initiating new projects in FORTRAN, or are they
	maintaining and porting old FORTRAN code?
2) Does the answer to 1) change if we restrict discussion to PC's & Macs?
3) Is c a suitable replacement for FORTRAN in terms of mathematical capabil-
	ities and portablility?
4) Are there reliable FORTRAN to c translators available under MS-DOS or UNIX
	that will allow moving over to c without re-coding the world?
Thanks in advance for the help!
Randy
-- 
                       Randy Ricklefs
       uucp:  {ut-sally, ut-ngp, noao, charm}!utastro!rlr
  arpa:  rlr@utastro.UTEXAS.ARPA    phone:  (512) 471-1342jlg@beta.UUCP (Jim Giles) (06/07/88)
In article <2742@utastro.UUCP>, rlr@utastro.UUCP (Randy Ricklefs) writes: > We will soon start porting software from a number of different sources to a > micro-processor-based UNIX system. Most of the code is in various dialects > of FORTRAN. The question that arises is whether to convert all this stuff > to a more modern language, particularly c. This suggests several questions > which I hope the NET readers will help answer (without starting a 100-years > war on languages ... please! :-) ): C is not all that much more 'modern'. It was developed more recently. Fortran is a higher-level language than C (as admitted even by Ritchie). > 1) Are people actually initiating new projects in FORTRAN, or are they > maintaining and porting old FORTRAN code? Most code (new and old) in major scientifc computing environments is Fortran, even when C is available. Whether this is because Fortran is actually superior, or just programmer inertia is a qeustion that spurs that 100-years war you complained about. > 2) Does the answer to 1) change if we restrict discussion to PC's & Macs? I write Fortran code for PC's. I don't write C code for PC's. I prefer to write Modula-2 code for PC's. I like writing SmallTalk code for PC's, but the executables are too slow. I know some people who write Fortran exclusively - on PC's as elsewhere. > 3) Is c a suitable replacement for FORTRAN in terms of mathematical capabil- > ities and portablility? On UNIX and UNIX-like systems, the Fortran support library is written almost entirely in C. The only real problem with C is the lack of the concept of an array (such references are always turned into pointers). This makes some (admittedly rare) constructs difficult to do in C. > 4) Are there reliable FORTRAN to c translators available under MS-DOS or UNIX > that will allow moving over to c without re-coding the world? Depends on what you mean by 'reliable'. If your definition doesn't include efficiency as a requirement, there are probably some such tools. > Thanks in advance for the help! I'm afraid I didn't help much. Without specifics on the purpose and size of the code you want to move, I can't really advise you on the desirability of converting it. (Nor can anyone else.) J. Giles Los Alamos
jerry@violet.berkeley.edu ( Jerry Berkman ) (06/08/88)
In article <20008@beta.UUCP> jlg@beta.UUCP (Jim Giles) writes: >In article <2742@utastro.UUCP>, rlr@utastro.UUCP (Randy Ricklefs) writes: >> We will soon start porting software from a number of different sources to a >> micro-processor-based UNIX system. Most of the code is in various dialects >> of FORTRAN. The question that arises is whether to convert all this stuff >> to a more modern language, particularly c. >> 3) Is c a suitable replacement for FORTRAN in terms of mathematical capabil- >> ities and portablility? C is missing many useful intrinsics. I have trouble taking a language seriously for numeric computation if the language doesn't even know about absolute values, e.g.: x = abs(x); or signed numbers, e.g. x = +5; Both statements are illegal in C. And what about: i = j**k In C this becomes: i = pow( (double)j, (double)k); For that matter, how do you translate y = x**6 from FORTRAN to C? Most FORTRAN compilers will optimize this using only 3 multiplies. If you translate it to C as y = x*x*x*x*x*x, then there is a chance the compiler will do as well as FORTRAN, but I wouldn't bet on it. And if you use the translation y = pow(X,5.0), you have an out-of-line procedure call plus an expensive evaluation. Actually, C does not have a concept of intrinsic at all. In the FORTRAN statement: x = sqrt(y) you are guaranteed by the standard that the square root intrinsic is invoked (unless there is an external statement). By contrast, the C statement: x = sqrt(y); does not guarantee anything of the kind. A user defined function could be called, or "sqrt" could be a define. For sqrt(), this is not usually a problem. But I'd be a lot less sure of other names like acos or sinh. For that matter, since sqrt() is a FORTRAN intrinsics, I get an error message from any decent FORTRAN compiler for sqrt(integer). None from C of course (running lint is a hassle). Also, I like the semantics of sqrt() under FORTRAN better: on all FORTRAN systems I have used, if y = -1., the result is a fatal error. In C, it depends on what the library does. I have used C libraries which returned sqrt(abs(y)), others which return 0, and 4.3 BSD on a VAX returns the reserved operand. >On UNIX and UNIX-like systems, the Fortran support library is written >almost entirely in C. The 4.3 BSD math libraries for the VAX are written mostly in assembler. That the 4.3 BSD VAX FORTRAN I/O library is written in C using standard I/O may be why it is so slow. The VAX math library is about as fast as the VAX VMS math library while the BSD I/O library is up to 3 or 4 times as slow as the VAX VMS I/O library. Much of the Cray UNICOS math and I/O support libraries are written in assembler. - Jerry Berkman Central Computing Services, U.C. Berkeley jerry@violet.berkeley.edu
gil@limbic.UUCP (Gil Kloepfer Jr.) (06/08/88)
In article <2742@utastro.UUCP> rlr@utastro.UUCP (Randy Ricklefs) writes: |>We will soon start porting software from a number of different sources to a |>micro-processor-based UNIX system. Most of the code is in various dialects |>of FORTRAN. The question that arises is whether to convert all this stuff |>to a more modern language, particularly c. This suggests several questions |>which I hope the NET readers will help answer (without starting a 100-years |>war on languages ... please! :-) ): No problem...I hate those wars myself (being a fan of both languages!). |>1) Are people actually initiating new projects in FORTRAN, or are they |> maintaining and porting old FORTRAN code? Both. There are still a number of research groups who are solve mathematical problems in FORTRAN (simply because it's all they know). This code is many times proprietary, and the authors simply do not care to let anyone port the existing code. There are also a LOT of statistical (again mathematical) and graphics software in FORTRAN which is so large that it would be a great expense to port. It is therefore maintained (ported if the need is there). |>2) Does the answer to 1) change if we restrict discussion to PC's & Macs? Yes and no. It all depends on cost, although personally I would look more towards porting if I had the PC/Mac marketplace in mind. |>3) Is c a suitable replacement for FORTRAN in terms of mathematical capabil- |> ities and portablility? C can do almost anything. The question is with how much trouble. FORTRAN, in my book, handles mathematical stuff MUCH easier and clearly than C. As stated in a past net-debate (no flames please if it is slightly out of context), C lacks some of the array flexibility that FORTRAN has (although with FORTRAN, you need to know how big the arrays are, with C you can allocate memory at run-time). Once you have successfully ported a FORTRAN-based mathematical procedure to C, you can rest assured that the program is pretty-much portable (I am at a loss for a system it wouldn't be portable to). |>4) Are there reliable FORTRAN to c translators available under MS-DOS or UNIX |> that will allow moving over to c without re-coding the world? I have heard of some, but I haven't used any and don't know how well they work. |>Thanks in advance for the help! You're welcome. NOTE: The final decision on which language to use for your application should not rest on which is more "modern." There are FORTRAN compilers for almost every computer in existance nowadays. Whether you code in C or FORTRAN should rest in the application and what it does. Further, the conversion costs should be considered as well as portability when making the decision to convert. Standard (F77) FORTRAN that isn't I/O intensive will generally compile and run on any machine with the same compiler. There are people that still develop FORTRAN-based applications, but the number is decreasing because much of the mathematical stuff can be "black-boxed" (and possibly just that module written in FORTRAN), while the rest of the system (including file handling and user-interface) can be more easily (and elegantly) written in C. |> Randy Ricklefs uucp: {ut-sally, ut-ngp, noao, charm}!utastro!rlr |> arpa: rlr@utastro.UTEXAS.ARPA phone: (512) 471-1342 +------------------------------------+----------------------------------------+ | Gil Kloepfer, Jr. | Net-Address: | | ICUS Computer Group, Systems Dvlp. | {boulder,ihnp4,talcott}!icus!limbic!gil| | P.O. Box 1 | Voice-net: (516) 968-6860 | | Islip Terrace, New York 11752 | Othernet: limbic!gil@icus.UUCP | +------------------------------------+----------------------------------------+
grimlok@hubcap.UUCP (Mike Percy) (06/08/88)
From article <10655@agate.BERKELEY.EDU>, by jerry@violet.berkeley.edu ( Jerry Berkman ): > > C is missing many useful intrinsics. I have trouble taking a language > seriously for numeric computation if the language doesn't even know about > absolute values, e.g.: x = abs(x); or signed numbers, e.g. x = +5; x =+ 5 used to be an example of a C operator which is now x += 5. Most compilers I've used warned me about an obsolete operator or some such. Just write is as x = +5 rather than x =+5 and things should be fine. x = abs(x) is provided by virtually every math library. > For that matter, how do you translate y = x**6 from FORTRAN to C? For that matter, how do you translate *p %= 2 from C to FORTRAN? > For that matter, since sqrt() is a FORTRAN intrinsics, I get an error > message from any decent FORTRAN compiler for sqrt(integer). None from C > of course (running lint is a hassle). "Gosh Dad, why do those airline pilots have to run through that checklist, it's such a hassle" "Well, Beaver, if they didn't, they might miss some small detail that could cause a crash" "Geeee" > > Also, I like the semantics of sqrt() under FORTRAN better: on all > FORTRAN systems I have used, if y = -1., the result is a fatal error. > In C, it depends on what the library does. I have used C libraries > which returned sqrt(abs(y)), others which return 0, and 4.3 BSD on a > VAX returns the reserved operand. > Which is why you have the freedom to decide exactly what you want by redefining the function slightly, maybe with a #define. Also, if you like FORTRAN math stuff, why not call functions from a FORTRAN library. NAG comes to mind. It's easy enough to do. FORTAN does have an extreme advantage over C in one respect -- vectorizability (is that a word?) of certain things (some array processing for example). If I remember right, this was one of the things that noalias was supposed to help change.
eric@snark.UUCP (Eric S. Raymond) (06/08/88)
In article <2742@utastro.uucp>, rlr@utastro.UUCP (Randy Ricklefs) writes: > 1) Are people actually initiating new projects in FORTRAN, or are they > maintaining and porting old FORTRAN code? New projects are still being initiated in FORTRAN. Lots of old FORTRAN code is still being maintained, too -- FORTRAN is still the computer lingua franca of the engineering and scientific world. > 2) Does the answer to 1) change if we restrict discussion to PC's & Macs? I am not aware of a FORTRAN for the Mac. Good FORTRANs do exist for the PC/PS machines (but beware of Microsoft's, which is said to be slow and buggy!). Sorry for not giving references, but it's been many years since I hacked FORTRAN myself. > 3) Is c a suitable replacement for FORTRAN in terms of mathematical capabil- > ities and portablility? Pre-ANSI C wasn't, quite -- the promote-everything-to-double lossage in K&R and lack of complex-number support caused problems. ANSI C solves the first problem, and C++ the second. A perhaps more serious problem is the lack of a C-portable equivalent of LINPACK (though the UNIX C math libraries cover a fair amount of ground). > 4) Are there reliable FORTRAN to c translators available under MS-DOS or UNIX > that will allow moving over to c without re-coding the world? I have seen ads for FORTRAN to K&R-C translators. I expect FORTRAN to C++ translaters with COMPLEX-to-complex mapping will be along any month now if they're not already here. Despite problems pointed out under 3), C has gained a lot of ground at former FORTRAN strongholds during the last five years; it is much easier to code in and maintain. The rise of UNIX has of course helped it. Assuming you decide to convert -- *don't* try converting by hand! Assuming you can get a translator, be aware that none of them are perfect and expect the odd bug or two to turn up post-translation. I guess I can sum up my gut feel about the change best with a pair of lists: Indications *for* switching to C: 1. Your FORTRAN code base is relatively small (so you don't need the next five years to satisfy yourself that the translator code did a correct job!). 2. Your applications have a long expected lifetime. 3. The code needs to be extended and modified on a continuing basis. 4. The application wants to run on UNIX or MS-DOS boxes (for the forseeable future, the best compiler technology is going to go first into C on these). Indications *against* switching to C: 1. Translator-introduced bugs (that might be *very* subtle!) could kill people. 2. Your applications use COMPLEX (some translators may not handle this). 3. Your applications use LINPACK or other math functions not available in the C/UNIX world. 4. Your application uses EQUIVALENCE or COMMON in weird untranslatable ways. 5. Your installation is dominated by FORTRAN-head engineer/scientist types who would be reduced to a pathetic state of confusion by the change. You *must* budget time for a post-conversion desk check of the generated C! Lint the whole thing at absolute minimum. If floating-point computation is involved, have a numerical analyst scan for hotspots. I hope your circumstances are such that you can attempt this. I'm speaking only from theory and comparative knowledge of the two languages; I'd really like to see a how-we-did posting from a shop that actually took the plunge. -- Eric S. Raymond (the mad mastermind of TMN-Netnews) UUCP: {{uunet,rutgers,ihnp4}!cbmvax,rutgers!vu-vlsi,att}!snark!eric Post: 22 South Warren Avenue, Malvern, PA 19355 Phone: (215)-296-5718
eugene@pioneer.arpa (Eugene N. Miya) (06/09/88)
I have been asked to post this. Please send mail to him, not me. Thanks. --eugene ============mesg start=============== Some flames for thought ... I agree that Fortran is a somewhat primitive language and that a better language for scientific programming would be nice. I see two issues in making transition to something better. The first is the choosing of a suitable replacement language. The second is that of translating Fortran Dusty Decks. 1. Choice of Language I do lots of programming in both C and Fortran. I think C is not the answer. It's too low level. Multidimensioned arrays in C are hard to implement well and in a way that will be portable. One person's idea of how to implement multidimesioned arrays will differ from another's. Now try to write code that will handle both! YECH! Fortran 8X is not the answer. It's a "dirty" language (to put it mildly). This is my personal debatable opinion. The approch by the committee here seems to be "let's add stuff we think is neat and hope over the next 20 years that people decide on their own that they shouldn't use things like COMMON, EQUIVALENCE, and column oriented format." I don't see much chance of success in this approach. OK, How about Ada? I am _very_ new to this language but it seems to be able to handle much of what one might desire for scientific programming. Code writtin in Ada has a good chance of being portable. The problem with this language seems to be its complexity. It won't run nicely on a small (PC class) machine. I wonder whether a possible solution to this problem is to define a sublanguage of Ada for scientific computation. That is, maybe some features could be removed to make the language run on small machines and still be nice for scientific computation. Will removing the task handling and exception handling fix the complexity problem? 2. Tranlation What are the problem areas of tranlation of Fortran into other (more sane) languages (e.g., C, pascal, Ada)? A few problem areas I see are a. multidimensioned arrays. Fortran code is _often_ optimized to account for Fortran's columnwise storage of 2-D arrays. Pascal (and Ada, I assume) store rowwise. b. EQUIVALENCE c. COMMON Any more here? Matt Wette mwette%gauss@hub.ucsb.edu
jerry@violet.berkeley.edu ( Jerry Berkman ) (06/09/88)
This message is a reply to three of the people who commented on my previous message. wan@cory.Berkeley.EDU (Hank S. Wan) wants a FORTRAN type checker: > if FORTRAN is so great as in all the "RE: Should I convert FORTRAN code to C" > somebody must have a type checker so the following is not merrily accepted > by unix f77. > > PROGRAM bar > j = sub1( i ) > PRINT *, i, j > END > SUBROUTINE sub1( r ) > r = 1.0 > END > > lint would do the job for c code. (and NO, lint is not a hassle if you're > smart enough to use it often, and interpret the results right.) > Checking consistency in type is useful for any large project in any serious > language. Here is what IBM's VS FORTRAN, version 2, release 2 says about your sample program, when the ICA option is specified: VS FORTRAN VERSION 2 ENTERED. 13:04:37 **BAR** END OF COMPILATION 1 ****** **SUB1** END OF COMPILATION 2 ****** (W) CONFLICTING NAME USAGE -- THE NAME, SUB1, HAS BEEN REFERENCED AS A FUNCTION IN BAR(COMPILATION 1). IT IS DEFINED AS A SUBROUTINE IN SUB1(COMPILATION 2). (W) CONFLICTING TYPE - AT THE MAIN ENTRY TO SUBROUTINE, SUB1(COMPILATION 2), ARGUMENT NO. 1, "R", IS EXPECTED TO BE REAL*4; HOWEVER, BAR(COMPILATION 1), AT ISN 2 PASSES THIS ARGUMENT, "I", AS INTEGER*4. VS FORTRAN VERSION 2 EXITED. 13:04:40 Sorry, but it's not public domain. By the way, when you are asking for a type checker, you are commented on the implementation, not the language. If you want complete checking, try WATFOR77, which check not only for argument agreement, but also check subscripts, and for variables used before they are defined. Is there any C equivalent? swarbric@tramp.Colorado.EDU (Frank Swarbrick) comments: > To Jerry Berkman, > Gee, I just tried compiling x = +5; and x = abs(x); both in C and they both > worked. I just tried both on monet.berkeley.edu, where the most current distributed 4.3 BSD UNIX lives. The C compiler, given the program: main() { int i; i = +5; } comments: "junk.c", line 5: syntax error It may very well work in other compilers, and will probably be added in the new standard, but is not there now. > (Now I don't know why anyone would ever use x = +5, but I guess > that is not the point.) It could be output from a program which generates source code, or a person just likes symmetry. It does not seem unreasonable to ask a compiler to correctly compile it. As for "x = abs(x);", it does compile and load. However, it's calling a library function. What's wrong with that? (1) It's slow. (2) It returns the absolute value of integers. You can type it "float abs();" and it works on the VAX, but it may not work on all systems. The FORTRAN generic abs() which works on any numeric data type and generates in-line code is superior. grimlok@hubcap.UUCP (Mike Percy) says: >>> Also, I like the semantics of sqrt() under FORTRAN better: on all >> FORTRAN systems I have used, if y = -1., the result is a fatal error. >> In C, it depends on what the library does. I have used C libraries >> which returned sqrt(abs(y)), others which return 0, and 4.3 BSD on a >> VAX returns the reserved operand. >Which is why you have the freedom to decide exactly what you want by >redefining the function slightly, maybe with a #define. This makes it an act of faith in reading large programs as to what procedure calls do. I prefer intrinsics for common functions. - Jerry Berkman, U.C. Berkeley Computing Services jerry@violet.berkeley.edu
gwyn@brl-smoke.ARPA (Doug Gwyn ) (06/09/88)
The original poster was wondering if he should convert Fortran code to C. My answer to that is, not if you can get an adequate Fortran compiler for your system. If you think it would be useful, there is at least one automatic Fortran-to-C translator commercially available (Fortrix). If you plan to thoroughly revise your application (so that it amounts to a rewrite), you would be better off designing the replacement from scratch, and it might make sense to implement the replacement in C if you're not going to be dependent on EISPACK, IMSL, etc. Fortran's single biggest problem is that it has shitty support for data structures. If this doesn't bother you, then by all means continue to use Fortran. I think by now everyone who is likely to learn better techniques would have done so. It happens that many of our more interesting applications are primarily written in C, because Fortran is just too puny. And we do run them on our Crays as well as on small and medium sized systems. I find that with sufficient care, useful C code can be even more portable than Fortran, more efficient, and more maintainable. Of course with insufficient care, horrible code can be produced in ANY language. Use whatever tool seems right for the job and that you are comfortable using.
owen@wrs.UUCP (Owen DeLong) (06/09/88)
In article <2742@utastro.UUCP> rlr@utastro.UUCP (Randy Ricklefs) writes: >We will soon start porting software from a number of different sources to a >micro-processor-based UNIX system. Most of the code is in various dialects >of FORTRAN. The question that arises is whether to convert all this stuff >to a more modern language, particularly c. This suggests several questions >which I hope the NET readers will help answer (without starting a 100-years >war on languages ... please! :-) ): Ahh, Why not? Language wars are so much fun.... You get to tie up so many peoples telebits with such garbage, and everyone can flame you... :-) >1) Are people actually initiating new projects in FORTRAN, or are they > maintaining and porting old FORTRAN code? Some, but that's just because they haven't ported their programmers to c yet. >2) Does the answer to 1) change if we restrict discussion to PC's & Macs? Yes, more Mac and PC programmers have been ported to c than in the system 3 environment. (What, you mean someone besides IBM makes computers? :-) >3) Is c a suitable replacement for FORTRAN in terms of mathematical capabil- > ities and portablility? Absolutely not, for this, you need COBOL! :-)... Seriously, I wouldn't wish COBOL on my worst enemy. >4) Are there reliable FORTRAN to c translators available under MS-DOS or UNIX > that will allow moving over to c without re-coding the world? I haven't seen anything. There was a review of FORTRAN to LISP converters in Computer Language once, I think, and the author there basically said he hadn't seen any inter-language translator he found to be reliable. > >Thanks in advance for the help! > >Randy > I hope some of this is helpful, but I couldn't resist the opportunity to poke fun at a language that should have died before assembly was born. Owen Disclaimer: WRS won't give me execute access to cc yet.
swarbric@tramp.Colorado.EDU (Frank Swarbrick) (06/09/88)
To Matt Wette,
   Am I totally missing something, or is this not a multidimentional array:
int array[10][15];
??
Certainly there are different ways of accessing them (as an array, as points,
or both at the same time), but they are all portable.  If I have greatly
misunderstood something I'm sure I will get many corrections.
Frank Swarbrick (and, yes, the net.cat)           swarbric@tramp.Colorado.EDU
...!{ncar|nbires}!boulder!tramp!swarbric
"...This spells out freedom, it means nothing to me, as long as there's a PMRC"swarbric@tramp.Colorado.EDU (Frank Swarbrick) (06/09/88)
(I am really wondering if this belongs in comp.lang.fortran at all, but I am
not sure if Jerry Berkman reads comp.lang.c.  Sorry.)
Anyway, just because your outdated compiler doesn't compile x = +5 means only
that your compiler is outdated...  Or maybe even buggy, as it should not really
think it means x =+ 5.  And indeed, it doesn't, as it gave you a syntax error
instead of a warning saying you never initialized x.  Or maybe C used to not
accept this format; I don't know.
Anyway, I'm not disagreeing that it's good that FORTRAN can generate differnt
things for x = abs(x) depending on whether or not x is float, int, or whatever.
It's just in C the compiler does not know what a function looks like.  They
are just black boxes with a few restrictions.  C was made this way for a
reason, and I like it.  FORTRAN is different for a reason, and that's fine
too.
(Oh, by the way, if x is float or double use x = fabs(x);)
Frank Swarbrick (and, yes, the net.cat)           swarbric@tramp.Colorado.EDU
...!{ncar|nbires}!boulder!tramp!swarbric
"...This spells out freedom, it means nothing to me, as long as there's a PMRC"cjl@ecsvax.UUCP (Charles Lord) (06/09/88)
In article <22ad4e5b!4683@snark.UUCP>, eric@snark.UUCP (Eric S. Raymond) writes: > I am not aware of a FORTRAN for the Mac. Good FORTRANs do exist for the PC/PS > machines (but beware of Microsoft's, which is said to be slow and buggy!). Since when? perhaps you are confusing MS Fortran with MSC. I have used MS Fortran from 3.2 to 4.1 and have had little to no trouble. No ad, just a satisfied customer. Charles Lord cjl@ecsvax.UUCP
walter@garth.UUCP (Walter Bays) (06/10/88)
jlg@beta.UUCP (Jim Giles) writes: >Most code (new and old) in major scientific computing environments is >Fortran, even when C is available. Whether this is because Fortran is >actually superior, or just programmer inertia is a question that spurs >that 100-years war you complained about. paolucci@snll-arpagw.UUCP (Sam Paolucci) writes: >Ah! Most major scientific computing environment use CRAYs as their >CPU. Do you know of a good C compiler on the CRAY that produces code >of somewhat comparable performance as CRAY's Fortran compilers? >There, I suggest, is part of the answer. I would venture that most C compilers have not produced code that is as fast as most Fortran compilers. And even as the computer industry continues doubling hardware performance, scientists like Jim Giles immediately use the new power by expanding the range of problems studied - and ask for more. Software efficiency will always be important to them. jlg@beta.UUCP (Jim Giles) writes: >Here you are putting the cart before the horse. There was no C compiler >for Crays until recently because there was no demand for it. The present >demand is almost entirely from within Cray and from other (non-scientific) >Cray customers. Again, whether this is from inertia or valid preference >is a combative issue. A chicken and egg problem. IF excellent C compilers had been available, would they have used them? Probably so. IF user demand had been present, would Cray (and others) have provided C compilers as fast as the best Fortran compilers? Probably not. C provides lower level primitives which a programmer can use to write fast code. So the worst C compilers are better than the worst Fortran compilers. But C's liberal assumptions about memory volatility and aliasing have interfered with many optimizations that are fairly easy to do on Fortran. So the best Fortran compilers are better than the best C compilers. Hence we have ANSI C and the battles over volatile (in the language) and noalias (rejected). Chris Torek captured the essence of the 'volatile' debate in many dozens of notes. If you personally have no use for 'volatile', but would like to see C displace Fortran (perhaps to protect yourself from having to use Fortran :-), read Chris' entire note. chris@mimsy.UUCP (Chris Torek) writes (Subject: volatile: a summary): >Is the [volatile] keyword necessary? >No. (As an existence proof, consider compilers that do not now have >the keyword. Most of them work by not doing much in the way of >optimisation. There is more to it, though.) > Summary of the summary: >Is the concept necessary? Yes. >Is the keyword necessary? No. >Is the keyword useful? Yes. >Is the keyword a `good thing'? Opinion: yes. jlg@beta.UUCP (Jim Giles) continues: >I use Fortran mostly because I know how to use it WELL. To some >extent, the lack of detailed knowledge of C is my reason for not using >it. (For example: why is 'a+=b' faster than 'a=a+b'? Seems to me that >the two are identical and should generate the exact same code.) ... If you're a Fortran programmer and have not tried C because you KNOW its' too slow, look again. Many C compilers have been getting much better, even before ANSI. Any good compiler will generate the same code for those constructs. There's a lot of folk wisdom about how to get around the pcc code generator and write fast C code. (Absolutely essential to such things as writing systems code.) Unfortunately, much of this wisdom is counter-productive today, because it obfuscates the code and interferes with the work of the optimizer. (Example: indiscriminate 'register' declarations steal from the compiler's pool of allocatable registers.) swarbric@tramp.Colorado.EDU (Frank Swarbrick) writes: >As for C not generating inline code for stuff like x = pow(x, 2); that is >just not "C's way." It has been discussed a lot in comp.lang.c. Some people >think C should generate inline code for it, and some do not. So far, it >doesn't. C compilers are starting to do more procedure inlining. The widespread use of the Dhrystone benchmark has accelerated this trend, since inlining strcpy() and strcmp() produce much higher Dhrystone numbers. (The Dhrystone rules forbid inlining, but many people only care that the program executes correctly - and don't care what the compiler does.) On some RISC CPU's (not ours) multiplication is a procedure call that is sometimes inlined. Who cares? That affects performance, certainly, but the bottom line of performance is all that matters. jlg@beta.UUCP (Jim Giles) provides an appropriate postscript: >Fortran does some things better, C does others, and both contain deficiencies. -- ------------------------------------------------------------------------------ I said it, not them. E-Mail route: ...!pyramid!garth!walter (415) 852-2384 USPS: Intergraph APD, 2400 Geng Road, Palo Alto, California 94303 ------------------------------------------------------------------------------
mat@emcard.UUCP (Mat Waites) (06/10/88)
In article <6551@sigi.Colorado.EDU> swarbric@tramp.Colorado.EDU (Frank Swarbrick) writes: > > Am I totally missing something, or is this not a multidimentional array: >int array[10][15]; >?? I think the problem is with variable size multidimensional arrays. It's hard(er) to write functions in C to handle multidimensional arrays of varying size. People have come up with wacky techniques for doing it, but if there is no standard way, it's difficult to use in standard libraries, etc. Mat -- W Mat Waites | PHONE: (404) 727-7197 Emory Univ Cardiac Data Bank | UUCP: ...!gatech!emcard!mat Atlanta, GA 30322 |
platt@emory.uucp (Dan Platt) (06/10/88)
In article <5241@ecsvax.UUCP> cjl@ecsvax.UUCP (Charles Lord) writes: >In article <22ad4e5b!4683@snark.UUCP>,eric@snark.UUCP(Eric S. Raymond) writes: >> I am not aware of a FORTRAN for the Mac. Good FORTRANs do exist for the PC/PS >> machines (but beware of Microsoft's, which is said to be slow and buggy!). > >Since when? perhaps you are confusing MS Fortran with MSC. > >I have used MS Fortran from 3.2 to 4.1 and have had >little to no trouble. No ad, just a satisfied customer. > >Charles Lord >cjl@ecsvax.UUCP Actually, I've been quite impressed with the MS Fortran 4.10. It has a few bugs (I found one -- there are probably more), but I'm more tolerant of these than some people are (judging by the flames that some people have generated and pointed at microsoft). The products MS Fortran and MS C are very similar in quality, speed of compilation and speed of executable code. The code is quite fast for MS Fortran (it beat out MSC and Turbo C for some numerical procedures that I benchmarked), and all of these were much faster than the previous generation of products available for PC/s. But not to stray too far from the course, I feel that: 1) the issue is subjective to some extent (hence the lead in "I feel"). How well do you like fortran? How well do you like C? How well do the people you work with like Fortran and C? 2) Objective criterea would include: a) How well are the various languages supported in your environment? Does your system have a good C compiler that generates fast reliable code? Does it have good system support (libraries &c)? If you are doing PC work, there are good cheap C compilers that give you a great deal of control over your system resources at efficiencies that would have been nice some years ago. b) How well does your environment support fortran? In some cases, the fortran support is inferior to that provided for C. In some cases, such as IBM 370's, CRAY's, Cybers &c, the Fortran compiler will be the best choice. c) Who will be using the code? If it is engineers who are doing and have been doing mainly number crunching work, with little or no nead for dynamic data structure manipulation, then Fortran might be better, and if these people don't know C, it may be best to work in fortran --- save lots of lost time and money. There is lots of research being initiated in fortran and work being ported in fortran --- often dictated by criterea defined in 1, 2b, and 2c. The work I've been doing has included both fortran and C coding -- depending on the application and environment I'm working on --- some of it being duplicated for various reasons in both languages. I believe there's room for both languages, and use both. The rest depends on the above criterea. Dan Platt
bobal@microsoft.UUCP (Bob Allison) (06/10/88)
In article <2742@utastro.UUCP> rlr@utastro.UUCP (Randy Ricklefs) writes: >1) Are people actually initiating new projects in FORTRAN, or are they > maintaining and porting old FORTRAN code? >2) Does the answer to 1) change if we restrict discussion to PC's & Macs? I'd say that on PCs most of the stuff going on is porting. But that is because there is a lot of stuff on mainframes which people want to run on PCs and are finally getting the horsepower to do. I was actually surprised by how much new development is going on. >3) Is c a suitable replacement for FORTRAN in terms of mathematical capabil- > ities and portablility? Plenty of other postings arguing this issue. I will just toss in my opinion that you'll almost always get better execution time on a FORTRAN program than a comparable C program (see my next posting). >4) Are there reliable FORTRAN to c translators available under MS-DOS or UNIX > that will allow moving over to c without re-coding the world? > I've seen some ads in typical PC magazines. Buy a few and take a look. But be skeptical: call them up, ask them hard questions. Most of them will require at least some user intervention. Almost all of them generate crummy source, which you practically have to rewrite before you can do any serious maintenance. Be prepared to do some debugging. All of this boils down to: how much are you willing to invest to move to a "modern" language? Is execution speed important? What are the real reasons for switching? Are you going to have to maintain this later? Try to figure out the economics of the effort, then decide. How much time/effort/money are you going to save down the road by investing a bunch of it now? It might be better to spend the time doing something else. >Thanks in advance for the help! > We're always willing to offer opinions when it isn't our project :-)
bobal@microsoft.UUCP (Bob Allison) (06/10/88)
>In article <2960@emory.uucp> platt@emory.UUCP (Dan Platt) writes: >>In article <5241@ecsvax.UUCP> cjl@ecsvax.UUCP (Charles Lord) writes: >> >>[responding to gripe, says nice stuff about MS Fortran] > >Actually, I've been quite impressed with the MS Fortran 4.10. It has a few bugs >(I found one -- there are probably more), but I'm more tolerant of these than >some people are (judging by the flames that some people have generated and >pointed at microsoft). The products MS Fortran and MS C are very similar in >quality, speed of compilation and speed of executable code. The code is >quite fast for MS Fortran (it beat out MSC and Turbo C for some numerical >procedures that I benchmarked), and all of these were much faster than >the previous generation of products available for PC/s. > >But not to stray too far from the course... > [a good description of criteria for selection of FORTRAN deleted] > >I believe there's room for both languages, and use both. The rest depends >on the above criterea. > >Dan Platt Well, this isn't exactly an ad either, but I am the Lead for MS FORTRAN now. I have three basic comments: 1) It's nice to hear good things every now and then (let's not fill the net with endorsements or flames: I just wanted to say I appreciate it). 2) (This isn't directed at you, Don) I've been reading the net for at least two years and I hear a lot of people who find bugs in products. As a developer, nothing is more frustrating than hearing vague rumors of bugs, but never getting any solid evidence of such. Please, please, pretty please, take the effort to slog your way through whatever customer service is available for whatever product and try to report the bug. You're probably helping someone else, maybe helping yourself, and you give us job security (developers are often the worst about this). 3) MS FORTRAN and MS C use the same back end. This means that there are very few optimizations which one compiler performs that the other does not (none that I know of). Yet, Don says that FORTRAN sometimes ran faster than C. I've worked at other companies with similar compiler technology and I have found this to be the case for many benchmarks. We all know why, too. FORTRAN has a lot of problems, but it has a couple of advantages which are what has kept it around for so long and this is clearly one of them. PS: About porting FORTRAN code to C: one of the most obnoxious features of the port is that all REAL arguments become DOUBLE PRECISION (C always passes double). It doesn't hurt the answer, it just slows things down a WHOLE BUNCH on a lot of machines. Bob Allison (I got a bunch of positive response about X3J3 stuff, so I'll keep it up)
karl@haddock.ISC.COM (Karl Heuer) (06/11/88)
In article <10681@agate.BERKELEY.EDU> jerry@violet.berkeley.edu (Jerry Berkman) writes: >By the way, when you are asking for a type checker, you are commented on the >implementation, not the language. I'm glad you were the one to bring that up. Many of the issues you raise are not intrinsic to the language, either. >If you want complete checking, try WATFOR77, which check not only for >argument agreement, but also check subscripts, and for variables used >before they are defined. Is there any C equivalent? Yes, it's my understanding that some implementations of C have this. I haven't used them (though I've often threatened to write one). Certainly the language doesn't forbid it. >As for "x = abs(x);", it does compile and load. >However, it's calling a library function [which is slow]. That's an implementation issue. The better C compilers generate inline code. >(2) It returns the absolute value of integers [only]. The floating-point version is named fabs(). If you want to argue whether a single generic name is better than a family of typed functions, fine; but I think that the issue you raised (existence) is closed. >grimlok@hubcap.UUCP (Mike Percy) says [about the semantics of sqrt(-1.0)] >>Which is why you have the freedom to decide exactly what you want by >>redefining the function slightly, maybe with a #define. > >This makes it an act of faith in reading large programs as to what >procedure calls do. I prefer intrinsics for common functions. Mike Percy has fallen victim to a common misconception. The user does not have the guaranteed freedom to redefine standard functions. (It may happen to work; this is also true in FORTRAN.) The standard functions in C are quite analogous to the intrinsics of FORTRAN. (Note: "analogous" does not mean "identical".) Karl W. Z. Heuer (ima!haddock!karl or karl@haddock.isc.com), The Walking Lint
karl@haddock.ISC.COM (Karl Heuer) (06/11/88)
I really don't want to continue this cross-posted discussion, but I'd like to comment on the multi-dimensional array issues. In article <10032@ames.arc.nasa.gov> mwette%gauss@hub.ucsb.edu (Matt Wette) writes: >Multidimensioned arrays in C are hard to implement well and in a way that >will be portable. Since we're talking about FORTRAN conversion, the straightforward solution is to use the array notation supplied by the C language: "float a[3][5]". The only major problem is that FORTRAN allows an array parameter to be declared with a nonconstant dimension; to render the equivalent code in C (unless you have Gnu's gcc) requires explicit subscript calculation. (This is no less efficient than in FORTRAN, but it is tedious to write. A macro can be helpful.) >Fortran code is _often_ optimized to account for Fortran's columnwise storage >of 2-D arrays. Pascal (and Ada, I assume) store rowwise. No problem. You translate the FORTRAN expression A(J,I) into the C expression a[i][j], or its Pascal or Ada equivalent. (This is assuming your translator knows which parens are arrays rather than function calls.) Karl W. Z. Heuer (ima!haddock!karl or karl@haddock.isc.com), The Walking Lint
smryan@garth.UUCP (Steven Ryan) (06/11/88)
Another big problem is that Fortran programs run like a bat out of hell with a good compiler. It is hard to convince people to give up the performance for (?)ephermal advantages of maintainablity.
shenkin@cubsun.BIO.COLUMBIA.EDU (Peter Shenkin) (06/13/88)
In article<10032@ames.arc.nasa.gov> eugene@pioneer.UUCP (Eugene N. Miya) writes: > >I have been asked to post this. Please send mail to him, not me. [ note: "him" is: Matt Wette mwette%gauss@hub.ucsb.edu ] > ...I see two >issues in making transition to something better. The first is the >choosing of a suitable replacement language. The second is that of >translating Fortran Dusty Decks. > >1. Choice of Language > > Multidimensioned arrays in C are hard to >implement well and in a way that will be portable. One person's idea >of how to implement multidimesioned arrays will differ from another's. >Now try to write code that will handle both! YECH! Just on this issue, I wonder what other numerikers in the audience think of Press et al's solution to this problem, given in "Numerical Recipes in C", Wiley, 1988. To make a long story short, they supply a routine: float **matrix( int nrl, int nrh, int ncl, int nch ) which uses dope vectors to dynamically allocate a 2-dimensional array of floats with range [nrl...nrh][ncl...nch]. They also supply similar routines for double-precision and integer matrices (called dmatrix() and imatrix() ), and the corresponding "free" routines. They do a similar thing for vectors, and obviously an extension of this approach using varargs -- or whatever the hell it's called in the draft ANSI standard :-) -- could easily be accomplished for arbitrary dimensionality. This is, in Matt's words, just "one person's idea of how to implement multidimensioned arrays", but it seems like a reasonable one. The reason for raising this issue here on the net is simply that if we who do write numerical code in C (for better or for worse!) were to standardize on this practice, we could better understand each others' code (or at least this part of it!) For the record, this generalizes to the following: float **array( int Ndim, int n1l, int n1h, int n2l, int n2h, ..., int nNl, int nNh ) /* allocate an N-dimensional array of floats with range * [n1l, n1h][n2l,n2h]...[nNl,nNh] */ void free_array( float **ar, int Ndim, int n1l, int n1h, int n2l, int n2h, ..., int nNl, int nNh ) /* free array allocated with array() */ Then we also would have the following routines: int **iarray() /* allocate an N-dimensional array of ints.... */ double **darray() /* allocate an N-dimensional array of doubles.... */ and the corresponding free routines. Of course, we could then have: #define vector( NL,NH ) array( 1, (NL), (NH) ) #define matrix( NRL,NRH,NCL,NCH ) array( 2, (NRL), (NRH), (NCL), (NCH) ) and perhaps even (for "real" C programmers): #define vector0( N ) array( 1, 0, (N) ) #define matrix0( NR,NC ) array( 2, 0, (NR), 0, (NC) ) ...and so on. All presumably in some header file with a name like "arrays.h". Any thought on this? ("Numerical analysts of the world, unite....") -- ******************************************************************************* Peter S. Shenkin, Department of Biological Sciences, Columbia University, New York, NY 10027 Tel: (212) 280-5517 (work); (212) 829-5363 (home) shenkin@cubsun.bio.columbia.edu shenkin%cubsun.bio.columbia.edu@cuvma.BITNET
greg@csanta.UUCP (Greg Comeau) (06/13/88)
>> Gee, I just tried compiling x = +5; it worked. > i = +5; >comments: > "junk.c", line 5: syntax error >It may very well work in other compilers, and will probably be added in >the new standard, but is not there now. If an old compiler accepts this it is probably just being lazy for some reason. Since C does not have a unary plus, it should be a syntax error. Note that the ANSI draft does allow a unary plus now though since "hey, there's a unary minus so let's at least be consistent". Also, at one time ANSI did apply a binding characteristic to the unary plus applied to parenthised expression (say: a + +(b + c) if I recall), but that has since been withdrawn.
kjartan@raunvis.UUCP (Kjartan Pierre Emilsson Jardedlisfraedi) (06/13/88)
  Now here there seems to be a little misunderstanding.  I have never
understood why Fortran users keep saying that array indexing is awkward
in C.  This is probably because they find the idea of having to allocate
memory for an array rather repulsive.  I  remember that when I made
the change from Fortran to C, I thought that the memory allocation
routine malloc() was only intended for compiler designers and other
computer wizards.  But one quickly finds out that allocating memory
for whatever you are doing is not only simple but also helps you a lot
in keeping the whole program under control, since you have to have in 
mind a clear image of all your variables.
  For people that frequently use the same sort of data structures, e.g
two dimensional double precision array, it is a simple task to write
a little subroutine wich handles the allocation. For this particuliar
data structure, the subroutine is basically a one-liner:
double **Create2DArray(w,h)
int w,h;{ double **r;
for(r=(double**)calloc(h,sizeof(*r));h-->0;r[h]=(double*)calloc(w,sizeof(**r)));
return(r);}
  This may look very cryptic but then you only write it once anyway! A small
extension of the routine above could handle multidimensional array of
arbitrary dimensions and arbitrary data structures.
  
  If the array had been initialized with Create2DArray() then indexing
would be just as normal,i.e:
		DoSomething(array,m,n)
		double **array;
		int m,n;
		{
		...
		array[ i+k ][ j-i ]= 1.0;
		...
		}
 More generally: if an array is initialized with arbitrary dimensions, its
indexing is exactly as in FORTRAN, and can be passed to functions in exactly
the same manner (that is by pointers, which in this case is simply the array
name).
 I personally made the change from FORTRAN to C one year ago, and today I
do not regret it , though there came times when I wanted to do some really
mean things to that stupid computer.  I haven't  completely given up 
FORTRAN though, as there are still many libraries in FORTRAN around here, but I
simply build jacket routines for them (i.e routines which interface calling
conventions of two different languages) with the JBL utility. In that way
I don't have to write anything in FORTRAN but I still can use all the
available FORTRAN subroutines. Knowing that FORTRAN calls everything by
reference and C calls everything by value, it is easy to pass the arguments
from the C program in such a way that the only interfacing you need to do is
converting the pre-underscored C routine function name to the upper-case
FORTRAN name.
  I think that the C offers two qualities that makes it a very comfortable
scientific programming language:
		i) Argument are passed by value, which translates to
		   the fact that to let a subroutine change a variable
		   you have to specify it (by passing its adress).  This 
		   makes subroutines more like real mathematical operators
		   which have a very definite input and a very definite
		   output. This is really a psychological quality, but I
		   think that it has its effect in organizing your program
		   (a What You Get Is What You Asked For quality).
	       ii) You can define your own data structures, which can be any
		   combination of system defined structures and user defined
		   structures, array of structures, self-referential
		   structures and so on. This I think is perhaps the most
		   interesting quality and probably anyone who has done some
		   mathematics will appreciate the fact that most complex
		   problems can be made simpler by a suitable change of
		   formalism which most often is a change of data structure
		   with accompagnment of new operators acting on these
		   structures. For example if you are working in
		   4-dimensional space you simply define the data structure
		   4-vector and then proceed to write the basic operators
		   that you are going to need (DotProduct(vector1,vector2)
		   ,Add(vector1,vector2),Scale(scalar,vector), and so on).
		    Quickly you will see that your program is talking the
		    same mathematical language as you are, and program
		    writing then becomes a real FORmula TRANslation!
  This is not meant as an artillery in the Language Wars, but rather as an
explanation and maybe a clarification for those who haven't had any special
experience with C ,and still think that indexing doesn't exist in C and that
C is bad because it doesn't have the complex data type!
	Say it in any language you like, but say it clearly!
...........................................................................    
    "If you don't like what your left hemisphere thinks, shoot if off."
			Kjartan Pierre Emilsson, Iceland.
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~hirchert@uxe.cso.uiuc.edu (06/14/88)
FORTRAN remains the preeminent language for numerical (esp. scientific and
engineering) computing for a number of reasons:
o Inertia
  - Much scientific and engineering computing is based on earlier work.  If
    that earlier work is in FORTRAN (and the odds are that it is), then it is
    usually easier to continue working in FORTRAN than to switch to another
    language.  [This effect may be reduced somewhat if automatic language
    conversion programs become both more reliable and more widely available.
    However, unless someone figures out how to translate the comments from
    describing the source language to describing the result language, there
    will always be some barrier here to language switching.]
  - Many of the people doing scientific and engineering computing are
    professional scientists and engineers and only part-time programmers.
    Since programming is not their business, they are far more likely to be
    proficient in only a single language, so even when projects are started
    from scratch, they are far more likely to use the language that they are
    already using when building on existing work - FORTRAN.  [This effect may
    be reduced as more and more colleges and universities teach languages other
    than FORTRAN to their science and engineering students.  On the other hand,
    if one's only exposure to a language is a few class problems, one may still
    feel uncomfortable using that language for "serious" computing.]
o FORTRAN is, in a number of ways, a superior language for many kinds of 
  numeric computing.
  - Features
    . FORTRAN includes a number of features that many more "modern" language
      make you build yourself from more basic features.  The classic example
      is the COMPLEX data type.  A less appreciated example is FORTRAN formatted
      input/output.  If you are reading and writing the classic tables of
      numbers and labeling information, it is a substantially more compact and
      more convenient tool than what is offered in most more "modern" languages.
      (Of course, if you want to some other kind of textual input/output, it
      can be a major pain.)  [In theory, the "modern" languages could match
      this power with library procedures of comparable power.  In practice, the
      portable routines tend to make FORTRAN input/output look fast and the
      fast routines tend not to be portable.  The development of standards, so
      the fast routines are all written to the same specification, would help
      here.]
    . The FORTRAN programmer usually has access to a much wider selection of
      "canned" numerical software than the programmer in other languages.
      (This is an outgrowth of the inertia factor.)  [Automatic translators
      can help with this problem when source code is available.  Mixed language
      programming environments can make irrelevant the issue of what language
      was used to code your "canned" software.  Unfortunately, the way two
      languages interact seems to vary significantly from environment to
      environment, so mixed language software tends to be less portable than
      software written in a single language.  This is another area where the
      development of standards might eventually reduce FORTRAN's advantage.]
  - FORTRAN compilers tend to produce faster code.
    . Historically, vendors have put more effort into optimizing FORTRAN than
      other languages.  (Inertia again.)  [This effect is slowly being reduced
      by an increased demand for optimizing compilers for other languages and
      by the increased use of techniques such as common compiler back-ends to
      facilitate meeting these demands.]
    . FORTRAN is inherently more optimizable (using today's compiler
      technology) than most more "modern" languages.  FORTRAN is close enough
      to the machine to avoid the problem of being forced to do extensive
      analysis/optimization just to produce reasonable code, but far enough
      removed from the machine to allow the application of such analysis/
      optimization to produce signficant improvements.  In particular, where
      other languages frequently insist on complete and orthogonal definitions,
      FORTRAN will disallow unusual (and unlikely) combinations of features if
      this will allow significant improvements in the generated code.  The 
      classic (and single most important) example of this are the restrictions
      in FORTRAN concerning the use of the same or overlapping actual arguments
      corresponding to the dummy arguments of a procedure.  These restrictions
      allow a wide range of optimizations, from removal of loop invariants and
      recognition of common subexpressions to vectorization, to be applied to
      dummy arguments in the same way they are applied to ordinary variables.
      [This inherent advantage may not always be realized.  If a vendor uses
      a common compiler back-end, he may forego these optimizations in order
      simplify its use.  Alternatively, he may provide for compiler directives
      that allow the programmer in a more "modern" language to explicitly
      assert the restrictions implicit in FORTRAN.  (As I understand it, that's
      what the ANSI C noalias controversy was all about.)]
From my point of view, all of the above suggests that FORTRAN will be around
for some time to come.  On the other hand, the factors I mention in my bracketed
comments will tend to reduce FORTRAN's advantages, FORTRAN is most definitely
not without its deficiencies:
o FORTRAN's memory management is inadequate for many of todays problems.
o The array is not sufficient as the primary data organizing facility.
  (Structured data types would provide assistance in expressing fixed relations
  between non-heterogenous data.  Pointers, or something with equivalent
  semantic power, would provide assistance in expressing more dynamic or less
  uniform relations.)
o As typical portability requirements expand from porting programs from unlike
  systems on a one-shot basis to running programs under continuing development
  on several systems simultaneously (e.g. doing test runs on a workstation and
  production runs on a supercomputer), FORTRAN needs to provide more 
  assistance in writing portable code.  A classic problem in this area is
  switching between the use of REAL and DOUBLE PRECISION, including the related
  problems of switching constants and name management for libraries offering
  both REAL and DOUBLE PRECISION versions of routines.
o As the exploitation of parallelism becomes increasingly important in high-end
  computing, more powerful notation for expressing this parallelism becomes
  highly desirable.
In the long run, the extent to which FORTRAN continues to be used may depend
on what is produced as a result of the Fortran 8x effort.  If that result
retains FORTRAN's current strengths and remedies its most pressing deficiencies,
then it seems likely that Fortran will remain the preeminent numerical
computing language.  If the Fortran 8x process fails to produce a new standard
or produces one which is so tied to existing practice that it fails to address
FORTRAN's more pressing deficiencies, then a slow (but accelerating) decline
seems likely.  My guess as the most probable successor would be C.  It has
some serious problems as a numerical computation language, but the ANSI C
committee seems willing to address the extensions and changes necessary to
remedy these problems.
(Just so no one misinterprets what I'm saying:  When I suggest the possibility
of a decline in the use of FORTRAN, I'm not talking about something that will
happen in the next year or two or even in the next 5 to 10 years.  The factors
cited at the beginning of this posting are too significant.  A significant
decline, if it occurs, may well take place over a period of time comparable to
the time it took us to get where we are now, and FORTRAN is more than 30 years
old!)
Kurt W. Hirchert     hirchert@ncsa.uiuc.edu
National Center for Supercomputing Applicationshenry@utzoo.uucp (Henry Spencer) (06/15/88)
> C is missing many useful intrinsics. I have trouble taking a language > seriously for numeric computation if the language doesn't even know about > absolute values, e.g.: x = abs(x); or signed numbers, e.g. x = +5; > Both statements are illegal in C. Curious, my compiler compiles the first one just fine :-). The second is admittedly a bit of a nuisance, but hardly a disaster. In any case X3J11 has fixed both: abs is one of the potentially-intrinsic functions defined in the standard, and unary plus is provided. > [Much moaning and groaning about the lack of exponentiation, an operator > whose general case is (a) notoriously hard to define well, and (b) very > seldom used.] > Actually, C does not have a concept of intrinsic at all. X3J11 (ANSI-to-be) C does. > (running lint is a hassle). Ah, we come to the heart of it: the man wants to convert to C without having to learn anything new, go to any trouble, or buy modern compilers. My nose bleeds for him. -- Man is the best computer we can | Henry Spencer @ U of Toronto Zoology put aboard a spacecraft. --Von Braun | {ihnp4,decvax,uunet!mnetor}!utzoo!henry
chpf127@ut-emx.UUCP (J. Eaton) (06/17/88)
In article <224@raunvis.UUCP>, kjartan@raunvis.UUCP (Kjartan Pierre Emilsson Jardedlisfraedi) writes: > > [stuff deleted about dynamic memory allocation] > > For this particuliar > data structure, the subroutine is basically a one-liner: > > double **Create2DArray(w,h) > int w,h;{ double **r; > for(r=(double**)calloc(h,sizeof(*r));h-->0;r[h]=(double*)calloc(w,sizeof(**r))); > return(r);} > > This may look very cryptic but then you only write it once anyway! Yes, perhaps, but you (and, most likely, someone else) will have to read it more than once. Will that future reader be able to quickly grasp what is being done (especially if that future person is a scientific "programmer")? > Say it in any language you like, but say it clearly! I agree. > Kjartan Pierre Emilsson, Iceland. J. Eaton UT Dept of Chemical Engineering Practice what you preach.
chpf127@ut-emx.UUCP (J. Eaton) (06/17/88)
In article <224@raunvis.UUCP>, kjartan@raunvis.UUCP (Kjartan Pierre Emilsson Jardedlisfraedi) writes: > > I personally made the change from FORTRAN to C one year ago, and today I > do not regret it , though there came times when I wanted to do some really > mean things to that stupid computer. I don't understand this idea of refusing to write code in more than one language. Why not write in whatever language is most convenient for the job at hand? I use mostly Fortran because there are a lot of numercial libraries available that I simply don't care to rewrite or translate to another language. However, I don't object to C or any other language (except BASIC :-) if it fits the requirements of the job... > I haven't completely given up > FORTRAN though, as there are still many libraries in FORTRAN around here, but > I simply build jacket routines for them (i.e routines which interface calling > conventions of two different languages) with the JBL utility. In that way > I don't have to write anything in FORTRAN but I still can use all the > available FORTRAN subroutines. Knowing that FORTRAN calls everything by > reference and C calls everything by value, it is easy to pass the arguments > from the C program in such a way that the only interfacing you need to do is > converting the pre-underscored C routine function name to the upper-case > FORTRAN name. But I DO object to this. At the present time, multilanguage programming is NOT a good idea. AVOID it if at all possible. It makes porting to new environments especially difficult, if not impossible in many cases. Fortran may not be your favorite, and C isn't perfect either, but mixing the two is bound to cause trouble for you or someone else in the future. The whole world is not VM/CMS, COS, NOS/VE, VMS, or even 4.3 BSD. Moving a program which contains only Fortran or only C from any one of those environments to another is hard enough. Don't make it any more difficult by mixing in a load of interface routines which are system dependent! This will make moving your code to new machines/operating systems such a pain in the ass that very few will even give it a second thought. Of course, if you really want your code to die with the machine it was written for, disregard the above. > Say it in any language you like, but say it clearly! I still agree, just say it in one at a time. > Kjartan Pierre Emilsson, Iceland. J. Eaton UT Dept of Chemical Engineering And old machines really do die. Just ask any old CDC 6600. What? You're still using one? Oh. Sorry. Maybe machines ARE forever.
bobal@microsoft.UUCP (Bob Allison) (06/17/88)
In article <50500055@uxe.cso.uiuc.edu> hirchert@uxe.cso.uiuc.edu writes: > > >FORTRAN remains the preeminent language for numerical (esp. scientific and >engineering) computing for a number of reasons: > > [a very complete, fairly concise, excellent description of FORTRAN's > strengths and weaknesses] > >Kurt W. Hirchert hirchert@ncsa.uiuc.edu >National Center for Supercomputing Applications You know, I read this article and was actually envious. Here is someone who writes a crisp, clean, definitive, informative, objective, technically sound article. I wondered what would happen if I could write articles like this. I wondered what would happen if the whole net started writing articles like this. Then I saw it was Kurt, and I thought "Oh, of course". For Kurt is intelligent, informative, and rational: truly a gentleman and a scholar. In short: that was a really good article. Thanks. P.S. Now, don't you wish you hadn't skipped that article? :-). Bob Allison (If we could only find someone who knew about all the FORTRAN to C translators who would write a similar article for us)
firth@sei.cmu.edu (Robert Firth) (06/17/88)
The following code, contributed by a C programmer, allocates dynamic memory for a two-dimensional array: > For this particuliar > data structure, the subroutine is basically a one-liner: > > double **Create2DArray(w,h) > int w,h;{ double **r; > for(r=(double**)calloc(h,sizeof(*r));h-->0;r[h]=(double*)calloc(w,sizeof(**r))); > return(r);} Any Fortran programmer who seriously proposes to convert to C would, in my opinion, be advised to study this example very carefully. Verbum sapienta sufficit.
jlg@beta.lanl.gov (Jim Giles) (06/18/88)
In article <224@raunvis.UUCP>, kjartan@raunvis.UUCP (Kjartan Pierre Emilsson Jardedlisfraedi) writes: > Now here there seems to be a little misunderstanding. I have never > understood why Fortran users keep saying that array indexing is awkward > in C. > [...] Awkward isn't the problem. I don't like the double bracket notation, but anyone can write a macro to turn ARRAY(i,j,k) into array[i][j][k]. The Problem is that 2-d arrays are not optimizable because they are immediately turned into pointer-to-pointer-to-data type of constructs. > If the array had been initialized with Create2DArray() then indexing > would be just as normal,i.e: > > DoSomething(array,m,n) > double **array; > int m,n; > { > ... > array[ i+k ][ j-i ]= 1.0; > ... > } If your use of array here is in a loop (that might otherwise be vector- izable), the C compiler can't vectorize it because a pointer-to-pointer 'may' contain a vector dependency - the compiler doesn't know it's a 2-d array! (If you don't have a vector machine, the problem still exists, you can't unroll the loop for a pipelined archetecture because of the same possible dependency.) This problem is the reason that so much discussion on this news group has recently been about 'noalias' in C (not exactly an appropriate topic for comp.lang.fortran, but relevent to this particular issue anyway). > [...] > ii) You can define your own data structures, which can be any > combination of system defined structures and user defined > structures, array of structures, self-referential > structures and so on. [...] This capability is NOT an adequate replacement for the complex data type. Complex is more than just a pair of reals, there are several operations defined on complex (like plus) which should work naturally. No use of C programming constructs can do this: COMPLEX CVAR REAL RVAR INTEGER IVAR ... CVAR=(CVAR+RVAR)*IVAR ... The operations (including the mixed-mode type conversions) are all done IN-LINE and efficiently. C provides no way to even do this calculation without function calls. > This is not meant as an artillery in the Language Wars, but rather as an > explanation and maybe a clarification for those who haven't had any special > experience with C ,and still think that indexing doesn't exist in C and that > C is bad because it doesn't have the complex data type! Clarifications which contain false or misleading information don't clarify anything. In fact, such 'clarifications' DO add to the Language Wars in very unproductive ways. J. Giles Los Alamos
smryan@garth.UUCP (Steven Ryan) (06/18/88)
First of all I agree with Dijkstra with respect to Fortran. I feel the challenge designing a new language that can be Fortran at its own game, which C does not do. > Now here there seems to be a little misunderstanding. I have never >understood why Fortran users keep saying that array indexing is awkward >in C. This is probably because they find the idea of having to allocate >memory for an array rather repulsive. A Fortran compiler is permitted to allocate all arrays statically, before compilation, and many do this. Also Fortran subroutine are not required to be recursive and hence do not need a procedure frame. All allocation can be done in the compiler and the runtime overhead can be zero. Also it is easy for programmer to modify the array base address to implement nonzero lower bounds. It is also easy for the compiler. I suspect Fortran programmers know alot less about dope vectors biased base addresses and that ilk than C programmers (scientists and engineers vs. computer science and system programmers). > i) Argument are passed by value, which translates to > the fact that to let a subroutine change a variable > you have to specify it (by passing its adress). And simplifies defined/referenced detection for an optimiser. > ii) You can define your own data structures, Apparently some Fortran programmers equivalence different typed arrays to create structures (shudder). Nature is immacuately fair and implacably cruel -- Sara James.
gwyn@brl-smoke.ARPA (Doug Gwyn ) (06/18/88)
In article <5917@aw.sei.cmu.edu> firth@bd.sei.cmu.edu.UUCP (Robert Firth) writes: >Any Fortran programmer who seriously proposes to convert to C would, in >my opinion, be advised to study this example very carefully. They would be much better off studying literate code.
rrr@naucse.UUCP (Bob Rose ) (06/20/88)
In article <20340@beta.lanl.gov>, jlg@beta.lanl.gov (Jim Giles) writes: > In article <224@raunvis.UUCP>, kjartan@raunvis.UUCP (Kjartan Pierre Emilsson Jardedlisfraedi) writes: > > Now here there seems to be a little misunderstanding. I have never > > understood why Fortran users keep saying that array indexing is awkward > > in C. > > [...] > > Awkward isn't the problem. I don't like the double bracket notation, > but anyone can write a macro to turn ARRAY(i,j,k) into array[i][j][k]. > The Problem is that 2-d arrays are not optimizable because they are > immediately turned into pointer-to-pointer-to-data type of constructs. Ok, what gives. Some else post the following code (not exact but close enough) real a(5,2) ... call foo(a) ... end subroutine foo(a) real a(10) ... Now doesn't the following c code work to do the same double a[2][5] /* <-- note [2][5] not [5][2] */ ... foo(a) ... } void foo(a) double a[10] { If it is not the same, would someone please post or email a counter example. I would like to see what is wrong. Note there are problems if you dynamicly allocate arrays but you cannot dynamicly allocate arrays in fortran so this is a void arguement. -bob
g-rh@cca.CCA.COM (Richard Harter) (06/20/88)
In article <738@naucse.UUCP> rrr@naucse.UUCP (Bob Rose ) writes: >In article <20340@beta.lanl.gov>, jlg@beta.lanl.gov (Jim Giles) writes: >> Awkward isn't the problem. I don't like the double bracket notation, >> but anyone can write a macro to turn ARRAY(i,j,k) into array[i][j][k]. >> The Problem is that 2-d arrays are not optimizable because they are >> immediately turned into pointer-to-pointer-to-data type of constructs. >Ok, what gives. Some else post the following code (not exact but close enough) > > real a(5,2) > ... > call foo(a) > ... > end > > subroutine foo(a) > real a(10) > ... My example, but this isn't the right context. This code was posted in response to someone proposing a macro which would produce the following C code: double **a; a = (double **)malloc(2 * sizeof(double *)); for (i=0;i<2;i++) a[i] = (double *)malloc(5 *sizeof(double)); which is not the same thing at all as your example. However the quoted statement is not, as far as I know, correct. Two dimensional arrays in C are not pointer to array constructs. >Now doesn't the following c code work to do the same > double a[2][5] /* <-- note [2][5] not [5][2] */ > ... > foo(a) > ... > } > > void foo(a) > double a[10] > { Yes, it does. However in fortran you can do the following real a(10) ... call foo(a,2,5) call foo(a,5,2) ... subroutine foo(a,m,n) real a(m,n) ... This is what fortran users mean when they talk about variable dimensioning; all it really means is that the compiler will generate address calculations for arrays using variables as well as constants. You can do this by hand, but it is much more convenient if the compiler will do it for you. As far as I know, there is no good reason why this feature could not be added to C. The discussion revolves around ways to fake it in C, using preprocessor tricks. They all look kludgy to me. >Note there are problems if you dynamicly allocate arrays but you cannot >dynamicly allocate arrays in fortran so this is a void arguement. This is not quite right, but it is close enough for practical purposes. The correct statements are: (1) Does the language provide intrinsic dynamic storage allocation. Answer: Intrinsic dynamic storage allocation is not a part of either language. (2) Does the standard support environment include a dynamic storage allocator? Answer: C does, fortran doesn't. (3) Given a dynamic storage allocator, does the language permit use of it? Answer: The C language provides a convenient portable means for doing so. It can be done in fortran, but the means are not as convenient and cannot be guaranteed to be portable. -- In the fields of Hell where the grass grows high Are the graves of dreams allowed to die. Richard Harter, SMDS Inc.
mouse@mcgill-vision.UUCP (der Mouse) (06/20/88)
In article <10655@agate.BERKELEY.EDU>, jerry@violet.berkeley.edu ( Jerry Berkman ) writes: >> [C vs FORTRAN] > C is missing many useful intrinsics. I have trouble taking a > language seriously for numeric computation if the language doesn't > even know about absolute values, e.g.: x = abs(x); or signed > numbers, e.g. x = +5; Both statements are illegal in C. Not so. The first one is perfectly legal; any compiler that won't accept it is broken. The second one is legal under the dpANS (or at least in K&R V2), though it used to be illegal. (Why do we need a unary plus operator anyway? I never missed it.) > [ i = j**k and y = x**6 ] How do you express link->next->chain = vec[i++]; in FORTRAN? Why try to create a war of features? It sounds as though your sort of work is better expressed in FORTRAN. Please stick to FORTRAN, then, and stop complaining about C. You aren't complaining about how Snobol makes it difficult and slow to do what you want to do, are you? > Actually, C does not have a concept of intrinsic at all. It does under the dpANS, or very close to it. If we say, for example, #include <math.h> x = sqrt(y); then we are justified in believing the sqrt() routine to be the math library routine. > For that matter, since sqrt() is a FORTRAN intrinsics, I get an error > message from any decent FORTRAN compiler for sqrt(integer). None > from C of course (running lint is a hassle). In other words, you recognize that you have such a checking feature available, but using it is too much trouble, so you complain about the lack of checking. Riiight. (I certainly can't see the hassle in running lint; just say "lint" instead of "cc"....) der Mouse uucp: mouse@mcgill-vision.uucp arpa: mouse@larry.mcrcim.mcgill.edu
mouse@mcgill-vision.UUCP (der Mouse) (06/20/88)
In article <10681@agate.BERKELEY.EDU>, jerry@violet.berkeley.edu ( Jerry Berkman ) writes: > As for "x = abs(x);", it does compile and load. > However, it's calling a library function. What's wrong with that? > (1) It's slow. > (2) It returns the absolute value of integers. You can type it > "float abs();" and it works on the VAX, but it may not work on > all systems. Doesn't work on our VAX. Your VAX must be using a different floating-point format from the one in the VAX architecture manual. I tried compiling and running this: float abs(); main() { printf("%g\n",abs(123.456)); printf("%g\n",abs(-123.456)); } and it gave me -0.00210706 0.00210706 Of course, maybe you think that's what it means for it to work on the VAX....but I daresay most people would disagree. (You could also break down and use fabs, the floating-point absolute value routine - or is that cheating?) der Mouse uucp: mouse@mcgill-vision.uucp arpa: mouse@larry.mcrcim.mcgill.edu
rrr@naucse.UUCP (Bob Rose ) (06/21/88)
In article <29697@cca.CCA.COM>, g-rh@cca.CCA.COM (Richard Harter) writes: > In article <738@naucse.UUCP> rrr@naucse.UUCP (Bob Rose ) writes: > >Ok, what gives. Some else post the following code (not exact but close enough) > > [Some fortran code and the C equivalent] > > Yes, it does. However in fortran you can do the following > > real a(10) > ... > call foo(a,2,5) > call foo(a,5,2) > ... > subroutine foo(a,m,n) > real a(m,n) > ... > > This is what fortran users mean when they talk about variable dimensioning; > all it really means is that the compiler will generate address calculations > for arrays using variables as well as constants. You can do this by hand, > but it is much more convenient if the compiler will do it for you. As far > as I know, there is no good reason why this feature could not be added to > C. The discussion revolves around ways to fake it in C, using preprocessor > tricks. They all look kludgy to me. ^^^^^^ Here is the equivalent C code (right?) double a[10] ... foo(a,2,5) foo(a,5,2) ... #undef a void foo(a,m,n) double a[] #define a(x,y) a[(x)*n+(y)] /* works with ansi-c */ int m,n; { ... a(p,q) = ... /* same syntax as fortran, golly gee */ Yes, it is kludgy, but it does work. Sooo, the question again is is there any fortran constructs that cannot be translated into fortran through a brute force kludge. You know, something that a well trained ape couldn't do. I would like to see it. Please no complex numbers, these well just be function call (and if you are worried about speed, get a compiler that well do inline function calls.) -bob -
jlg@beta.lanl.gov (Jim Giles) (06/21/88)
In article <1171@mcgill-vision.UUCP>, mouse@mcgill-vision.UUCP (der Mouse) writes: > [...] (Why do we need a > unary plus operator anyway? I never missed it.) The proposed ANSI standard for C has unary plus operators. Among other uses, it can force ordering of expression evaluation. The famous case where C doesn't evaluate a+(b+c) in any particular order is fixed by doing a+ +(b+c). The unary plus forces the subexpression to be eval- uated before adding the other constant. > [...] It sounds as though your sort of work is > better expressed in FORTRAN. Please stick to FORTRAN, then, and stop > complaining about C. You aren't complaining about how Snobol makes it > difficult and slow to do what you want to do, are you? > This is exactly the point of previous responses. There really are things This is exactly the point of previous responses. There really are things which are best done in Fortran. The points raised about C in this discussion are reasons for not switching. No one is complaining about C in isolation, these are legitimate answers to the original posting about the desirability of switching to C. As such, discussion of the inadequacies of C for certain types of work are quite appropriate. J. Giles Los Alamos
leo@philmds.UUCP (Leo de Wit) (06/21/88)
In article <20340@beta.lanl.gov> jlg@beta.lanl.gov (Jim Giles) writes: >In article <224@raunvis.UUCP>, kjartan@raunvis.UUCP (Kjartan Pierre Emilsson Jardedlisfraedi) writes: >> Now here there seems to be a little misunderstanding. I have never >> understood why Fortran users keep saying that array indexing is awkward >> in C. >> [...] > >Awkward isn't the problem. I don't like the double bracket notation, >but anyone can write a macro to turn ARRAY(i,j,k) into array[i][j][k]. Matter of style. I don't like seeing an identifier that could be both an array and a function call (oh yeah, Fortran uses CALL in front of it 8-). >The Problem is that 2-d arrays are not optimizable because they are >immediately turned into pointer-to-pointer-to-data type of constructs. Depends on how you implement them. > >> If the array had been initialized with Create2DArray() then indexing >> would be just as normal,i.e: >> >> DoSomething(array,m,n) >> double **array; >> int m,n; >> { >> ... >> array[ i+k ][ j-i ]= 1.0; >> ... >> } > >If your use of array here is in a loop (that might otherwise be vector- >izable), the C compiler can't vectorize it because a pointer-to-pointer [stuff about vectorizing deleted]... So I suggest another alternative for Create2DArray(): double **Create2DArray(w,h) int w,h; { double **r, **a; for (a = calloc(w * h, sizeof(**r)), r = calloc(h,sizeof(*r)); --h >= 0; r[h] = a + w * h; return r; } This has two advantages: only 2 calls to calloc needed (while the original needed h + 1), and furthermore the data itself is contiguous in memory (pointed to by r[0]). Furthermore you can have both the speed of vectored arrays and flat ones (whatever is more appropriate). Another remark: perhaps it's better to replace the second calloc by a malloc, since the contents of r[h] is changed anyway. And you can combine the two allocations into one, as The Walking Lint pointed out. If you're out for speed, use flat arrays and pointers into these. > >> [...] >> ii) You can define your own data structures, which can be any >> combination of system defined structures and user defined >> structures, array of structures, self-referential >> structures and so on. [...] > >This capability is NOT an adequate replacement for the complex data type. >Complex is more than just a pair of reals, there are several operations >defined on complex (like plus) which should work naturally. No use of >C programming constructs can do this: > > COMPLEX CVAR > REAL RVAR > INTEGER IVAR > ... > CVAR=(CVAR+RVAR)*IVAR > ... >The operations (including the mixed-mode type conversions) are all done >IN-LINE and efficiently. C provides no way to even do this calculation >without function calls. This is simply not true. Even if the C compiler cannot generate inline code it is perfectly possible to use macro's to achieve the speed that FORTRAN boasts of (and I think even more than that). For instance: typedef struct { double c_re, c_im; } complex; #define C_MULT(p,q,r) \ ((r).c_re = ((p).c_re + (q).c_re, (r).c_im = (p).c_im + (q).c_im) complex z1, z2, result; /* ------- */ C_MULT(z1,z2,result); You can have even macro's for functions on complex numbers, and even the best Fortran compiler cannot forsee to implement all user functions inline (in fact I think only a very limited number of functions are supported). The only drawback of the macro approach is that most people don't like the splitting up of expressions into these elementary ones (if C had structure assignment and runtime declarators also this problem was solved). But I think Fortran would get into trouble if you had to work with quaternion calculations, e.g. Nice as it is to have predefined types and operations, it is even nicer to support creating you own types. What if you want to describe distributions in your data model? C offers a much cleaner approach. Pure number crunching: ok, Fortran. But compiler building? Databases? Operating systems? Juicy, juicy (You see: use C ;-). If I want speed, I'd better have no Fortranquilizers 8-). Leo.
bobal@microsoft.UUCP (Bob Allison) (06/22/88)
In article <741@naucse.UUCP> rrr@naucse.UUCP (Bob Rose ) writes: >Sooo, the question again is is there any fortran constructs that cannot >be translated into fortran through a brute force kludge. You know, >something that a well trained ape couldn't do. I would like to see it. >Please no complex numbers, these well just be function call (and if >you are worried about speed, get a compiler that well do inline function >calls.) > -bob Much as I hate to get embroiled in this my-language-can-do-anything-your- language-can-do-better issue, reading this article I did manage to come up with a feature which is fairly difficult to implement in C: assigned gotos. I don't believe C can take the address of a label, which is the way a "well-trained ape" would do it. Otherwise, you could (I suppose) assign each label as it appears a monotonically ascending number and then in place of the "ASSIGN 10 TO L" statement, assign the assigned number of the label to L, then in place of the "GOTO L" statement do a case statement and branch to the appropriate label based on the value in L. Frankly, this stinks. This is clearly a contrived example (I'm not suggesting we all use FORTRAN so we can do assigned GOTOs), but I think that it is obvious that there are things which C can do easily which FORTRAN can't and things which FORTRAN can do easily which C can't, and let's just live with it. (I love it, soon we'll be having lawsuits about discrimination and equal opportunity for programming languages). A very sensible comment: > Offhand, my feeling is that I wouldn't use any of these schemes. >C isn't fortran and vice versa. C doesn't allow > > foo(m,n,a) > int m,n; > double a[m][n]; > {....} > >and that's the way it is. Cute tricks to simulate this just make the >code obscure. > > Richard Harter, SMDS Inc. Of course, it is just as easy to construct a C program which is difficult to write in FORTRAN. Bob Allison (It seems this debate comes up every six months. The basic response is "Who cares? Let people use whatever language they like. It's their life." Why do even mundane things like programming language selection need to be turned into religious issues?)
leo@philmds.UUCP (Leo de Wit) (06/26/88)
In article <750@garth.UUCP> smryan@garth.UUCP (Steven Ryan) writes: >First of all I agree with Dijkstra with respect to Fortran. I feel the >challenge designing a new language that can be Fortran at its own game, >which C does not do. So what IS FORTRAN's game? > [stuff deleted]... >A Fortran compiler is permitted to allocate all arrays statically, before >compilation, and many do this. Also Fortran subroutine are not required to >be recursive and hence do not need a procedure frame. All allocation can be >done in the compiler and the runtime overhead can be zero. A few remarks: A C programmer can CHOOSE (note: uppercase 8-) whether he wants arrays to be global, static or automatic. This means that the runtime overhead can be at least as little as that of a FORTRAN program (zero overhead is rubbish; you always have to page or swap the data in, whether you use data /heap space or stack space, or you at least have to claim core). Furthermore you make it sound as if it was a merit when functions cannot be recursive (are not required to..); procedure frame are more or less for compiler builders: you can do without them, making it much harder for yourself (most procedure frames can be build and broken down with a single, fast instruction; this goes also for pushing/popping sets of registers). If you can design a function in which the system spends significant time doing call, return, push, pop, link, unlink there must be very little code inside that function (try a macro; no, I do not mean an assembler! 8-). Dropping off these things (call, push ..) can only gain a few percent, at what other cost (inflexible subroutine scheme, permanent variables)? You can even totally rule out any use of subroutines if you're out for speed: large monolotic blocks with many pieces of repeated code, is that desirable? The scope of local variables can be more restricted than in FORTRAN if you wish so (yes, for auto AND for static vars); more like Algol block-bound. >Also it is easy for programmer to modify the array base address to implement >nonzero lower bounds. It is also easy for the compiler. I suspect Fortran >programmers know alot less about dope vectors biased base addresses and that >ilk than C programmers (scientists and engineers vs. computer science and >system programmers). The trouble here is obviously that scientists want their problem to map too literally onto the notation and possibilities of a computer language. Strange, because C you have to learn only once (that is, twice if that is not the proposed ANSI draft standard), scientific notations may change all the time. Knowing how to work with pointers can improve on speed considerably. > >> ii) You can define your own data structures, > >Apparently some Fortran programmers equivalence different typed arrays to >create structures (shudder). Now this one I cannot grasp (English not being my native language); what do you mean by this sentence (what are the predicate and the subject) ? Can you give an equivalence 8-) ? Leo.
mouse@mcgill-vision.UUCP (der Mouse) (06/26/88)
In article <20340@beta.lanl.gov>, jlg@beta.lanl.gov (Jim Giles) writes: > In article <224@raunvis.UUCP>, kjartan@raunvis.UUCP (Kjartan Pierre Emilsson Jardedlisfraedi) writes: >> [...] > The Problem is that 2-d arrays are not optimizable because they are > immediately turned into pointer-to-pointer-to-data type of > constructs. Real 2-d arrays turn into pointer-to-array, not pointer-to-pointer. If you have one of these dynamically-allocated arrays that have been bandied about, which is really an array of pointers to blocks of data, *that* is pointer-to-pointer. >> ii) You can define your own data structures, > This capability is NOT an adequate replacement for the complex data > type. Complex is more than just a pair of reals, [...] It's both more and less powerful. To put it another way, FORTRAN's COMPLEX data type is NOT an adequate replacement for the ability to define custom data types. > No use of C programming constructs can do this: [FORTRAN example] > C provides no way to even do this calculation without function calls. Sure it does, it just takes a little more code than the FORTRAN example. But that's a strawman anyway. For every such example, I can write one in C that FORTRAN similarly "can't" do. der Mouse uucp: mouse@mcgill-vision.uucp arpa: mouse@larry.mcrcim.mcgill.edu
mouse@mcgill-vision.UUCP (der Mouse) (06/26/88)
In article <738@naucse.UUCP>, rrr@naucse.UUCP (Bob Rose ) writes: > In article <20340@beta.lanl.gov>, jlg@beta.lanl.gov (Jim Giles) writes: >> The Problem is that 2-d arrays are not optimizable because they are >> immediately turned into pointer-to-pointer-to-data type of >> constructs. > Ok, what gives. Some else post the following code (not exact but > close enough) > real a(5,2) > call foo(a) > end > subroutine foo(a) > real a(10) > Now doesn't the following c code work to do the same > double a[2][5] /* <-- note [2][5] not [5][2] */ > ....foo(a).... > void foo(double a[10]) { ... } No, I'm afraid not. foo() is expecting an array of ten doubles. Instead, you're passing it an array of two arrays of five doubles. (Actually, a pointer to an array of five doubles.) What's the difference? To put it briefly, I find no guarantee that sizeof(double[5]) == 5*sizeof(double). (If that didn't make sense to you, read on.) Of course, most compilers won't produce surprises like this, but that's no guarantee that some machine won't come along on which there are good performance reasons for doing it. A is just an array of two elements. Like other arrays of two elements, the memory allocated for it can be thought of as being in two pieces: one for each element. By the definition of pointer arithmetic and its relation to arrays, the distance from the start of the first piece to the start of the second piece is the size of the thing a is an array of. When the elements of an array are, say, structures, nobody finds this surprising. For example, suppose we have struct { int i; char c; } b[2]; Now suppose sizeof(int)==2. Then that structure really *needs* only three bytes of storage. However, the compiler is free to waste a byte and decree that the structure type takes up four bytes, one of which has no name. As far as I can tell, the same is true of arrays: if I declare char c[3]; the compiler is free to do the same padding trick and make c take up four bytes, one of which has no name. Of course the same is true of arrays of five doubles. So when we allocate an array of two such arrays, the compiler may put padding in between. der Mouse uucp: mouse@mcgill-vision.uucp arpa: mouse@larry.mcrcim.mcgill.edu
g-rh@cca.CCA.COM (Richard Harter) (06/27/88)
In article <1189@mcgill-vision.UUCP> mouse@mcgill-vision.UUCP (der Mouse) writes: >>> ii) You can define your own data structures, >> This capability is NOT an adequate replacement for the complex data >> type. Complex is more than just a pair of reals, [...] >It's both more and less powerful. To put it another way, FORTRAN's >COMPLEX data type is NOT an adequate replacement for the ability to >define custom data types. The CDC 3600 version of fortran II had an interesting wrinkle. You could define your own data types with arithmetic operators. The compiler would generate calls to functions which you supplied. The issue really is -- what data types are supported by the language as primitives. Any type which is a primitive language type can be compiled much more efficiently than one which must be built up by hand. C has characters and pointers as primitives; fortran has complex as a primitive. C also has structures as a primitive. -- In the fields of Hell where the grass grows high Are the graves of dreams allowed to die. Richard Harter, SMDS Inc.
rob@kaa.eng.ohio-state.edu (Rob Carriere) (06/28/88)
In article <528@philmds.UUCP> leo@philmds.UUCP (L.J.M. de Wit) writes: > >The trouble here is obviously that scientists want their problem to map >too literally onto the notation and possibilities of a computer >language. Strange, because C you have to learn only once (that is, >twice if that is not the proposed ANSI draft standard), scientific >notations may change all the time. Knowing how to work with pointers >can improve on speed considerably. > Yes. I have written FORTRAN 66 programs where I could not set the array lower bound, and was forced to drag ``-lb'' through every array index in the entire code. Rest assured, it did absolute wonders to readability (and my nerves :-) I know how to work with pointers, I'm just not proud enough of the fact to have to demonstrate it throughout the code. I'll hide this stuff where it belongs, in a couple of low-level functions. Not to mention, *my* notation tends to be very consistent, I use whatever makes the problem the easiest to understand :-) > >>Apparently some Fortran programmers equivalence different typed arrays to >>create structures (shudder). > >Now this one I cannot grasp (English not being my native language); what do >you mean by this sentence (what are the predicate and the subject) ? >Can you give an equivalence 8-) ? > Methinketh the man useth ``equivalence'' as a verb, at least that's what Dijkstra likes to do (though he at least has the good sense to disambiguate to ``equivale'' -- don't you wish everybody spoke ``de enige echte taal ter wereld'' :-) :-) Rob Carriere "Never translate your formulae!"
smryan@garth.UUCP (Steven Ryan) (06/28/88)
>So what IS FORTRAN's game? Running big huge numerical models like a bat out of hell. Watching people getting bigger and faster computers is like watching a heroin addict: more! more! MORE! People who say nasty things about fortran are right, but what have they to offer? >A few remarks: >A C programmer can CHOOSE (note: uppercase 8-) whether he wants arrays >to be global, static or automatic. By only offering static allocation, the compiler does not have to support any overhead at all. A C program must keep the stack pointer correctly aligned and rest of the protocol straight even if it doesn't use the stack because somebody it calls may use the stack. Similarily, in supporting recursion, the stack has to be maintained in case some called program needs the stack. Further, supporting recursion means all automatic variables must be mapped into the stack because the procedure may recurse indirectly. Forbidding recursion means the compiler can map automatic variables into static areas. However convenient it might be, recursion is not strictly necessary in practical programs. In some numeric models, it would actually be inconvenient. This is another case where fortran gives up something of questionable value (for them) to simplify optimisation. (However, we can always use a transitive closure which takes (?)cubic time to detect recursion, but most customers would rather a fast compiler.) > (zero overhead is >rubbish; you always have to page or swap the data in, whether you use data >/heap space or stack space, or you at least have to claim core). Pagein cost vs. pagein + computation cost. > procedure frame are more >or less for compiler builders: you can do without them, making it much >harder for yourself Unless it is impossible for a procedure to recurse, directly or indirectly, local variables must be mapped onto the stack. You cannot avoid this overhead without avoiding recursion. > Dropping off these things (call, push ..) >can only gain a few percent, at what other cost (inflexible subroutine >scheme, permanent variables)? You can even totally rule out any use of >subroutines if you're out for speed: large monolotic blocks with many >pieces of repeated code, is that desirable? Unfortunately--some people do find that desirable. As far as few cycles, look back a few weeks when somebody said a C function should make the argument list length available. Many jumped on the poor guy because they didn't want their code to waste the single instruction necessary to load a register. Please, don't suggest I want speed or I find this desirable. My favorite compiler was an Algol60 compiler. I never turned off array bound checks or switch range checks. I think call-by-name (real call-by-name with deferred execution and not wimpy call-by-reference) is fun. The compiler permitted assembly procedures to have variable length lists. The calling protocol gave the parameter list length and a descriptor with type and kind for each element in the list. And the address of dope vector was passed along. >>Apparently some Fortran programmers equivalence different typed arrays to >>create structures (shudder). Some fortran programmers use the EQUIVALENCE statement to adjust the beginning address of different arrays with different types so that they map their elements into interleaved storage. Don't know details--just heard about some of things they do just to stay in fortran. Like using blank common as a heap.
jlg@beta.lanl.gov (Jim Giles) (06/28/88)
In article <1189@mcgill-vision.UUCP>, mouse@mcgill-vision.UUCP (der Mouse) writes: > In article <20340@beta.lanl.gov>, jlg@beta.lanl.gov (Jim Giles) writes: > > The Problem is that 2-d arrays are not optimizable because they are > > immediately turned into pointer-to-pointer-to-data type of > > constructs. > > Real 2-d arrays turn into pointer-to-array, not pointer-to-pointer. If > you have one of these dynamically-allocated arrays that have been > bandied about, which is really an array of pointers to blocks of data, > *that* is pointer-to-pointer. I'm sorry. To me a 2-d array is a 2-d array is a 2-d array is a .... Any routine that gets a 2-d array as an argument must assume that it is either static (which restricts you one way), or it must assume that it is dynamic (in which case it IS a pointer-to-pointer etc.). This means that you can't write a general routine which takes a 2-d array as an argument. This is EXACTLY the sort of problem that I meant originally when I said that C didn't have the concept of multi-dimensional arrays in the language. By the way, I've not seen any C compilers which optimize static 2-d arrays anyway. An early step in the compiler usually turns the array reference into a linear function on the indices - this throws out the dependency information. As a result, very similar code is actually generated whether the array was statically or dynamically allocated. C gives you the worst of both worlds - you can't treat them the same in source, and you can't get better optimization out of them in object code. > [...] > > type. Complex is more than just a pair of reals, [...] > > It's both more and less powerful. To put it another way, FORTRAN's > COMPLEX data type is NOT an adequate replacement for the ability to > define custom data types. I never claimed it was. To repeat: the ability to define derived data types, and the operations thereon, is very useful and Fortran SHOULD have it. This has NOTHING to do with the debate about whether COMPLEX should be built-in. It has NOTHING to do with the fact that C cannot do COMPLEX as efficiently as Fortran. > > > No use of C programming constructs can do this: [FORTRAN example] > > C provides no way to even do this calculation without function calls. > > Sure it does, it just takes a little more code than the FORTRAN > example. [... This 'little more code' that you're talking about does less than you think. Because COMPLEX is built-in to Fortran, the compiler writer is free to use his knowledge about the semantics of COMPLEX in designing optimizations. Whereas your 'little more code' spells out a specific sequence of operations to simulate complex artithmetic, the Fortran compiler is free to choose from all mathematically equivalent sequences for the most efficient one. User definable data types are useful, but commonly used data types should be built-in. > ...] But that's a strawman anyway. For every such example, I can > write one in C that FORTRAN similarly "can't" do. Please feel free to do so when someone asks the question: 'Should I convert C code to Fortran?' Why do you consider a direct answer to a question to be a 'strawman'? Or didn't you read the "subject" line on this article. J. Giles Los Alamos
chris@mimsy.UUCP (Chris Torek) (06/28/88)
>In article <738@naucse.UUCP> rrr@naucse.UUCP (Bob Rose) suggests that this C code is is legal: >> double a[2][5] /* <-- note [2][5] not [5][2] */ >> ....foo(a).... >> void foo(double a[10]) { ... } In article <1190@mcgill-vision.UUCP> mouse@mcgill-vision.UUCP (der Mouse) gets the right answer, and the right reason: >No, I'm afraid not. foo() is expecting an array of ten doubles. >Instead, you're passing it ... Actually, a pointer to an array of >five doubles.) but the wrong explanation: >What's the difference? To put it briefly, I find no guarantee that >sizeof(double[5]) == 5*sizeof(double). There is indeed such a guarantee. The one that is missing, which makes this particular example fail, is that pointers to arrays of objects need not at all resemble pointers to the objects themselves. Please remember that C is a strongly typed language (no matter what else you have been told). Mixing types is often legal, and the cross-checking required by the language is extremly weak, but the type system itself is quite strong. The difference is like that between an extensive law system and extensive enforcement: If the system is there, any missing enforcement can be provided later, at need, without much hassle. The whole example may be corrected quite simply: ... foo(&a[0][0]); /* or foo(a[0]) */ ... void foo(double a[10]) /* or void foo(double *a) */ { ... } -- In-Real-Life: Chris Torek, Univ of MD Comp Sci Dept (+1 301 454 7163) Domain: chris@mimsy.umd.edu Path: uunet!mimsy!chris
smryan@garth.UUCP (Steven Ryan) (06/29/88)
>> > type. Complex is more than just a pair of reals, [...] >> >> It's both more and less powerful. To put it another way, FORTRAN's >> COMPLEX data type is NOT an adequate replacement for the ability to >> define custom data types. C does not provide custom data types either. It permits new structures, but those are not orthogonal to the primitive types: no new operators, no new casts, (on some implementation) no new assignments, ...
smryan@garth.UUCP (Steven Ryan) (06/29/88)
>>>some Fortran programmers equivalence different typed arrays
In Fortranish, "equivalence" is verb: to equivalence A and B means to write
an EQUIVALENCE statement to control their relative starting addresses as in
          DIMENSION A(100),B(101)
          EQUIVALENCE (A(1),B(2))
------------------------------------------------------------
p.s.  who says Fortran-66 doesn't support zero-based arrays?shenkin@cubsun.BIO.COLUMBIA.EDU (Peter Shenkin) (06/29/88)
A few points concerning the everpresent comparisons between C and Fortran. (I like *both*!!!) Anyway, my first point has to do with recursion. The second with EQUIVALENCE. If you're bored already, hit "n". In article <817@garth.UUCP> you write: >However convenient it might be, recursion is not strictly necessary in >practical programs.... POINT 1: I used to agree, and most people agree. Most textbook examples of recursion -- eg, calculation of factorial(n) -- would be much clearer and faster, in practice if coded using simple iteration. But I've come across a class of problem which can only be generally handled through recursion, I think. I'd be intrigued if someone can come up with a Fortran solution. (Mail it to me, and I'll post a summary to comp.lang.fortran.) This class involves, conceptually, nested DO-loops, when the depth of the nest is unknown at compile time. For example, the following C program generates grid points within a hypercube, given the number of degrees of freedom (dimensionality) of the space, and the number of gridpoints (number of levels) along each coordinate. Turns out I need to do more complicated versions of this sort of thing all the time in numerical work. NB: This *can* be coded in Fortran by setting up the maximum number of needed DO-loops and escaping early, if need be. But this is unwieldy, at best , unless this maximum is small. And here (if one malloc()'s -- see my note on MAXDF) a maximum need not be specified. /*****************************************************************************/ /* grid.c: generate grid points in a hypercube */ /* compile by saying, "cc -o grid grid.c" /* invoke by saying, eg, "grid 4 2", to generate the vertices, only, of * a 4-D hypercube. */ #define MAXDF 100 /* max number of degrees of freedom -- not really needed, * since we could malloc() value[] in grdgen() */ main(argc,argv) int argc; char *argv[]; { int df,nlev; /* number of deg. of freedom, number levels per df */ df= atoi(argv[1]); nlev= atoi(argv[2]); printf("df=%d; nlev=%d\n",df,nlev); grdgen(df,nlev); } /*****************************************************************************/ grdgen(df,nlev) int df; /* no. of degrees of freedom */ int nlev; /* no. of levels each df */ { static int begin=1; /* ==0 if func was called by itself */ static int totaldf; /* total no. of degrees of freedom (1st call)*/ static int value[MAXDF]; /* to hold current values of all df's */ int i,j; if(begin) totaldf=df; if(--df ==0) { /* iterate through last df and print values */ for(i=0; i<nlev; ++i) { value[df] = i; for(j=0; j<totaldf; ++j) { printf("%d ",value[j]); } printf("\n"); } begin=1; } else for(i=0; i<nlev; ++i) { value[df] = i; begin = 0; grdgen(df,nlev); } } /*****************************************************************************/ POINT 2: >>>Apparently some Fortran programmers equivalence different typed arrays to >>>create structures (shudder). > >Some fortran programmers use the EQUIVALENCE statement to adjust the >beginning address of different arrays with different types so that >they map their elements into interleaved storage. Generally it's the same storage, not interleaved storage, and it's not to create structures. (The conceptual equivalent (not real equivalent -- there is none) of structures in standard Fortran is labeled COMMON.) EQUIVALENCE is sometimes used to be able to reuse declared (C programmers would say "defined") storage for different purposes in different places. The conceptual equivalent of this in C is the union. -- ******************************************************************************* Peter S. Shenkin, Department of Biological Sciences, Columbia University, New York, NY 10027 Tel: (212) 280-5517 (work); (212) 829-5363 (home) shenkin@cubsun.bio.columbia.edu shenkin%cubsun.bio.columbia.edu@cuvma.BITNET
js07@bunny.UUCP (Jack Shaio) (06/30/88)
In a previous posting, der Mouse replies to: > >>> ii) You can define your own data structures, >> This capability is NOT an adequate replacement for the complex data >> type. Complex is more than just a pair of reals, [...] with: >It's both more and less powerful. To put it another way, FORTRAN's >COMPLEX data type is NOT an adequate replacement for the ability to >define custom data types. > In fact, the reverse is true: C's ability to define custom data types is no substitute for FORTRAN's COMPLEX type. COMPLEX is a standard, and you can link any packaged math subroutine to your own programs without having to worry about how complex was defined. I recently obtained three FFT algorithms in C from various sources and found three different ways of defining complex (all different from mine). Had I done everything in FORTRAN, a look at the subroutine header would have been enough. The point is that if you plan to do numerical work, and not program every last subroutine from scratch, one COMPLEX data type in FORTRAN is a plus; ten different definitions of complex in C are a pain. Perhaps a remmant of the days when people used computers to solve equations. js
clewis@spectrix.UUCP (Chris "where are you this week?" Lewis) (06/30/88)
In article <817@garth.UUCP> smryan@garth.UUCP (Steven Ryan) writes: >By only offering static allocation, the compiler does not have to support >any overhead at all. A C program must keep the stack pointer correctly >aligned and rest of the protocol straight even if it doesn't use the stack Not always - besides, do you really think the prologs have: if (stack is odd) align it ?? >because somebody it calls may use the stack. Some C (and other langs) compilers optimize a lot of function prolog and epilog stuff out. Prolog and epilog for C code is usually not noticable (compared to, say IBM 360 series FORTRAN). Especially interesting is that FORTRAN subroutines tend to be far larger than C ones. This is frequently due to people learning FORTRAN on IBM 360 series stuff where the prologs and epilogs are enormous in time and space - I've worked (internals) on an 360 series version of C that used it's own super simple calling convention - ran considerably faster (even tho 360's don't have stacks!). C's calling convention is actually absurdly simple - usually little more than saving the return address and some registers (register variables and temporaries) and decrementing the stack and then restoring same at the end. Unlike most Pascals (or ADA for that matter) which frequently have some rather spectacular stack frames formats to handle. Also, what's so bad about stacks? Many machines can do some stack references faster than static - sufficient in many cases to greatly outweigh the calling convention. Eg: short stack offset instructions on 386, 68020, VAXen(?) etc. are as short as two bytes instead of six for static - which at least saves you some additional memory fetches (which are usually far more expensive than register adds) and your pipelining gets to help a little more. Eg: on a 68020, these are some representative timings (in cycles): Best case Worst case mov Rn,-(SP) 3 6 mov Rn,n(SP) 3 7 mov Rn,addr.L 5 9 mov n(SP),m(SP) 6 13 mov a.L,b.L 8 16 Similarly, for expression intermediaries, machines with short stack offsetting modes can be a big win, and are usually less wasteful of memory than most static memory schemes (unless you have a million registers ;-). So, blanket statements like "automatic variables are a lot of overhead" are frequently wrong - frequently they are *less* overhead. I've done some fairly substantial compiler benchmarking, and have noted that auto variables are usually faster than static (depending on the circumstances and the CPU). >However convenient it might be, recursion is not strictly necessary >in practical Nobody is saying "recursion is necessary" - it's occasionally a handy tool to simplify some problems. That's all. I almost never use it in C - but I do sometimes, and I'd greatly miss it in a language that doesn't support it (I'd probably say "Oh sh*t - not another FORTRAN!" ;-). >> (zero overhead is >>rubbish; you always have to page or swap the data in, whether you use data >>/heap space or stack space, or you at least have to claim core). >Pagein cost vs. pagein + computation cost. Ah yes, but static memory allocation usually lead to programs that are bigger - taking *longer* to load (usually on swapping systems). Without stacks and/or dynamic storage allocation programmers usually don't have the ability to decide how much memory he needs depending on the data - another potential optimization loss (the program's data structure has to be as large as the largest set of data, rather than being able to start smaller and make it bigger if necessary). And the stack model is especially convenient for "deep returns" (use of setjmp/longjmp to be able to conveniently recover from errors way down a subroutine tree - have you ever seen the insides of Oracle? Sheesh!). -- Chris Lewis, These addresses will last only til June 30: (clewis@lsuc afterwards) UUCP: {uunet!mnetor, utcsri!utzoo, lsuc, yunexus}!spectrix!clewis Moderator of the Ferret Mailing List (ferret-list,ferret-request@spectrix)
jgk@speech2.cs.cmu.edu (Joe Keane) (07/01/88)
In article <20454@beta.lanl.gov> jlg@beta.lanl.gov (Jim Giles) shows he hasn't used C. >Any routine that gets a 2-d array as an argument must assume that it is >either static (which restricts you one way), or it must assume that it >is dynamic (in which case it IS a pointer-to-pointer etc.). Whether an array is static or dynamic has nothing to do with whether it contains arrays or pointers. >[various implications of this assumption] >By the way, I've not seen any C compilers which optimize static 2-d >arrays anyway. So you've not seen any C compilers. Given `static foo[10][10][10]' the reference `foo[2][0][7]' is a single address. If foo were a parameter the reference would be a single offset. But see for yourself. >As a result, very similar code is actually >generated whether the array was statically or dynamically allocated. Yes, the same code. --Joe
mike@arizona.edu (Mike Coffin) (07/01/88)
From article <20454@beta.lanl.gov>, by jlg@beta.lanl.gov (Jim Giles): > By the way, I've not seen any C compilers which optimize static 2-d > arrays anyway. An early step in the compiler usually turns the array > reference into a linear function on the indices - this throws out the > dependency information. This is not true. How can information be lost by changing "a[i]" to "*(a+i)"? In fact at least one very fancy FORTRAN compiler converts array references to exactly that form to extract dependency information that is difficult to recognize otherwise. See "Interprocedural Dependency Analysis and Parallelization" by Michael Burke and Ron Cytron, in the 1986 SIGPLAN Symposium on Compiler Construction, pp 162-175. For instance they are able to parallelize a loop with the following references: DO i = 1 to N A(2i,2i+1) = ... ... = A(i,i) END They say (page 166) "Existing techniques separately analyze the two dimensions of A, and so separately consider two dependence equations, neither of which can demonstrate independence. However, the dependencies that exist independently in each dimension cannot hold simultaneously: for any value of i, the expressions at the definition (2i and 2i+1) are never equal but the expressions at the use (i and i) are equivalent. Since the dependencies cannot hold simultaneously, no dependence exists between the definition and use of A. By linearizing the references to A, we couple the two equations into one, allowing us to determine the independence of the accessed regions." -- Mike Coffin mike@arizona.edu Univ. of Ariz. Dept. of Comp. Sci. {allegra,cmcl2,ihnp4}!arizona!mike Tucson, AZ 85721 (602)621-4252
jlg@beta.lanl.gov (Jim Giles) (07/01/88)
In article <2130@pt.cs.cmu.edu>, jgk@speech2.cs.cmu.edu (Joe Keane) writes: > Whether an array is static or dynamic has nothing to do with whether > it contains arrays or pointers. > You haven't been reading this discussion at all! The C gurus all have been saying that a static multidimensional array is implemented as pointer to array, but that a dynamic multidimensional array is implemented as pointer to pointer (to pointer ... etc for the number of dimensions). I have no reason to dispute this claim since it is the same as the statements made in a recent issue of _Computer Language_ magazine. > >By the way, I've not seen any C compilers which optimize static 2-d >arrays anyway. > > So you've not seen any C compilers. Given `static foo[10][10][10]' > the reference `foo[2][0][7]' is a single address. If foo were a > parameter the reference would be a single offset. But see for > yourself. What you just described is constant folding (the constant subscript expressions were done at compile time). Now, show how this relates to the issue which was being discussed - vectorization or pipelining. > > >As a result, very similar code is actually > >generated whether the array was statically or dynamically allocated. > > Yes, the same code. Exactly!! The same bad code. It was neither vectorized nor unrolled for pipelining. Hence it is not the same code that Fortran compilers generally produce. The Fortran generally runs faster because its conception of array is better suited to these types of optimization. J. Giles Los Alamos
jlg@beta.lanl.gov (Jim Giles) (07/01/88)
In article <6067@megaron.arizona.edu>, mike@arizona.edu (Mike Coffin) writes: > From article <20454@beta.lanl.gov>, by jlg@beta.lanl.gov (Jim Giles): > > By the way, I've not seen any C compilers which optimize static 2-d > > arrays anyway. An early step in the compiler usually turns the array > > reference into a linear function on the indices - this throws out the > > dependency information. > > This is not true. How can information be lost by changing "a[i]" to > "*(a+i)"? In fact at least one very fancy FORTRAN compiler converts > array references to exactly that form to extract dependency > information that is difficult to recognize otherwise. See > "Interprocedural Dependency Analysis and Parallelization" by Michael > Burke and Ron Cytron, in the 1986 SIGPLAN Symposium on Compiler > Construction, pp 162-175. > Most simple compilers (read 'C compilers') can't vectorize the following (assume i is a loop index): A[i+N] = A[i+M] regardless of the values of N or M. However the following expression might have been the source of the above expression: A[4][i] = A[5][i] This clearly has no dependencies and even a simple compiler (read 'some Fortran compilers') can vectorize this (ie. A(i,4)=A(i,5)). Turning the references above into *(A+i+4*dim2) = *(A+I+5*dim2) which, after constant folding, is the expression I started out with is what causes the C compiler to incorrectly identify a dependency. Now, it is true that with careful analysis of M, N, and the range if i, the compiler could still have compiled this efficiently. No C compilers that I've seen do this very well. Thank you for the reference. I have always been convinced that dependency analysis should TRY the kind of stuff you suggest, but only if simplistic analysis fails to get rid of the dependencies (compiler speed should be considered too). The problem is that C doesn't use either analysis technique very well. C actually CAN'T analyse those dynamic multi- dimensional arrays (pointer-to-pointer) because it can't assume the pointers have any specific relationships. It is probably for this reason that it doesn't bother for static arrays either. (vote for 'noalias', that should help.) J. Giles Los Alamos
smryan@garth.UUCP (Steven Ryan) (07/02/88)
>Not always - besides, do you really think the prologs have: > if (stack is odd) > align it > ?? > >>because somebody it calls may use the stack. On a previous machine I used, yes, the stack had to be kept at even word boundary at all times. >Some C (and other langs) compilers optimize a lot of function prolog and >epilog stuff out. Prolog and epilog for C code is usually not >noticable (compared to, say IBM 360 series FORTRAN). The compiler is cheaper if it never existed in the first place. I suspect part of the size of prolog goes for adjustable arrays. >Also, what's so bad about stacks? Many machines can do some stack >references faster than static - sufficient in many cases to greatly outweigh >the calling convention. I think this a base register+offset compared to a constant offset, which should be avoiding in any case because it makes the code nonreentrant. Again, on the previous machine I used, all data references were base register+offset so that no time/space difference existed. Then again, if you want to consider a machine designed for C, would you also consider a machine designed for fortran. >So, blanket statements like "automatic variables are a lot of overhead" >are frequently wrong - frequently they are *less* overhead. All that is being compared is addressing modes. >>However convenient it might be, recursion is not strictly necessary >>in practical > >Nobody is saying "recursion is necessary" - it's occasionally a handy >tool to simplify some problems. That's all. For most problems, recursion is the most natural technique for me. Most times I use loops is because compilers don't do recursion removal and most languages make it difficult to define procedures. Data structures also simplify many problems. The challenge is to design a language as fast as Fortran but a shade more advanced (by about 30 years). Fortran users are conservative and are already pushing their machines to the limit. I have seen compiler complaints, not that the code is wrong, but that uses one extra instruction. >Ah yes, but static memory allocation usually lead to programs that >are bigger - taking *longer* to load (usually on swapping systems). Unix has a disgusting loader, but this is not really a language issue. Also, once loaded, a program may be run many times. In this case, a longer load is offset by a shorter run. >Without stacks and/or dynamic storage allocation programmers usually don't >have the ability to decide how much memory he needs depending on the data - >another potential optimization loss (the program's data structure has >to be as large as the largest set of data, rather than being able to >start smaller and make it bigger if necessary). Some C programmers don't seem to notice--see previous complaints about Yacc.
smryan@garth.UUCP (Steven Ryan) (07/03/88)
>> An early step in the compiler usually turns the array >> reference into a linear function on the indices - this throws out the >> dependency information. > >This is not true. How can information be lost by changing "a[i]" to >"*(a+i)"? In fact at least one very fancy FORTRAN compiler converts >array references to exactly that form to extract dependency >information that is difficult to recognize otherwise. [at two fortran compilers--the vectorising compiler I worked on also operated on an affine function of the subscripts.] The C subscript, if taken literally, does lose dependency information. Strict C parse tree possible Fortran parse tree convert-*-to-variable subscript | / \ integer-add symbol: a integer-multiply / \ / \ symbol: a integer-multiply element-size i / \ element-size i By having an explicit subscript node, a Fortran optimiser can easily access the array spans for each dimension, necessary for vectorisation. It also simplifies determining what region is referenced or defined, necessary for common subexpressions. ------------------------------------ USA - home of the modern revolution. (tho we'd just soon fergit it.)
chris@mimsy.UUCP (Chris Torek) (07/03/88)
>In article <2130@pt.cs.cmu.edu> jgk@speech2.cs.cmu.edu (Joe Keane) writes: >>Whether an array is static or dynamic has nothing to do with whether >>it contains arrays or pointers. (Joe is correct. A C array is simply `array N of T', where T is any valid C type except `function returning T1', and where N is constant. The last is the bugaboo.) In article <20518@beta.lanl.gov> jlg@beta.lanl.gov (Jim Giles) writes: >You haven't been reading this discussion at all! The C gurus all have been >saying that a static multidimensional array is implemented as pointer to >array, but that a dynamic multidimensional array is implemented as pointer >to pointer (to pointer ... etc for the number of dimensions). Only the false gurus. :-) More seriously, what C lacks in array handling is only the ability to conveniently write the following FORTRAN declaration: SUBROUTINE FOO(M, N, ARR) INTEGER M, N INTEGER ARR(M, N) ... ref ARR(i, j) ... END Ideally, one should be able to say void foo(int m, int n, int arr[n][m]) { ... /* ref arr[j][i] */ ... } /* called as foo(M, N, arr) */ and have both compilers do the same thing---namely, compute the address (&arr[0][0] + j * m + i). In practise, one must write void foo(int m, int n, int *arr) { ... /* ref arr[j*m + i] */ ... } /* called as foo(M, N, &arr[0][0]) */ There is at least one C compiler that supports the former syntax as an extension, namely the GNU compiler `gcc'. (It is a trivial extension, mechanically speaking, in most compilers.) >... The Fortran generally runs faster because its >conception of array is better suited to these types of optimization. which is correct, but not because of the form of subscripting, but rather because of an implicit rule against aliasing---a rule that I have seen violated in FORTRAN programs, incidentally. (Such programs are allowed to quietly produce mysterious results [as are malformed C programs that are equally easy to come by: different error, same lack of obviousness]. Ever hear someone gripe `but it works on VAX/VMS; the <Cray/CDC/...> compiler must be broken'?) -- In-Real-Life: Chris Torek, Univ of MD Comp Sci Dept (+1 301 454 7163) Domain: chris@mimsy.umd.edu Path: uunet!mimsy!chris
jlg@beta.lanl.gov (Jim Giles) (07/04/88)
In article <12292@mimsy.UUCP>, chris@mimsy.UUCP (Chris Torek) writes: > In article <20518@beta.lanl.gov> jlg@beta.lanl.gov (Jim Giles) writes: > >You haven't been reading this discussion at all! The C gurus all have been > >saying that a static multidimensional array is implemented as pointer to > >array, but that a dynamic multidimensional array is implemented as pointer > >to pointer (to pointer ... etc for the number of dimensions). > > Only the false gurus. :-) I would like to see dynamically allocated multi-dimensional arrays in C that aren't declared as: int **a ... But all implementaions (both here on the net and in journals) have done this. The result is that a reference to such an array really IS turned into pointer-to-pointer type constructs. If you have an alternate implementation, please send it to me and I'll shut-up. > [...] > Ideally, one should be able to say > > void foo(int m, int n, int arr[n][m]) { > ... /* ref arr[j][i] */ ... > } > /* called as foo(M, N, arr) */ > I agree. The point is that you can't. Furthermore, 'foo' has to be implemented twice if the expected argument (arr) could be either statically or dynamically allocated. In the later case, the declaration has to be: int **arr. > [...] > >... The Fortran generally runs faster because its > >conception of array is better suited to these types of optimization. > > which is correct, but not because of the form of subscripting, but > rather because of an implicit rule against aliasing---a rule that I > have seen violated in FORTRAN programs, incidentally. (Such programs > are allowed to quietly produce mysterious results [as are malformed C > programs that are equally easy to come by: different error, same lack > of obviousness]. Ever hear someone gripe `but it works on VAX/VMS; the > <Cray/CDC/...> compiler must be broken'?) Which is what I pointed out when I said that C needs the 'noalias' directive. The Fortran rule against aliasing is not implicit either. The standard specifically prohibits it. Due to separate compilation constraints, the compiler can't check the problem. But the prohibition is explicit. By the way, the Fortran 8x version of dynamic memory allocation doesn't have the pointer-to-pointer problem. When you declare an allocatable array, its dimensionality is part of the declaration. The ALLOCATE statement then takes the actual dimensions and dynamically allocates the whole space. From a subroutine's point of view, all arguments which are declared multi-dimensional are treated the same whether they were dynamically allocated or not. J. Giles Los Alamos
chris@mimsy.UUCP (Chris Torek) (07/04/88)
>>In article <20518@beta.lanl.gov> jlg@beta.lanl.gov (Jim Giles) claimed that >>>... The C gurus all have been saying that a static multidimensional >>>array is implemented as pointer to array, but that a dynamic >>>multidimensional array is implemented as pointer to pointer >>>(to pointer ... etc for the number of dimensions). >In article <12292@mimsy.UUCP> chris@mimsy.UUCP (Chris Torek) writes: >>Only the false gurus. :-) In article <20531@beta.lanl.gov> jlg@beta.lanl.gov (Jim Giles) writes: >I would like to see dynamically allocated multi-dimensional arrays in >C that aren't declared as: > int **a ... Okay: int n, m, *a, i, j; /* make a an int a[n][m] */ a = (int *)zalloc(n * m, sizeof(int)); /* zalloc is like calloc but never fails (aborts on error) */ /* work with it */ for (j = 0; j < n; j++) for (i = 0; i < m; i++) ... ref a[j * m + i] ... The other reason your claim in >>> above is wrong is because statically allocated 2-D arrays can be done with row vectors: /* these must be global */ int a0[M], a1[M], a2[M], a3[M], ..., a99[M]; /* if N=100 */ int *a[N] = { a0, a1, a2, a3, ..., a99 }; >But all implementations (both here on the net and in journals) have done >this. Because it is convenient. The version above with j*m + i is not. (This is the key point; more later.) >The result is that a reference to such an array really IS turned >into pointer-to-pointer type constructs. If you have an alternate >implementation, please send it to me and I'll shut-up. Someone already did. One can combine the row-vector and the flat array: int **a, i; a = (int **)zalloc(N, sizeof(int *)); *a = (int *)zalloc(N * M, sizeof (int)); for (i = N; i < N * M; i += M) a[i] = *a + i; /* * now may work with either a[j][i] (row vector) * or a[0][j * M + i] (flat). */ >>Ideally, one should be able to say >> >> void foo(int m, int n, int arr[n][m]) { >> ... /* ref arr[j][i] */ ... >> } >> /* called as foo(M, N, arr) */ >I agree. The point is that you can't. Right. This makes working with flat arrays of more than one dimension inconvenient (NOT impossible!). >Furthermore, 'foo' has to be implemented twice if the expected >argument (arr) could be either statically or dynamically allocated. Wrong. As long as the static and dynamic versions agree---both must either be flat, or be row-vector---the same code can be used for both. If the dynamic array is both flat and row-vector, then it does not matter which the static version is. Incidentally, I can make the static version also be both flat and row-vector: /* these must again be global */ int a00[M * N]; int *a[N] = { a00 + 0, a00 + M, a00 + 2*M, ..., a00 + 99*M }; >In the later case, the declaration has to be: int **arr. In the row-vector case. If we use the flat-and-row-vector dynamic implementation, the call must read foo(m, n, &arr[0][0]) and the declaration must read void foo(int m, int n, int *arr) { int j, i; for (j = 0; j < n; j++) for (i = 0; i < m; i++) ... work with arr[j*n + i] ... } If we use the flat static implmentation, the call and the declaration remain identical. >>>... The Fortran generally runs faster because its >>>conception of array is better suited to these types of optimization. >>which is correct, but not because of the form of subscripting, but >>rather because of an implicit rule against aliasing ... >Which is what I pointed out when I said that C needs the 'noalias' >directive. I think that the noalias directive damages C far more than it improves it, just as I think that rule in Fortran damages Fortran. >The Fortran rule against aliasing is not implicit either. I meant only that it is not explicit in the code. >The standard specifically prohibits it. Due to separate compilation >constraints, the compiler can't check the problem. ... in the general case, yes. In almost all cases the compiler (`linker', if you prefer) can, at least if the Fortran rule against aliasing matches what I think was intended by X3J11's `noalias'. If the rule is stronger than that (I hope not), the `linker' (which should certainly do code generation, if you want any sort of decent inline expansion) can detect all illegal aliasing (again, I hope the rule is not *that* strong!---it would prohibit all sorts of useful tricks that never cause trouble on real hardware). >By the way, the Fortran 8x version of dynamic memory allocation doesn't >have the pointer-to-pointer problem. When you declare an allocatable >array, its dimensionality is part of the declaration. Likewise for gcc: void f(int n, int m) { int a[n][m]; /* or, if you prefer to use malloc: (*a)[m] = (int (*)[m])zalloc(n * m, sizeof(int)) */ ... } Again, this is convenient. Unfortunately, it is also not standard, so it cannot be used in general. If one wants a flat array (because the compiler is unable to vectorise the row-vector version, perhaps), one must use the inconvenient access version, a[j*m + i] (with either the flat allocation or the combined flat-plus-row-vector allocation). When something is convenient, it gets used; when it is inconvenient enough, it does not get used (for instance, I never use Fortran [*]). This gives a slight advantage to the flat-plus-row-vector version: when speed is unimportant (or when your compiler is smart enough), use the convenient row-vector access a[j][i]; when you have to play tricks to convince the compiler to vectorise, use the inconvenient flat acess a[0][j*m + i]. The problems with the f-p-r-v version are that the allocator is more complex (minor: it is only written once) and that using it in two different ways is confusing (major: this must constantly be documented). ----- [*] Bet you took this one wrong! Instead, when I must, I use Ratfor. :-) -- In-Real-Life: Chris Torek, Univ of MD Comp Sci Dept (+1 301 454 7163) Domain: chris@mimsy.umd.edu Path: uunet!mimsy!chris
dik@cwi.nl (Dik T. Winter) (07/05/88)
In article <12302@mimsy.UUCP> chris@mimsy.UUCP (Chris Torek) writes: > Again, this is convenient. Unfortunately, it is also not standard, so > it cannot be used in general. If one wants a flat array (because the > compiler is unable to vectorise the row-vector version, perhaps), one > must use the inconvenient access version, a[j*m + i] (with either the > flat allocation or the combined flat-plus-row-vector allocation). Se essentially there are two ways to do the thing in C. Now what if I am writing a portable library, should I make two versions (one for vector machines and the other for smart compilers)? This is a bit ridiculous, so we will get libraries (like IMSL, NAG and packages as LINPACK etc.) in only one version. And if the programmer started with the other version, but found he needed a library routine, he is in trouble. The big point is: you are not going to write a complete numerical system from scratch, so you have no options; it is the library that dictates the option (which is most likeley a flat array). -- dik t. winter, cwi, amsterdam, nederland INTERNET : dik@cwi.nl BITNET/EARN: dik@mcvax
smryan@garth.UUCP (Steven Ryan) (07/05/88)
>A few points concerning the everpresent comparisons between C and Fortran. >(I like *both*!!!) I dislike both. >>However convenient it might be, recursion is not strictly necessary in >>practical programs.... Practical as in how many Fortran programmers need indefinite looping.
jlg@beta.lanl.gov (Jim Giles) (07/05/88)
I guess my major complaint against C multi-dimensioned arrays is that the reference a[i][j] means different things depending upon the decl- aration of a. If these differences were transparent at the source level it wouldn't be so bad. But the user is REQUIRED to be aware of the differences. In Chris Torek's terms, the difference between 'flat' and 'row-vector' multi-dimensioned arrays requires explicit different treatment by user code - in spite of identical syntax. Fortran is superior in this case in that all arrays are 'flat' - in Fortran 8x this includes dynamically allocated arrays. In article <12302@mimsy.UUCP>, chris@mimsy.UUCP (Chris Torek) writes: > [...] > I think that the noalias directive damages C far more than it improves it, > just as I think that rule in Fortran damages Fortran. > The rule against aliasing in Fortran doesn't hurt it because all arrays are 'flat' to begin with. If C had a way of declaring 'flat' arrays dynamically (like Fortran 8x and most large Fortran environments have) it wouldn't need 'noalias' either. (I declare dynamic arrays in Fortran all the time. It's non-standard, but it can be done in some way on all systems I now use.) To be sure, Fortran probably needs some way of allowing aliasing. But I think it should be with an explicit user request (an ALIAS statement) rather than the default assumed. Most frequently, aliasing is accidental, deliberate aliasing is comparatively rare. The rare usage is the one that should require extra oversite from the user. (And C DOES need 'noalias' in order to get a functionality that is otherwise difficult. A directive which gives extra flexibility to the user without an undue burden on those that don't use it is not really damaging to the language.) > [...] When > something is convenient, it gets used; when it is inconvenient enough, > it does not get used (for instance, I never use Fortran [*]). Hmm.... That's just exactly the reason that I rarely use C. > [*] Bet you took this one wrong! Instead, when I must, I use Ratfor. :-) Hmm.... I don't use RATFOR either (at least not for many years - it produces TERRIBLE code). Still, I would like a version of Fortran which allowed INTERNAL procedures - the ONLY useful item in RATFOR. > [...] This > gives a slight advantage to the flat-plus-row-vector version: when > speed is unimportant (or when your compiler is smart enough), use the > convenient row-vector access a[j][i]; when you have to play tricks to > convince the compiler to vectorise, use the inconvenient flat acess > a[0][j*m + i]. This last is just the sort of thing that simplistic compilers have trouble vectorizing. The statement: arr[4][i] = arr[5][i] turns into: *(M+i) = *(N+i) where M and N are the result of constant folding (of the array address and the product of the dimension size with either 4 or 5 respectively). If this is in a loop on i, the first version (with the syntax clearly displayed) is obviously free of dependencies. The second version requires additional calculation from the compiler to determine dependencies. In a more complicated expression, this additional work may be beyond the ability of the compiler. The rumor is that this very example killed the early Cray compiler (ie. the compiler incorrectly identified it as a dependency). Actually, I have no idea whether the Cray C compiler does this correctly even now - even for statically allocated arrays. It seems very strange to support a 'feature' which requires you to make dependency analysis harder in order to vectorize at all. J. Giles Los Alamos
smryan@garth.UUCP (Steven Ryan) (07/06/88)
In Algorithmic Language and Program Development, Bauer and Wossner, they cite Paterson and Hewitt as proving, essentially, recursion is more powerful than iteration. > But I've come across a class of >problem which can only be generally handled through recursion, I think. I'd >be intrigued if someone can come up with a Fortran solution. I don't want to be in position of proving various examples can be done without recursion, given the above statement and my own opinions. However this example is interesting. A recursive solution assumes the existence of a sufficiently large stack. If this storage is made available to Fortran program (perhaps as blank common), a nonrecursive solution would store the loop parameters as array elements: while still-something-do do-it increment-lowest-stack-entry while it-overflows reset-it increment-the-next-lowest-stack-entry
rob@kaa.eng.ohio-state.edu (Rob Carriere) (07/06/88)
In article <879@garth.UUCP> smryan@garth.UUCP (Steven Ryan) writes: >In Algorithmic Language and Program Development, Bauer and Wossner, they cite >Paterson and Hewitt as proving, essentially, recursion is more powerful than >iteration. > I don't know the learned gentlemen, but it is certainly true that for anything that can run on a computer, recursion is *at most* as strong as iteration. Proof: the CPU is an iterative device, not a recursive one (it iterates through its' microcode or eqv) and it can excecute your program. Since a computer with unbounded memory is as strong as a Universal Turing machine, I have this nagging feeling that the proof of P&H was intended for a much more restricted environment than either B&W or the poster are (is) assuming. If I'm wrong, *do* tell me, this sounds interesting. Rob Carriere "He iterated until he had no recourse but to recurse his fate."
smryan@garth.UUCP (Steven Ryan) (07/07/88)
>>In Algorithmic Language and Program Development, Bauer and Wossner, they cite >>Paterson and Hewitt as proving, essentially, recursion is more powerful than >>iteration. >> >I don't know the learned gentlemen, but it is certainly true that for >anything that can run on a computer, recursion is *at most* as strong >as iteration. I will step aside. The bibliography reference is, I think, Paterson M S, Hewitt C E (1970): Comparative Schematology. Record of the Project MAC Conference on Concurrent Systems and Parallel Computation, Woods Hole, Mass, 1970. New York: ACM 1970, p. 119-127 Have at it.
atbowler@watmath.waterloo.edu (Alan T. Bowler [SDG]) (07/15/88)
In article <817@garth.UUCP> smryan@garth.UUCP (Steven Ryan) writes: > >By only offering static allocation, the compiler does not have to support >any overhead at all. A C program must keep the stack pointer correctly >aligned and rest of the protocol straight even if it doesn't use the stack >because somebody it calls may use the stack. Similarily, in supporting >recursion, the stack has to be maintained in case some called program needs >the stack. Further, supporting recursion means all automatic variables must >be mapped into the stack because the procedure may recurse indirectly. >Forbidding recursion means the compiler can map automatic variables into >static areas. > Then again depending on the machine archetecture, access to something mapped to a stack frame may be cheaper and faster than statically allocated memory. It depends on the addressing mechanisms of the machine, and the skill of whoever does the stack design and call/return mechanism. The blanket statement that stack accesses were more expensive than static memory accesses was true in the past, but stopped being universally true sometime in the 1960's. As hardware caches become more important, and as addressing extensions are added to increase the address space of existing architectures, the tradoff between stack and static allocation changes.
smryan@garth.UUCP (Steven Ryan) (07/17/88)
[Suddenly, a pittrap opens beneath our hero's toes. From the darkness are heard faint cries of `my mainframe is bigger than your mainframe.' He neatly pirouettes out of danger.] >Then again depending on the machine archetecture, access to something >mapped to a stack frame may be cheaper and faster than statically >... I see three possibilities: (1) On the hardware, static nonrecursive code is faster than stack-based recursive code. Fortran wins; C loses. (2) Static and stack code are equally well supported. Fortran wins; C wins. (3) Stack code is faster than nonrecursive static code. C wins. Also, Fortran wins. Although a standard conforming *program* cannot be recursive, the implementation is unrestricted. A compiler can map a program into static nonrecursive code, stack-based recursive code, heap-based multithreaded recursive and reentrant code, or anything else as long as it accepts nonrecursive code. Is this cheating? Well..... This is a case of underconstraining the solution so that the implementor has greater freedom of action. A Fortran compiler writer has that many fewer constraints than C hence that much more freedom. The Fortran compiler may not be better but certainly will not be worse. This is similar to the alias ban in Fortran. [NO! I'm starting that again. This is just a fer instance, not an agreement/disagreement.] The Fortran optimiser can depend on the alias ban to produce better code than C. Doesn't have to, and it won't effect a conforming program either way. In general, the fewer constraints that are put on a compiler, or any other program, the more freedom the programmer has for getting any particular implementation fast. Hafa an godne daege. sm ryan ---------------------------------------- Bucky Bug sez: Poor Mother Earth.. No one takes care of her in her old age.. - Odd Bodkins
smryan@garth.UUCP (Steven Ryan) (07/17/88)
>This is similar to the alias ban in Fortran. [NO! I'm starting that again.
                                                      ^
                                            [NO! I'm NOT starting that again.
Oh, well.