wcs@alice.UUCP (Bill Stewart, usually) (12/19/88)
Are there any machines that implement quad-precision (128-bit) floating point numbers in hardware? Are there any commercial Fortran compilers that support it? (e.g. IBM 370 Fortran-H) We've had a couple applications that have come by recently that want it, and I had never known anyone supported it. (Will I have to write an extended-precision package? We probably don't have the information to do numerical analysis on this code to see if they really needed it.) Thanks; Bill -- # Thanks; # Bill Stewart, att!ho95c!wcs, AT&T Bell Labs Holmdel NJ 1-201-949-0705
mac3n@babbage.acc.virginia.edu (Alex Colvin) (12/19/88)
In article <8561@alice.UUCP>, wcs@alice.UUCP (Bill Stewart, usually) writes: > Are there any machines that implement quad-precision (128-bit) floating > point numbers in hardware? Are there any commercial Fortran compilers Honeywell(-Bull?) DPS series do 100-decimal-digit floating point in hard(firm?)ware. Probably not available directly in fortran. try calling PL/I. for that matter, try PL/I on your local mainframe.
ags@s.cc.purdue.edu (Dave Seaman) (12/19/88)
In article <8561@alice.UUCP> wcs@alice.UUCP (Bill Stewart, usually) writes: >Are there any machines that implement quad-precision (128-bit) floating >point numbers in hardware? Are there any commercial Fortran compilers >that support it? (e.g. IBM 370 Fortran-H) We've had a couple >applications that have come by recently that want it, and I had never >known anyone supported it. Basically all of the CDC, ETA, and Cray machines support 128-bit floating point numbers, but it is called double precision, not quad precision. Single precision on those machines is 64 bits. Double precision is, of course, supported by the Fortran compilers. Hardware support comes in all sorts of flavors. All the machines I mentioned obviously have hardware instructions that make 128-bit floats possible, but I don't think any of them make it quite as convenient as 64-bit floats when you look at the instructions that are generated. -- Dave Seaman ags@j.cc.purdue.edu
JONESD@kcgl1.eng.ohio-state.edu (David Jones) (12/19/88)
Some VAXes supports 128 bit floating point in hardware (they call the data type H_floating). The VAX-11 FORTRAN compiler supports H-floating data as REAL*16. David L. Jones | Phone: (614) 292-6929 Ohio State Unviversity | Internet: 1971 Neil Ave. Rm. 406 | jonesd@kcgl1.eng.ohio-state.edu Columbus, OH 43210 | jones-d@eng.ohio-state.edu Disclaimer: A repudiation of a claim.
schwartz@shire.cs.psu.edu (Scott Schwartz) (12/20/88)
In article <8561@alice.UUCP>, wcs@alice (Bill Stewart, usually) writes: >Are there any machines that implement quad-precision (128-bit) floating >point numbers in hardware? Are there any commercial Fortran compilers >that support it? The Prime x50 series had this, and their f77 compiler supported it, in at least one instruction mode. -- Scott Schwartz <schwartz@shire.cs.psu.edu>
mccalpin@loligo.fsu.edu (John McCalpin) (12/20/88)
In article <3688@s.cc.purdue.edu> ags@s.cc.purdue.edu (Dave Seaman) writes: >In article <8561@alice.UUCP> wcs@alice.UUCP (Bill Stewart, usually) writes: >>Are there any machines that implement quad-precision (128-bit) floating >>point numbers in hardware? >Basically all of the CDC, ETA, and Cray machines support 128-bit floating >point numbers, but it is called double precision, not quad precision. >Dave Seaman ags@j.cc.purdue.edu Although it is true that CDC/ETA and Cray machines have the ability to do 128-bit arithmetic, you probably don't want to do it on these machines. The relative speed of 64-bit vector instructions vs 128-bit instructions (which cannot currently be vectorized) is typically very close to 100:1 on both the CDC/ETA and Cray machines. Of course, if your code is all scalar anyway, then the performance degradation will not be so severe. It is theoretically possible to re-write the microcode in the Cyber 205 vector pipes to perform 128-bit vector instructions, but CDC has decided that there is not much of a market for it. This same approach cannot be used on the ETA-10, since the instructions are all in hardware -- which makes debugging the "microcode" a rather painful experience. John D. McCalpin mccalpin@masig1.ocean.fsu.edu mccalpin@nu.cs.fsu.edu mccalpin@fsu (BITNET or MFENET)
reeder@ut-emx.UUCP (William P. Reeder) (12/21/88)
In article <3688@s.cc.purdue.edu> ags@s.cc.purdue.edu (Dave Seaman) writes: >In article <8561@alice.UUCP> wcs@alice.UUCP (Bill Stewart, usually) writes: >>Are there any machines that implement quad-precision (128-bit) floating >>point numbers in hardware? >Basically all of the CDC, ETA, and Cray machines support 128-bit floating >point numbers, but it is called double precision, not quad precision. >Dave Seaman ags@j.cc.purdue.edu Sure, they support them, but would you say they support them "in hardware"? I have used/programmed both a CDC 170/750 and a Cray X/MP-24 (in FORTRAN and in assembly). The CDC machine had a 96-bit accumulator used by all f.p. instructions. Some instructions (FXi) performed an arithmetic operation and returned the upper 48-bits of the result, others (DXi) returned the lower 48-bits. The operands were always single-precision floating-point values (in registers). So, for example, to get the double precision sum of two (single-precision) values required two instructions, an FXi and a DXi. Unfortunately, when I think of double-precision I expect that I should be able to (for example) add two double-precision operands and get a double precision result. To do this on the CDC would require a function or subroutine, it is not provided *in hardware*. The X/MP is even worse, it does not even have the equivalent of the DXi instructions found on the CDC's. As a result, double-precision computations are done entirely in software (meaning without the benefit of any special dp hardware) and are anywhere from 30 to 90 times slower than the corresponding single-precision operations. This is probably the reason dp operations can't be vectorized (as someone pointed out in an earlier posting). What about newer Crays? Or the 2? William Reeder University of Texas Computation Center reeder@emx.utexas.edu -- DISCLAIMER: I speak only for myself, and usually only to myself.
nelson@udel.EDU (Mark Nelson) (12/21/88)
In article <285@loligo.fsu.edu> mccalpin@masig1.ocean.fsu.edu (John D. McCalpin) writes: > >Although it is true that CDC/ETA and Cray machines have the ability to >do 128-bit arithmetic, you probably don't want to do it on these machines. >The relative speed of 64-bit vector instructions vs 128-bit instructions >(which cannot currently be vectorized) is typically very close to 100:1 >on both the CDC/ETA and Cray machines. Of course, if your code is all >scalar anyway, then the performance degradation will not be so severe. > If I remember correctly from my days at CDC working with the 205, the floating point hardware actually produced 128 bit results. There were different versions of the floating point instructions that returned different (64-bit) parts of this 128 bit result. The normal instructions used were the ones that returned a normalized 64 bit result, but for add and multiply multiply (maybe divide ?) there were instructions to return either the most significant or the least significant 64 bits. I am quite sure that these instructions existed in both scalar and vector forms. These instructions would allow you to add 128 bit numbers with four additions, and this certainly vectorizes. Mark Nelson nelson@dewey.udel.edu
dik@cwi.nl (Dik T. Winter) (12/21/88)
(Of course 64 bit means 48 bit mantissa, 128 bit means 96 bit mantissa.) In article <6053@louie.udel.EDU> nelson@udel.EDU (Mark Nelson) writes: > In article <285@loligo.fsu.edu> mccalpin@masig1.ocean.fsu.edu (John D. McCalpin) writes: > > > >The relative speed of 64-bit vector instructions vs 128-bit instructions > >(which cannot currently be vectorized) is typically very close to 100:1 ^ ^ ^ ^ ^ ^ ^ ^ ^ ^ ^ ^ ^ ^ ^ ^ ^ ^ (see comment below for the 205.) > >on both the CDC/ETA and Cray machines. > > > If I remember correctly from my days at CDC working with the 205, the > floating point hardware actually produced 128 bit results. There were > different versions of the floating point instructions that returned > different (64-bit) parts of this 128 bit result. The normal instructions > used were the ones that returned a normalized 64 bit result, but for Right, and normalization is done wrong (i.e. first truncate than normalize rather than the other way around). > add and multiply multiply (maybe divide ?) there were instructions to Divide not. > return either the most significant or the least significant 64 bits. This is all very similar to other CDC Cybers. > I am quite sure that these instructions existed in both scalar and > vector forms. These instructions would allow you to add 128 bit > numbers with four additions, and this certainly vectorizes. Not four, you have to do some additional normalizations on the way, but it is close. Question: why does the 205 not vectorize? Simple, vectorization means a stride of 2 (ie. succesive vector elements are separated by one memory word), and that is not like you would want it on the 205. For similar reasons early compilers did not vectorize operations on complex. -- dik t. winter, cwi, amsterdam, nederland INTERNET : dik@cwi.nl BITNET/EARN: dik@mcvax
PLS@cup.portal.com (Paul L Schauble) (12/21/88)
The Honeywell-Bull DPS90 and DPS9000, both current production models, implement floating point using a 36 bit word. They have hardware support for single, double, and quad (144 bit) floating point. The Fortran compiler has full support for this, including math library. Mail me if you want more info. ++PLS
cik@l.cc.purdue.edu (Herman Rubin) (12/21/88)
In article <8899@ut-emx.UUCP>, reeder@ut-emx.UUCP (William P. Reeder) writes: > In article <3688@s.cc.purdue.edu> ags@s.cc.purdue.edu (Dave Seaman) writes: < <In article <8561@alice.UUCP> wcs@alice.UUCP (Bill Stewart, usually) writes: < <>Are there any machines that implement quad-precision (128-bit) floating < <>point numbers in hardware? < <Basically all of the CDC, ETA, and Cray machines support 128-bit floating < <point numbers, but it is called double precision, not quad precision. < <Dave Seaman ags@j.cc.purdue.edu The CDC6000 series and related Cybers, and the CRAYs have 48-bit mantissas; the Cyber 205 and ETA has essentially 47 bits. The CRAYs do not have the necessary hardware to make multiple precision reasonable, while the others have the hardware to make approximately twice that precision relatively easy. For more precision, it is necessary to integerize the floating point procedures. > Sure, they support them, but would you say they support them "in hardware"? > > I have used/programmed both a CDC 170/750 and a Cray X/MP-24 (in FORTRAN > and in assembly). > > The CDC machine had a 96-bit accumulator used by all f.p. instructions. > Some instructions (FXi) performed an arithmetic operation and returned the > upper 48-bits of the result, others (DXi) returned the lower 48-bits. The > operands were always single-precision floating-point values (in > registers). So, for example, to get the double precision sum of two > (single-precision) values required two instructions, an FXi and a DXi. > Unfortunately, when I think of double-precision I expect that I should be > able to (for example) add two double-precision operands and get a double > precision result. To do this on the CDC would require a function or > subroutine, it is not provided *in hardware*. Still, this helps greatly in getting roughly twice the accuracy. I would expect to lose one bit of accuracy at most in the program. If the operations above were included, the cycle begins again; what if we want twice that precision? And without additional operations, the situation would be as bad as on the CRAYs; see below. > The X/MP is even worse, it does not even have the equivalent of the DXi > instructions found on the CDC's. As a result, double-precision > computations are done entirely in software (meaning without the benefit of > any special dp hardware) and are anywhere from 30 to 90 times slower than > the corresponding single-precision operations. This is probably the > reason dp operations can't be vectorized (as someone pointed out in an > earlier posting). It is not all that difficult to vectorize multiple precision operations on the CRAYs, providing the precision is fixed. Because of the lack of good multiplication operations on those machines, numbers must be broken into 24-bit mantissas; this is why it is so slow. But even for greater precision, where the operations must be essentially done in integerized floating point, fixed precision can be vectorized, albeit with difficulty. -- Herman Rubin, Dept. of Statistics, Purdue Univ., West Lafayette IN47907 Phone: (317)494-6054 hrubin@l.cc.purdue.edu (Internet, bitnet, UUCP)
mccalpin@loligo.fsu.edu (John McCalpin) (12/21/88)
I previously wrote that 128-bit floating-point arithmetic did not vectorize on the Cyber 205/ETA-10. In response: In article <6053@louie.udel.EDU> nelson@udel.EDU (Mark Nelson) writes: >If I remember correctly from my days at CDC working with the 205, >...(t)here were different versions of the floating point instructions >that returned different (64-bit) parts of this 128 bit result. ...stuff deleted... >These instructions would allow you to add 128 bit >numbers with four additions, and this certainly vectorizes. >Mark Nelson >nelson@dewey.udel.edu The above is mostly correct. The multiply/add upper and lower instructions do exist in vector forms, but a lot more than 4 are required to get the complete result. Daan Sandee of CDC and SCRI has vectorized the linked triad operation in 128 bits, and I believe that it required about 30 passes through the pipe, as opposed to 1 pass for the 64-bit linked triad. If this recollection is correct, then the 128-bit can be as fast as 1/30 of the 64-bit speed, rather than the 1/100 which is typically obtained using scalar code. Clarifications, Daan ? John D. McCalpin mccalpin@masig1.ocean.fsu.edu mccalpin@fsu (BITNET or MFENET)
kurtk@tekcae.CAX.TEK.COM (Kurt Krueger) (12/22/88)
Lots of deleted stuff about CDC doing double precision (128 bit) floating
arithmetic with two seperate instructions ....
The new 180 series machines DO have a real full double precision (128 bit)
machine instruction set for the four basic arithmetic ops. The interesting
thing is that running that machine in the 170 mode (with the FXi and DXi
instructions) would slightly beat the machine in the 180 mode. Which goes
to show that the mere presence of 128 bit instructions doesn't automatically
make for speed.
RISC - Seymour was right!
________________________________________________________________________________
|
kurtk@tekcae (Kurt Krueger) | Everything runs on smoke. When the
Electrical Simulation Group (ECAX) | smoke leaks out, it stops working.
D.S. 59-432 627-4363 |
________________________________________|_______________________________________
smryan@garth.UUCP (Xxxxxx Xxxx) (12/22/88)
>If I remember correctly from my days at CDC working with the 205, the >floating point hardware actually produced 128 bit results. There were >different versions of the floating point instructions that returned >different (64-bit) parts of this 128 bit result. The normal instructions The 205 uses `significant zeros.' The machine is set up to treat, say, .123-.122 as .001 rather .1xxe-2, where xx are insignificant bits from normalisation. So there add-upper which returns the 16-bit exponent+48-bit top half of the accumulator after such an add, and an add-lower which returns the bottom half. An add-normalise does normalising shifts and returns the upper 48 bits of that. So if you want to do signficant adds rather than normalised adds, it's very easy. Unfortunately someone (customers?) decided that double precision adds must be normalised, so that what would have been a very simple routine ended up being very complicated, a dozen instructions and still maybe not quite right. >add and multiply multiply (maybe divide ?) there were instructions to Divide-upper (used for integer divides) and divide-significant. >I am quite sure that these instructions existed in both scalar and >vector forms. Yes. > These instructions would allow you to add 128 bit >numbers with four additions, and this certainly vectorizes. Double precision is not vectorised because: - the compiler could only produce one temporary vector for each operator and normalised double precision would require many more. - no library vector routines existed; these did exist for complex, hence complex does vectorise. - insufficient apparent demand from the customers to change either of the above. -- The maggot spawn of Ymir's flesh -- s m ryan whose lust of gold is trapping mesh, have lost their hoard to gods and men, -- Andwari's Gem now hid in river, mine, or fen. -- 1/1
rchrd@well.UUCP (Richard Friedman) (12/24/88)
The NEC SX-2 supports 128 bit quad precision in hardware. The floating point format is IBM, tho, so the resulting values are about 33 significant ecimal digits in the dynamic range 10**-79 to 10**75. THey also support complex double precision and complex quad precision (256 bits). But I am not sure how much of this is in hardware or software. I should also mention that on average, the Cray systems are significantly slower trying to do double precision in software, than straight single precision. One of the first optimization techniques we teach in our seminars is to remove all use of double precision (except where absolutely needed in critical calculations).. Luckily Cray compilers have a compile time option that causes the compiler to ignore all double precision declarations! So, in many cases, with one change onthe call to the compiler we can speed up programs sometimes by a factor of 10 or more !! -- ...Richard Friedman [rchrd] Berkeley, CA. rchrd@well.uucp -or- {ucbvax,lll-lcc,pacbell,hplabs}!well!rchrd
seanf@sco.COM (Sean Fagan) (12/24/88)
In article <8561@alice.UUCP> wcs@alice.UUCP (Bill Stewart, usually) writes: >Are there any machines that implement quad-precision (128-bit) floating >point numbers in hardware? Yes. Cyber's, which have 60-bit single precision, have 120-bit double precision. Their decendents, the Cray's, have 64-bit single precision, and 128-bit double precision. They are, however, probably a bit much for you. An ETA-10 is also a 64-bit s.p. machine, and I believe it is also a 128-bit d.p. machine. And, yes, the FORTRAN compilers on all of these machines support the 128-bit stuff. -- Sean Eric Fagan | "Merry Christmas, drive carefully and have some great sex." seanf@sco.UUCP | -- Art Hoppe (408) 458-1422 | Any opinions expressed are my own, not my employers'.
urjlew@ecsvax.uncecs.edu (Rostyk Lewyckyj) (12/26/88)
Since none of the supercomputers i.e. Cray, ETA, CONVEX, (By definition of most who claim to be supercomputer experts, IBM does not qualify ) provide quadruple precision hardware support. (Cray double precision is 96 bits. IBM Q precision with some hardware support is 112 bits.), there is by common concensus no need for reasonable speed Q precision calculations. So you should either find another way of doing it, after all extended accuracy is less important than speed, as evidenced by the great emphasis placed on Cray speed and none placed on the other companies arithmetic libraries, OR you may need to do your work on that off brand IBM equipment.
cik@l.cc.purdue.edu (Herman Rubin) (12/26/88)
In article <6132@ecsvax.uncecs.edu>, urjlew@ecsvax.uncecs.edu (Rostyk Lewyckyj) writes: > Since none of the supercomputers i.e. Cray, ETA, CONVEX, > (By definition of most who claim to be supercomputer experts, > IBM does not qualify ) provide quadruple precision hardware > support. (Cray double precision is 96 bits. IBM Q precision > with some hardware support is 112 bits.), there is by common > concensus no need for reasonable speed Q precision calculations. > So you should either find another way of doing it, after all > extended accuracy is less important than speed, as evidenced > by the great emphasis placed on Cray speed and none placed on > the other companies arithmetic libraries, OR you may need > to do your work on that off brand IBM equipment. This insistence that the hardware and software provided is the best possible, and that if it is not there one should not want it, is totalitarian and stupid. What is needed is input from those who can think and can find uses for operations which can be efficiently done in hardware, but not efficiently in software, to make these needs known to the manufacturers. There is need for the software operations which those who understand the use of bit patterns find useful to be included in the languages. That these operations are not much used is no evidence. If I know that a square root is 10 times as expensive as a division, and 60 times as expensive as a multiplication, you can be sure that I will choose my algorithms with this in mind. If I know that Cray double precision is many times as expensive as single precision, I may come up with tricky algorithms to get around it. If the Cray had an additional operation to get the least significant part of a product, it would probably speed up multiple precision by at least a factor of 5. And sometimes this is needed for other purposes. I know of no computer reasonably designed for multiple precision operations. What is needed is fast unsigned integer arithmetic. The hardware for integer multiplication is certainly no more difficult than for floating point, and for unsigned actually easier than signed.. With the CPU such a small price of the computer, get those instructions in. -- Herman Rubin, Dept. of Statistics, Purdue Univ., West Lafayette IN47907 Phone: (317)494-6054 hrubin@l.cc.purdue.edu (Internet, bitnet, UUCP)
nmm@cl.cam.ac.uk (Nick Maclaren) (12/28/88)
>In article <8561@alice.UUCP> wcs@alice.UUCP (Bill Stewart, usually) writes: >>Are there any machines that implement quad-precision (128-bit) floating >>point numbers in hardware? Not all items reach us here, so this may have been said before. The original item referred to the bundled Big Blue compilers (Fortran H etc.), which have been dead for many years. However, I have not seen IBM mentioned, but their compilers have supported REAL*16 for 15 years now. All of the larger 370 series had hardware REAL*16 almost as standard, except for division. This has been supported by the operating system since at least MVT release 20 and the first version of Fortran H Extended, and the latter has simulated the instructions on machines without the hardware (by interrupt). There were a good many bugs in the early libraries, but I believe that they were sorted out a long time back (we don't use REAL*16 much). VS Fortran (their current compiler) supports REAL*16, both in hardware and emulation, and it is available in hardware on all of their larger machines as standard. Hardware support for REAL*16 division came in with the operating system MVS/XA and not with a particular item of hardware (because it is a microcode change, not a piece of hardware). Nick Maclaren University of Cambridge Computer Laboratory
phipps@garth.UUCP (Clay Phipps) (12/29/88)
In article <2334@tekcae.CAX.TEK.COM>, kurtk@tekcae.CAX.TEK.COM (Kurt Krueger) writes: >Lots of deleted stuff about CDC doing double precision (128 bit) floating >arithmetic with two seperate instructions .... > >The new 180 series machines DO have a real full double precision >(128 bit) machine instruction set for the four basic arithmetic ops. Even the top-of-the-line C180 model 99_ perform their 128-bit floating-point operations in microcode. The 64-bit floating-point is done in hardware. Not exactly the choice that one would expect from a company with the tradition of CDC, but that's what its management decided to do. >The interesting thing is that running that machine in the 170 mode >(with the FXi and DXi instructions) would slightly beat the machine >in the 180 mode. Which goes to show that the mere presence of >128 bit instructions doesn't automatically make for speed. I don't know whether the FXi and DXi were done in hardware; if they were, that might have provided a sufficient edge. -- [The foregoing may or may not represent the position, if any, of my employer] Clay Phipps {ingr,pyramid,sri-unix!hplabs}!garth!phipps Intergraph APD, 2400#4 Geng Road, Palo Alto, CA 93403 415/494-8800
bga@raspail.UUCP (Bruce Albrecht) (12/29/88)
In article <1973@scolex>, seanf@sco.COM (Sean Fagan) writes: > In article <8561@alice.UUCP> wcs@alice.UUCP (Bill Stewart, usually) writes: > >Are there any machines that implement quad-precision (128-bit) floating > >point numbers in hardware? > > Yes. Cyber's, which have 60-bit single precision, have 120-bit double > precision. Their decendents, the Cray's, have 64-bit single precision, and > 128-bit double precision. They are, however, probably a bit much for you. The Cyber 180's have 64 bit words, with 128 bit quad. I believe the 930, roughly comparable to a high-end Vax, is priced under $100k.
seanf@sco.COM (Sean Fagan) (01/01/89)
In article <2334@tekcae.CAX.TEK.COM> kurtk@tekcae.CAX.TEK.COM (Kurt Krueger) writes: >The new 180 series machines DO have a real full double precision (128 bit) >machine instruction set for the four basic arithmetic ops. The interesting >thing is that running that machine in the 170 mode (with the FXi and DXi >instructions) would slightly beat the machine in the 180 mode. Which goes >to show that the mere presence of 128 bit instructions doesn't automatically >make for speed. >RISC - Seymour was right! Hold on a second. A) Which model? Performance in 170-state is different on a 830 than on a 995E, so that could have something to do with it. Also, how about interrupts and what not? In 180-state, you have to worry about page faults, but not while in 170-state, and that could have something to do with it. Actually, I think the main advantage is that 170-state microcode is *much* easier to deal with than 180-state microcode, and that could be it. -- Sean Eric Fagan | "Merry Christmas, drive carefully and have some great sex." seanf@sco.UUCP | -- Art Hoppe (408) 458-1422 | Any opinions expressed are my own, not my employers'.
colwell@mfci.UUCP (Robert Colwell) (01/02/89)
In article <1080@l.cc.purdue.edu> cik@l.cc.purdue.edu (Herman Rubin) writes: >I know of no computer reasonably designed for multiple precision operations. >What is needed is fast unsigned integer arithmetic. The hardware for >integer multiplication is certainly no more difficult than for floating >point, and for unsigned actually easier than signed.. With the CPU such >a small price of the computer, get those instructions in. The fractional cost of the hardware is indeed insignificant. But the fractional cost of designing it in, getting it right, supporting it, implementing the proper compiler support, and testing it, must all be paid for by increased sales. And the apparent collective vote of all the hardware manufacturers is what you're seeing when you point to the lack of this HW support -- we don't see a large market for multiple precision operations. That's really the only way you're going to get any attention to this issue, by proving that there's a market for it. Bob Colwell ..!uunet!mfci!colwell Multiflow Computer or colwell@multiflow.com 175 N. Main St. Branford, CT 06405 203-488-6090
gould@pilot.njin.net (Brian Jay Gould) (01/04/89)
I've seen some references to Cray and Cyber 205s relating to quad precision. The Cray manuals (X-MP and Y-MP) state that double precision (128 bit) is done by software emulation. The ETA/CDC manuals state that hardware can perform both full (64 bit) and half precision. The HNS (formerly Honeywell-NEC Systems) SX-2 *does* 128 bit floating-point in HARDWARE. But, only in the scalar processor. The vector units are limited to 32 and 64 bit precision. --> Any disclaimers, made by me or by anyone on my behalf, may or may not accurately represent my representation of myself or others. -- ------------------------------------------------------------------------- - Brian Jay Gould :: INTERNET gould@jvnca.csc.org BITNET gould@jvncc - - UUCP rutgers!njin!gould Telephone (201) 329-9616 - ------------------------------------------------------------------------s
hmt@raspail.UUCP (Hemi Thaker) (01/07/89)
>In article <2334@tekcae.CAX.TEK.COM>, >kurtk@tekcae.CAX.TEK.COM (Kurt Krueger) writes: >>Lots of deleted stuff about CDC doing double precision (128 bit) floating >>arithmetic with two seperate instructions .... >> >>The new 180 series machines DO have a real full double precision >>(128 bit) machine instruction set for the four basic arithmetic ops. > >Even the top-of-the-line C180 model 99_ perform their 128-bit floating-point >operations in microcode. The 64-bit floating-point is done in hardware. > >Not exactly the choice that one would expect from a company with >the tradition of CDC, but that's what its management decided to do. I'm not sure what you mean by *Even ..model 99_ ..in microcode* but not all C180 machines implement 128-bit floating point in microcode. In fact, the smallest C180 machine, the 930, performs 128 bit add, subtract, and multiply almost entirely in hardware (microcode is only used for 128 bit data transfers using the 64 bit internal bus and has no effect on instruction performance vs. doing the same thing entirely in hardware). 128 bit divide is done mostly in hardware with some microcode for end case checking. Unless we are told otherwise, 64 bit floating point operations are considered to be far more important than 128 bit because of the low frequency of use of 128 bit operationsso making the hardware/microcode tradeoff would seem reasonable to most hardware designers. As the cost of a gate goes down hardware implementations of 128 bit arithmetic are likely to become more common. - Ed Bender Disclaimer : These opinions are my own, and do not necessarily reflect those of my employer.
turk@Apple.COM (Ken "Turk" Turkowski) (01/12/89)
Any machine with double precision can have quad precision floating point. Look at: Linnainmaa, Seppo _Software for Doubled-Precision Floating-Point Computations_ ACM Transactions on Mathematical Software Vol. 7, No. 3, Sept. 1981, pp. 272-283 -- Ken Turkowski @ Apple Computer, Inc., Cupertino, CA Internet: turk@apple.com Applelink: Turkowski1