[comp.lang.misc] Run-time Type Errors in Smalltalk

mario@cs.man.ac.uk (Mario Wolczko) (04/05/91)

In article <887@puck.mrcu>, paj@mrcu (Paul Johnson) writes:
> Sorry, I am still not convinced.  I am currently programming in
> Eiffel, and changing my mind about a type is not something I often do.
> Also the type declarations carry information which affects the
> execution of the rest of the program.  Hence other parts of the
> program will need changing as well.  It seems to me that this will
> only result from a fairly major rewrite of some part of the code.

This could be a self-fulfilling statement, ie you don't change your
mind about types because you know it's going to be hard work to fix
the type declarations up, whereas in a declarationless language
sorting out the declarations doesn't arise.

> Does anyone have any facts relating to the assertion that run-time
> failures due to type errors are rare in Smalltalk development?

I'm not sure what this means in Smalltalk, much less how you measure
it.  You can't just count "doesNotUnderstands", because some of these
are not type errors (such as asking for the tail of an empty list).

As for statistics, do you want "how many Smalltalk programs have ever
given a run-time type error in their whole lifetime, after they were
declared to be `finished'?"  Most Smalltalk programs I have seen are
prototypes or research programs, and are rarely declared "finished",
so I guess this would be close to 100%.  However, if you ask this
question of any language, replacing "type error" with "error", then my
guess is that the answer is the same.  Very few "finished" programs
are bug-free, type-checked or not, so it's clear that type checking is
only addressing a small part of the problem.

Or perhaps you mean "how many runs of Smalltalk programs have resulted in
run-time type errors?".  This must be close to 0%.  Neither of these
figures means very much.  

Speaking from personal experience, I find that after I have written
some Smalltalk code (and I have written >10000 lines in the last 18
months) there are a few residual "type errors" which get caught in the
first few test runs.  The time taken to find and fix these is much
smaller than the time it would take me to declare all the variables in
such a way as to satisfy a type-checker (even if it were possible with
current type systems, which it isn't in many programs).

The real problems that take hours to find and fix are undetectable
using type-checking, such as writing "x+y" instead of "x-y".

Mario Wolczko

   ______      Dept. of Computer Science   Internet:      mario@cs.man.ac.uk
 /~      ~\    The University              uucp:      mcsun!ukc!man.cs!mario
(    __    )   Manchester M13 9PL          JANET:         mario@uk.ac.man.cs
 `-':  :`-'    U.K.                        Tel: +44-61-275 6146  (FAX: 6280)
____;  ;_____________the mushroom project___________________________________

cok@islsun.Kodak.COM (David Cok) (04/05/91)

In article <2367@m1.cs.man.ac.uk> mario@cs.man.ac.uk (Mario Wolczko) writes:
>In article <887@puck.mrcu>, paj@mrcu (Paul Johnson) writes:
>> Sorry, I am still not convinced.  I am currently programming in
>> Eiffel, and changing my mind about a type is not something I often do.
>> Also the type declarations carry information which affects the
>> execution of the rest of the program.  Hence other parts of the
>> program will need changing as well.  It seems to me that this will
>> only result from a fairly major rewrite of some part of the code.
>
>This could be a self-fulfilling statement, ie you don't change your
>mind about types because you know it's going to be hard work to fix
>the type declarations up, whereas in a declarationless language
>sorting out the declarations doesn't arise.

I've never known a case where one had to go through a program and fix up all
the types.  In Fortran or Pascal one might have to, but in C, use descriptive
typedef names -- then all one needs to do to change the types of those 
variables is to change the definition of the typedef.  This is very like 
changing the class definition in any dynamically typed language.

Historical note:
I vaguely remember from some early programming class ages ago that declarations
were touted not as a way to provide type-checking, but to save the
programmer from misspelling variable names (by requiring that each one be
written at least twice in a program)! FORTRAN's implicit declarations
were forbidden in that class.

David R. Cok
Eastman Kodak Company -- Imaging Science Lab
cok@Kodak.COM

new@ee.udel.edu (Darren New) (04/06/91)

In article <1991Apr5.152736.22353@kodak.kodak.com> cok@islsun.Kodak.COM (David Cok) writes:
>I vaguely remember from some early programming class ages ago that declarations
>were touted not as a way to provide type-checking, but to save the
>programmer from misspelling variable names (by requiring that each one be
>written at least twice in a program)! 

This does not preclude dynamic typing.  In Smalltalk, you declare the
names of all your local variables. The first time you compile code with
a new global variable, it confirms that you spelled it right and allows
you to find the "closest fit" name if not (try *that* in your favorite C compiler :-).  Dynamic typing just means that declarations don't include types,
not that they don't declare variables.

>FORTRAN's implicit declarations
>were forbidden in that class.

And how did you know that you didn't implicitly declare a variable
by mistake?   (or do you mean FORTRAN's implicit *statement*?)
	    -- Darren

-- 
--- Darren New --- Grad Student --- CIS --- Univ. of Delaware ---
----- Network Protocols, Graphics, Programming Languages, FDTs -----
  +=+=+ My time is very valuable, but unfortunately only to me +=+=+
+ When you drive screws with a hammer, screwdrivers are unrecognisable +

olson@lear.juliet.ll.mit.edu ( Steve Olson) (04/07/91)

In article <887@puck.mrcu> paj@mrcu (Paul Johnson) writes:
   Does anyone have any facts relating to the assertion that run-time
   failures due to type errors are rare in Smalltalk development?

For years I have been using an object-oriented Lisp (Flavors) to do
things like real-time interpretation of weather radar imagery.  For
the purpose of this discussion Lisp plus Flavors is probably equivalent
to Smalltalk.  These systems have been deployed at several radar sites
scattered across the country for several years.  My only possible reply
is "huh, whats a type error?"  Oh yes! I remember now - a type error is
what happens when you are programming in C and you screw up a cast.  

Seriously, I get the "message not understood" error from time to time, but
99 times out of a hundred, the problem is caught on the first or second
test run.  The other 1 out of hundred?  Well, I still get to diagnose and
fix the problem almost instantly because of the usually informative error
messages.  Much better than the mystery core dump.  I get things which look
like type errors when try to perform math on 'nil' for example, but thats
almost always because I have forgotten to properly calculate the value
of some slot.  Getting a type error in those cases is in fact a tremendous
help over what would happen in a typical statically typed language.

In my opinion, the biggest practical problem with dynamically typed languages
is that it is a pain in the butt to lug around the run time envirnoment.
To run a 1 page Common Lisp program on a UNIX system means I have to fire up a
multi-megabyte run time system which consumes vast amounts of core and virtual
memory.  Yuk.  There are lighter-weight alternatives to Common Lisp, of course,
but you're still going to have some extra baggage to lug around.

People who get endlessly alarmed about the alleged lack of type saftey in
a dynamically-typed languge are not well-informed about how Lisp or Smalltalk
software development actually works.

- Steve Olson
  MIT Lincoln Laboratory
  olson@juliet.ll.mit.edu

diamond@jit345.swstokyo.dec.com (Norman Diamond) (04/08/91)

In article <OLSON.91Apr6172626@lear.juliet.ll.mit.edu> olson@lear.juliet.ll.mit.edu ( Steve Olson) writes:

>In my opinion, the biggest practical problem with dynamically typed languages
>is that it is a pain in the butt to lug around the run time envirnoment.
>To run a 1 page Common Lisp program on a UNIX system means I have to fire up a
>multi-megabyte run time system which consumes vast amounts of core and virtual
>memory.  Yuk.

Exactly.  The multi-megabyte run time system provides the "message not
understood" message and nice debugger instead of mystery core dump.
The multi-megabyte run time system would be useful while developing
programs in statically typed languages too, and then you'd never see
the mystery core dump.  Debates between static and dynamic typing should
center on the advantages of programming in each kind of language, not
on what can be done with multi-megabyte support packages.
--
Norman Diamond       diamond@tkov50.enet.dec.com
If this were the company's opinion, I wouldn't be allowed to post it.

new@ee.udel.edu (Darren New) (04/09/91)

In article <OLSON.91Apr6172626@lear.juliet.ll.mit.edu> olson@lear.juliet.ll.mit.edu ( Steve Olson) writes:
>People who get endlessly alarmed about the alleged lack of type saftey in
>a dynamically-typed languge are not well-informed about how Lisp or Smalltalk
>software development actually works.

Here, Here!  I'll second that.  And people who say "Well, you could do it 
just as well in a statically typed language if you had as big a runtime
system" I'll answer by saying "why do *most* dynamic languages have huge
libraries and great debuggers and *most* static languages have lousy
libraries and lousy debuggers?"  I suspect that because the static code
is so much less reusable that the incentive is to never change the
late-60's model of programming lest we break all that software.
People rave about GDB, but it looks worse to me than Delta on a
1968 release of CP-V.  Then look at the Smalltalk debugger, which
is not very big because it resuses most of the code from the editor,
browser, and compiler, and it blows any other debugger out of the water.
	       -- Darren

-- 
--- Darren New --- Grad Student --- CIS --- Univ. of Delaware ---
----- Network Protocols, Graphics, Programming Languages, FDTs -----
  +=+=+ My time is very valuable, but unfortunately only to me +=+=+
+ When you drive screws with a hammer, screwdrivers are unrecognisable +

guido@cwi.nl (Guido van Rossum) (04/09/91)

Norman Diamond writes:

>The multi-megabyte run time system would be useful while developing
>programs in statically typed languages too, and then you'd never see
>the mystery core dump.  Debates between static and dynamic typing should
>center on the advantages of programming in each kind of language, not
>on what can be done with multi-megabyte support packages.

Maybe I can provide something of a data point here: a comparison of
two languages that differ more in their typing system than in other
aspects.  If you know ABC and Python you can skip to DISCUSSION at the end.

Python, which has dynamic typing, was strongly inspired by ABC, which has
static typing.  In other aspects, the two are much more similar then
your average pair of statically/dynamically typed languages: both are
interactive and interpreted, and have large runtime packages that
replace core dumps by understandable error messages (really exceptions
for Python).  Both runtime packages implement very-high level data
types with roughly the same functionality (only one has bignums --
oddly enough it's the statically typed one).  Neither requires or
allows declarations, both have polymorphic functions.

And now the differences: the type system of ABC determines the type of
parameters and variables by inspecting how they are used -- e.g., if
it is added to something, it must be a number (there's only one
numeric type in ABC), if it is inserted in a list, it must have the
same type as the elements of that list, etc.  You get polymorphism if
the usage of a parameter doesn't give away all the information about a
type -- e.g., if a procedure swaps its two arguments but does nothing
else, it will accept arguments of any type, as long as the two
arguments have the same type.  There are interesting problems when you
try to write a type-checker for such a language, but they are solvable
(for ABC, at least).

Python, on the other hand, does all its type checking at run time.
E.g., if you try to add two objects, their types are compared, and if
they are both strings, or both integers, it succeeds, else you get a
type checking exception (which may be caught).

Programming in either language is extremely smooth compared to C or
other relatively low-level languages.  Because of the interpreters,
you don't get blinding speed if you can't get the built-in data types
to do most of the work for you, so neither language (implementation)
is well-suited for programming byte-copying loops or FFT transforms,
for example; but if you need efficient, large dynamic data structures,
they are available in the runtime system.

DISCUSSION

There are two questions which can be asked about ABC and Python (in
the context of the previous debate about static vs. dynamic typing).

1) Which is more likely to lead to bugfree programs?
2) Which has more flexibility (or: which do you like better)?

Re (1), I don't see a significant difference.  True, ABC detects
certain errors before you start running the program that Python will
only catch in a test run, but there remains a significant class of
errors that can only be detected at run time, e.g., taking the MIN of
an empty list, accessing an associative array with a non-existent key,
etc.  Both Python and ABC encourage program development in small units
that are created and tested bottom-up, and the first test run usually
finds any type errors that remain in Python programs.  Sometimes the
type checker in ABC detects a type error but reports it in such a
weird way that you have to stare really long at the piece of code in
question before you see what the real problem is; e.g., it may
complain that b is not a number in a+b, while the real problem is in
the expression assigned to b many lines earlier.  In Python I can ask
the debugger to print b's value when an exception has occurred, which
usually shows immediately what the problem is (the value gives a more
precise clue about where it came from than just the type, which ABC
reports).

Re (2), it can't be a surprise that I like Python better.  I actually
use heterogeneous lists a lot, e.g., just this weekend I wrote a
tree module which stores arbitrary attributes associated with the
leaves, keyed on strings chosen by the various modules that use the
trees, and an event queue which can store arbitrary events from
unrelated modules.  These modules would pass ABC's type system
alright (except for other features not provided in ABC that don't
relate directly to dynamic/static typing), but their callers wouldn't:
as soon as two unrelated modules would store differently-typed events
in a queue, neither module would type-check.  This is caused by the
absence of unions in ABC, but notice that they would have to be
open-ended unions (David Gudeman's "any" type), since the queue module
cannot know what types will be stored in the queue.  Any one program
will only store a finite number of different event types (hmm, can you
prove this?) but I don't see the point of forcing the programmer to
find out which union type to use in a particular program (and keeping
it up-to-date when the program is extended).

CONCLUSION

Were all else kept equal, I'd still prefer dynamic typing.  Its
simplicity in implementation wins over the complexity of a type
checker that provides the kind of flexibility I need.  ABC is close,
but still too limiting for many realistic applications, and its type
checker was one of the hardest modules to add to the system.

Note that dynamically typed languages can often be optimized using
techniques similar to ABC's type checking algorithm -- only they can
fall back on the dynamic type system if they can't compute the
type of a variable, instead of blaming the user.  I would even like it
if such an optimizer told me when it could prove that an unchecked
exception would certainly occur in a part of the program.  But there
is no reason why such an optimizer should stop at type checking --
static checks can check for definedness of variables (ABC has this!),
apply range checks, in short try to predict every kind of exception
that may occur in a program -- but there is no reason to restrict the
language to accept only programs for which it can be proven (with
known techniques) that no exception will ever occur.  Such languages
are unnecessarily restricting programmers.

--Guido van Rossum, CWI, Amsterdam <guido@cwi.nl>
"I can do everything you want in Macro Assembler.  No need for Compilers."

olson@lear.juliet.ll.mit.edu ( Steve Olson) (04/09/91)

In article <1991Apr8.085602.5082@tkou02.enet.dec.com> diamond@jit345.swstokyo.dec.com (Norman Diamond) writes:
   The multi-megabyte run time system would be useful while developing
   programs in statically typed languages too, and then you'd never see
   the mystery core dump.  

I certainly agree with the first part of your statement - witness Saber-C -
but not the second.  The real gut-buster problems I see in my line of work
are when a guy calls up from a remote site and says "It ran great for 3 days,
and then it crashed!!".  My experience has been that the dynamically-typed
system fails and you fix it the first time.  When the statically typed
system fails ... maybe you get lucky ... maybe you don't.  Note:  I'm
assuming the statically typed system is "deployed" without the run-time
environment - see below.

                            Debates between static and dynamic typing should
   center on the advantages of programming in each kind of language, not
   on what can be done with multi-megabyte support packages.

Agreed in principle, but harder to do than you think.  The poster who
talked about "A" and "B" languages accurately described how languages
- in practice - tend to line up.  Dynamic typing pretty much implies a
elaborate run-time environment, so to do an "apples-to-apples" comparison
we need a statically-typed language with a elaborate run-time enviroment.
So go off and design one, and report back to this list :-).  Or maybe some 
exist - enlighten me.

   --
   Norman Diamond       diamond@tkov50.enet.dec.com
   If this were the company's opinion, I wouldn't be allowed to post it.

- Steve Olson
  MIT Lincoln Laboratory
  olson@juliet.ll.mit.edu

diamond@jit345.swstokyo.dec.com (Norman Diamond) (04/09/91)

In article <3288@charon.cwi.nl> guido@cwi.nl (Guido van Rossum) writes:

>Python, which has dynamic typing, was strongly inspired by ABC, which has
>static typing.  In other aspects, the two are much more similar then
>your average pair of statically/dynamically typed languages: both are
>interactive and interpreted, and have large runtime packages that
>replace core dumps by understandable error messages (really exceptions
>for Python).
[...]
>Were all else kept equal, I'd still prefer dynamic typing.

Dynamic typing indeed makes some things easier, as Mr. Rossum explained.
It is nice to see real evidence for a change.

Anyway, there is a middle ground which has also been suggested by a few
others, and which some languages (e.g. Eiffel, and certain constructs in
C++) come close to providing anyway.  Perhaps some new terms would help
explain this.  Just as type and class have come to mean different things,
perhaps dynamic classing vs. static classing should be recognized.  With
static classing, a type inference system could determine, or a programmer
could specify, that a variable inherits certain known messages, though the
precise type need not be known statically.  I expect that static classing
will have a lot more use than static typing.

I would not try to ram either static typing or static classing down
anyone's throat, but both can be used by disciplined developers and
maintainers to catch some errors early.  Although it cannot catch all
errors, it certainly helps.  Part of the folklore of software engineering
is the magnitude of increase in costs as error detection is delayed.
--
Norman Diamond       diamond@tkov50.enet.dec.com
If this were the company's opinion, I wouldn't be allowed to post it.
--
Norman Diamond       diamond@tkov50.enet.dec.com
If this were the company's opinion, I wouldn't be allowed to post it.

adw@otter.hpl.hp.com (Dave Wells) (04/11/91)

olson@lear.juliet.ll.mit.edu ( Steve Olson) writes:
>People who get endlessly alarmed about the alleged lack of type saftey in
>a dynamically-typed languge are not well-informed about how Lisp or Smalltalk
>software development actually works.

Disclaimer: there almost certainly are people who *are* overly concerned
with runtime type safety in dynamically-typed languges, and who needlessly
lie awake at night worrying about it.

However. There have also been claims that "type errors are actually quite
rare in dynamically-typed languges". (Anecdotal evidence follows). We have
just developed an 18k LOC, 83 user-class application using ACTOR (SmallTalk
72 with sugared syntax for the PC). We found that a majority of the bugs
would have been caught by even a simple (i.e. 'ANSI-C-like') static
typechecker.

Although most of these (those which we've found so far :-|) were found "on
the first pass through", testing such a relatively large interactive
application exhaustively *and repeatedly* is exceedingly expensive compared
to a comparable compilation-style typecheck.

You may also be interested by the following definition of "=" for Sets,
adapted from the ACTOR kernel source. Can you spot the type error? Can you
explain why it means that testing for membership in a heterogeneous Set
containing both Sets and Strings (say) *sometimes* fails with a runtime
type error (depending on how the Set members hash into physical locations
in the Set)?

/* Infix equality method for Sets. Sets are equal if they are the same size
and have equal members. */
Def =(self, collection)
{ if size(self) ~= size(collection)
  then ^nil
  endif;
  do(self,{using(member)
    if not(member in collection)
    then ^nil
    endif;
  });
}!!

I'd be happy with manifest types with static checking, even if it were
optional. I'd be happy with a type inference system which would tell me
that "<collection> must understand size() and in(). This makes it an Array,
Bag, Set or Interval." But, for the kind of development work we're doing,
I'm not at all happy with neither.

diamond@jit345.swstokyo.dec.com  Norman Diamond at Digital Equipment
Corporation Japan writes:
|Anyway, there is a middle ground ... [details omitted] ...
|
|I would not try to ram either static typing or static classing down
|anyone's throat, but both can be used by disciplined developers and
|maintainers to catch some errors early.  Although it cannot catch all
|errors, it certainly helps.  Part of the folklore of software engineering
|is the magnitude of increase in costs as error detection is delayed.

Exactly.

Dave

klimas@iccgcc.decnet.ab.com (04/13/91)

In article <OLSON.91Apr6172626@lear.juliet.ll.mit.edu>, olson@lear.juliet.ll.mit.edu ( Steve Olson) writes:
> In article <887@puck.mrcu> paj@mrcu (Paul Johnson) writes:
>    Does anyone have any facts relating to the assertion that run-time
>    failures due to type errors are rare in Smalltalk development?
	Some info that I picked up from a company doing a lot of big
	project work with C++ (i.e. strong type checking language) was
	that postmortems on their project revealed only 10% of their
	errors were captured by strong type checking.

rsw@cs.brown.EDU (Bob Weiner) (04/13/91)

In article <4243.2805b94a@iccgcc.decnet.ab.com> klimas@iccgcc.decnet.ab.com writes:
> 	Some info that I picked up from a company doing a lot of big
> 	project work with C++ (i.e. strong type checking language) was
> 	that postmortems on their project revealed only 10% of their
> 	errors were captured by strong type checking.

I doubt that any professional software designers are willing to log every
instance that their compiler spits out an error.  Their productivity might
drop considerably.  Hence it is a fair bet that this company did not have
an accurate record of errors detected by a compiler for a strongly typed
language.  They may have only recorded problems found as the software
passed from design to quality testing.
--
Bob Weiner				   rsw@cs.brown.edu

cok@islsun.Kodak.COM (David Cok) (04/14/91)

In article <4243.2805b94a@iccgcc.decnet.ab.com> klimas@iccgcc.decnet.ab.com writes:
>	Some info that I picked up from a company doing a lot of big
>	project work with C++ (i.e. strong type checking language) was
>	that postmortems on their project revealed only 10% of their
>	errors were captured by strong type checking.

Did they really count ALL the errors -- including every time a compile failed
because of type errors?  It seems to me most type errors would be caught
before a peice of code even got into some error tracking process.

David R. Cok
Eastman Kodak Company
cok@Kodak.COM

ddean@rain.andrew.cmu.edu (Drew Dean) (04/14/91)

In article <4243.2805b94a@iccgcc.decnet.ab.com> klimas@iccgcc.decnet.ab.com writes:
>	Some info that I picked up from a company doing a lot of big
>	project work with C++ (i.e. strong type checking language) was
>	that postmortems on their project revealed only 10% of their
>	errors were captured by strong type checking.
I think this depends on what you call "strongly typed."  Personally, I wouldn't
call any C derived language strongly typed, as long as (void *) exists.
If you want to see REAL strong typing, try, say Standard ML (Ok, ok, it's
a functional language so I shouldn't be mentioning it in this group.  :-)
I don't want to start the functional vs OOP flamefest.)   I'm just using this
as an example of a really strong type system: wherever it can, it automatically
infers the type of the expression, and won't compile anything that doesn't
typecheck.
The only experience with dynamically-typed systems I have are Lisp-based
systems, just today I crashed one with a type error.  It could be that
imperative/functional programs are more prone to type-errors that OO programs;
(note: this is a conjecture) does anyone have any real data on the subject ?


-- 
Drew Dean
Drew_Dean@rain.andrew.cmu.edu
[CMU provides my net connection; they don't necessarily agree with me.]

brnstnd@kramden.acf.nyu.edu (Dan Bernstein) (04/14/91)

In article <RSW.91Apr13042119@delay.cs.brown.EDU> rsw@cs.brown.EDU (Bob Weiner) writes:
> In article <4243.2805b94a@iccgcc.decnet.ab.com> klimas@iccgcc.decnet.ab.com writes:
> > 	Some info that I picked up from a company doing a lot of big
> > 	project work with C++ (i.e. strong type checking language) was
> > 	that postmortems on their project revealed only 10% of their
> > 	errors were captured by strong type checking.
> I doubt that any professional software designers are willing to log every
> instance that their compiler spits out an error.  Their productivity might
> drop considerably.

It takes no effort to log every compilation. I do, and I know others who
do. It does take effort to log every little change you make to your
program, and every run-time error; I don't know of any C++PSE, though
the company might have developed one internally.

> Hence it is a fair bet that this company did not have
> an accurate record of errors detected by a compiler for a strongly typed
> language.

It's a fair bet that the 10% was fabricated by some programmer who at
the last minute was put on the spot and told to produce some statistics.
But I'd believe that it was in the ballpark: a language which forces you
to know what data types you're using will reduce type errors.

---Dan

diamond@jit345.swstokyo.dec.com (Norman Diamond) (04/15/91)

In article <4243.2805b94a@iccgcc.decnet.ab.com> klimas@iccgcc.decnet.ab.com writes:
>	Some info that I picked up from a company doing a lot of big
>	project work with C++ (i.e. strong type checking language) was
>	that postmortems on their project revealed only 10% of their
>	errors were captured by strong type checking.

This is a perfect example of engineering skills that have been learned
through hard efforts and then rejected by practitioners who refuse to
be engineers.

In any other field of engineering, a 10% increase in safety would always
be applied.  To reject it would be grounds for lawsuits, at least.

(Well, it did take a while before the auto industry accepted that lesson.
And one of the fallouts of the Reagan era is lower safety standards in
the auto industry.  Hmm.....)
--
Norman Diamond       diamond@tkov50.enet.dec.com
If this were the company's opinion, I wouldn't be allowed to post it.

gudeman@cs.arizona.edu (David Gudeman) (04/16/91)

In article  <1991Apr15.065146.16680@tkou02.enet.dec.com> Norman Diamond writes:
]In article <4243.2805b94a@iccgcc.decnet.ab.com> klimas@iccgcc.decnet.ab.com writes:
]>	Some info that I picked up from a company doing a lot of big
]>	project work with C++ (i.e. strong type checking language) was
]>	that postmortems on their project revealed only 10% of their
]>	errors were captured by strong type checking.
]...
]In any other field of engineering, a 10% increase in safety would always
]be applied.  To reject it would be grounds for lawsuits, at least.

How do you get a 10% increase in safety from the above?  If they
hadn't caught those errors by static typing they would have caught
them by testing -- the same way the found the other 90% of the errors.
This logic error combined with the strident tone of the article
suggests that your convictions are affecting your thinking.
--
					David Gudeman
gudeman@cs.arizona.edu
noao!arizona!gudeman

cs450a03@uc780.umd.edu (04/16/91)

Brian Boutel writes:

>The claim that testing catches all errors is laughable. Errors show
>up in production code every day. Testing, as we tell first year
>students, does not demonstrate the absence of errors, only their
>presence.

By the same token, the claim that static typing catches all errors is
invalid.  And if a language _requires_ a significant amount of
programmer time to be spent micro-managing static type declarations,
it introduces the possibility for more errors -- errors which may not
be caught in testing.

>So what can be said about all the errors in the quoted C++ project
>that were not caught by testing and are still in the production code?
> 
>I'll tell you. They are not type errors.

No, they were not _static_ type errors.  You don't know if they'd be
caught by run-time type checking.

Raul Rockwell

brian@comp.vuw.ac.nz (Brian Boutel) (04/16/91)

In article <1917@optima.cs.arizona.edu>, gudeman@cs.arizona.edu (David
Gudeman) writes:
|> In article  <1991Apr15.065146.16680@tkou02.enet.dec.com> Norman
|> Diamond writes:
|> ]In article <4243.2805b94a@iccgcc.decnet.ab.com>
|> klimas@iccgcc.decnet.ab.com writes:
|> ]>	Some info that I picked up from a company doing a lot of big
|> ]>	project work with C++ (i.e. strong type checking language) was
|> ]>	that postmortems on their project revealed only 10% of their
|> ]>	errors were captured by strong type checking.
|> ]...
|> ]In any other field of engineering, a 10% increase in safety would
|> always
|> ]be applied.  To reject it would be grounds for lawsuits, at least.
|> 
|> How do you get a 10% increase in safety from the above?  If they
|> hadn't caught those errors by static typing they would have caught
|> them by testing -- the same way the found the other 90% of the
|> errors.
|> This logic error combined with the strident tone of the article
|> suggests that your convictions are affecting your thinking.
|> --

The claim that testing catches all errors is laughable. Errors show up
in production code every day. Testing, as we tell first year students,
does not demonstrate the absence of errors, only their presence.

The figures quoted refer only to errors that have been detected, not to
all errors.

So what can be said about all the errors in the quoted C++ project that
were not caught by testing and are still in the production code? 

I'll tell you. They are not type errors.

--brian

 
-- 
Internet: brian@comp.vuw.ac.nz
Postal: Brian Boutel, Computer Science Dept, Victoria University of Wellington,
        PO Box 600, Wellington, New Zealand
Phone: +64 4 721000   Fax: +64 4 712070

sakkinen@jyu.fi (Markku Sakkinen) (04/16/91)

In article <1917@optima.cs.arizona.edu> gudeman@cs.arizona.edu (David Gudeman) writes:
>In article  <1991Apr15.065146.16680@tkou02.enet.dec.com> Norman Diamond writes:
>]In article <4243.2805b94a@iccgcc.decnet.ab.com> klimas@iccgcc.decnet.ab.com writes:
>]>	Some info that I picked up from a company doing a lot of big
>]>	project work with C++ (i.e. strong type checking language) was
>]>	that postmortems on their project revealed only 10% of their
>]>	errors were captured by strong type checking.
>]...
>]In any other field of engineering, a 10% increase in safety would always
>]be applied.  To reject it would be grounds for lawsuits, at least.
>
>How do you get a 10% increase in safety from the above?  If they
>hadn't caught those errors by static typing they would have caught
>them by testing -- the same way the found the other 90% of the errors.

It is very wishful thinking that all errors would be found by testing.
"Testing can only prove the presence of errors, never their absence."
But I think this quote is from Hoare, whom you probably would not
believe anyway.

>This logic error combined with the strident tone of the article
>suggests that your convictions are affecting your thinking.
>--
>					David Gudeman

Look who's talking!  When you have a difference of opinion with Diamond,
you call his opinion a "logic error" and then accuse him of "strident tone".
BTW, it would be a queer conviction indeed that wouldn't affect a person's
thinking at all.

Markku Sakkinen
Department of Computer Science and Information Systems
University of Jyvaskyla (a's with umlauts)
PL 35
SF-40351 Jyvaskyla (umlauts again)
Finland
          SAKKINEN@FINJYU.bitnet (alternative network address)

dnsurber@lescsse.jsc.nasa.gov (Douglas Surber) (04/16/91)

In <1917@optima.cs.arizona.edu> gudeman@cs.arizona.edu (David Gudeman) writes:

>If they
>hadn't caught those errors by static typing they would have caught
>them by testing -- the same way the found the other 90% of the errors.
>This logic error combined with the strident tone of the article
>suggests that your convictions are affecting your thinking.

The real question is how long would it take them to catch those errors
by testing?  The type checker catches them all first time every time.
Admittedly not every error, but reducing the number of errors by 10%
before you even begin testing sounds like a win to me.  

Further, the type checker finds type errors even in code that, for one
reason or another, is never excersized during testing.  Again not every
error, but a 10% reduction is still something.

--
Douglas Surber                 Internet: dnsurber@jsc.nasa.gov
Lockheed                       NASAmail: dnsurber/jsc/nasa
Houston, Texas                 Phone:    713-283-5195
Life can be only understood backwards, but it must be lived forwards.

mathew@mantis.co.uk (mathew) (04/16/91)

diamond@jit345.swstokyo.dec.com (Norman Diamond) writes:
> In article <4243.2805b94a@iccgcc.decnet.ab.com> klimas@iccgcc.decnet.ab.com w
> >	Some info that I picked up from a company doing a lot of big
> >	project work with C++ (i.e. strong type checking language) was
> >	that postmortems on their project revealed only 10% of their
> >	errors were captured by strong type checking.
[...]
> In any other field of engineering, a 10% increase in safety would always
> be applied.  To reject it would be grounds for lawsuits, at least.

Possibly true, although I can think of examples in other fields of
engineering where 10% increases in safety have NOT been applied. Look at the
design of roll-on, roll-off ferries, for example. Or US domestic electrical
wiring (no earth, no ELCB!)

Also, remember that using a strongly-typed statically typed language is not
necessarily the only way to achieve that 10% increase.

mathew

--
If you're a John Foxx fan, please mail me!

dmg@ssc-vax (David M Geary) (04/16/91)

In article <387@smds.UUCP>, Brian Boutel writes:

In article <1917@optima.cs.arizona.edu>, gudeman@cs.arizona.edu (David
Gudeman) writes:
|> In article  <1991Apr15.065146.16680@tkou02.enet.dec.com> Norman
|> Diamond writes:
|> ]In article <4243.2805b94a@iccgcc.decnet.ab.com>
|> klimas@iccgcc.decnet.ab.com writes:
|> ]>	Some info that I picked up from a company doing a lot of big
|> ]>	project work with C++ (i.e. strong type checking language) was
|> ]>	that postmortems on their project revealed only 10% of their
|> ]>	errors were captured by strong type checking.
|> ]...
|> ]In any other field of engineering, a 10% increase in safety would
|> always
|> ]be applied.  To reject it would be grounds for lawsuits, at least.
|> 
|> How do you get a 10% increase in safety from the above?  If they
|> hadn't caught those errors by static typing they would have caught
|> them by testing -- the same way the found the other 90% of the
|> errors.

]The claim that testing catches all errors is laughable. Errors show up
]in production code every day. Testing, as we tell first year students,
]does not demonstrate the absence of errors, only their presence.

] So what can be said about all the errors in the quoted C++ project that
] where not caught by testing and are still in the production code?

] I'll tell you. They are not type errors.

  Not neccessarily:  

struct junk  { int  x;  float  f; };
struct other { int  y;  double d; };

main() 
{
  static struct junk  myJunk = {1, 1.1};
  struct junk  *p = &myJunk;

  printf("%d\n", ((struct other*)myJunk)->x);
}

  Looks like a type error that got by the compiler to me.  Note that statically
typed languages (such as C++) usually contain some mechanism (here, the cast)
so that the typing system enforced by the compiler may be circumvented.
Therefore, as shown above, type errors can still get by the compiler.

  In fact, in C++, considerable effort is expended (via manipulating the
inheritance tree, or casting derived classes to base classes, etc) to ensure 
that the compiler will accept a given expression as "type safe".  However,
the given expression may well violate the type system.  Errors such as this
are difficult to track down ...

gudeman@cs.arizona.edu (David Gudeman) (04/17/91)

In article  <1991Apr15.234559.16293@comp.vuw.ac.nz> Brian Boutel writes:
]|> 
]|> How do you get a 10% increase in safety from the above?  If they
]|> hadn't caught those errors by static typing they would have caught
]|> them by testing -- the same way the found the other 90% of the
]|> errors.

]The claim that testing catches all errors is laughable.

Sure is.  Did someone make that claim?

]The figures quoted refer only to errors that have been detected, not to
]all errors.

And the errors that can be caught by static type checking are almost
universally so trivial that they probably all _would_ be caught by
testing.  The same cannot be said for the errors introduced by complex
code that is trying to get around the static type declarations -- to
do something that would be easy in a dynamically typed language.

]So what can be said about all the errors in the quoted C++ project that
]were not caught by testing and are still in the production code? 
]
]I'll tell you. They are not type errors.

Nope, you can't tell that.  C++ production code can core dump due to a
type error.  That's because to implement something that really needs
dynamic typing you often have to use tricks that leave all type
checking up to the programmer.

What can you say about errors that occur in  production code in
dynamically typed languages?  First, they are not caused by the
necessity of getting around the static type checker.  Second, they are
almost never type errors of the sort that could have been caught by a
static type checker.  Third, if they are type errors they cannot cause
a core dump or other unpredictable behavior.
--
					David Gudeman
gudeman@cs.arizona.edu
noao!arizona!gudeman

sfk@otter.hpl.hp.com (Steve Knight) (04/17/91)

Drew writes:
> The only experience with dynamically-typed systems I have are Lisp-based
> systems, just today I crashed one with a type error.  

If your system crashed, it has a serious bug.  Type errors should be caught at
run-time and put you into a debugging loop.

[Drew also suggests ML as being a serious strongly typed system.]
I had a similar type problem with ML, too.   I found a hole in the type checker
of my implementation & took a long time finding it.  It was a serious bug in
the implementation -- but not the language.  Similarly, your LISP bug
is a bug in the implementation.

Furthermore, there's not much point in taking C to task for having (void *),
when mostly every implementation of ML has the equivalent of New Jersey's
    System.Unsafe.cast : 'a -> 'b
I only say "mostly" because I don't know about POLY/ML.  (But I'll take a bet.)
This little charmer is even worse than (void *).

Admittedly, it isn't part of the language -- which is almost worse.  If some
one uses it, then the code has to be changed before it can be ported.  I 
don't like the idea of changing code that has been written using this
monstrosity.  If I am going to use packages that require it, then I will
trust the implementor to get it right once.

Steve

mathew@mantis.co.uk (mathew) (04/17/91)

dnsurber@lescsse.jsc.nasa.gov (Douglas Surber) writes:
> In <1917@optima.cs.arizona.edu> gudeman@cs.arizona.edu (David Gudeman) writes
> >If they
> >hadn't caught those errors by static typing they would have caught
> >them by testing -- the same way the found the other 90% of the errors.
> >This logic error combined with the strident tone of the article
> >suggests that your convictions are affecting your thinking.
> 
> The real question is how long would it take them to catch those errors
> by testing?  The type checker catches them all first time every time.
> Admittedly not every error, but reducing the number of errors by 10%
> before you even begin testing sounds like a win to me.  

What, even if your code has to be twice as long and twice as complicated and
take three times as long to write, because you're writing it in a
statically-typed language?

Yes, if the program you're writing is NOT best expressed in a
dynamically-typed way, then go ahead and use static typing. But saying that
static typing is ALWAYS the best solution is just blinkered stubbornness.

mathew

--
If you're a John Foxx fan, please mail me!

brnstnd@kramden.acf.nyu.edu (Dan Bernstein) (04/18/91)

In article <2023@optima.cs.arizona.edu> gudeman@cs.arizona.edu (David Gudeman) writes:
> And the errors that can be caught by static type checking are almost
> universally so trivial that they probably all _would_ be caught by
> testing.

Goofus uses dynamic typing. Gallant uses static typing.

Goofus says static typing catches so few errors that they probably all
_would_ be caught by testing. Gallant simply catches all the errors.

Goofus finds 95% of the typing bugs during testing. Gallant finds 100%.

Management fires Goofus.

---Dan
P.S. 1/2 :-) for the humor-impaired.

dmg@ssc-vax.uucp (David M Geary) (04/18/91)

]  David Gudeman
]] Douglas Surber

]If they hadn't caught those errors by static typing they would have caught
]them by testing -- the same way the found the other 90% of the errors.
]This logic error combined with the strident tone of the article
]suggests that your convictions are affecting your thinking.

]]The real question is how long would it take them to catch those errors
]]by testing?  The type checker catches them all first time every time.
]]Admittedly not every error, but reducing the number of errors by 10%
]]before you even begin testing sounds like a win to me.  

  Sounds like a win to me, also.  However, there are two caveats:
  
1)  Since dynamic typing allows more expressiveness and a greater degree
    of reusability than a statically typed language, code in a statically 
    typed language may be 10% (or more) larger than the "same" code in a 
    dynamically typed language.  So, while some errors have been ironed out
    before testing, testing may have more to test.

2)  Errors caused by circumvention of the type system in a statically typed
    language may *not* be caught by the compiler, and are difficult to
    track down.  In fact, I demonstrated such an error in an earlier posting.

]]Further, the type checker finds type errors even in code that, for one
]]reason or another, is never excersized during testing.  Again not every
]]error, but a 10% reduction is still something.

  Again, realize that the type checker will not find errors that fall into
category #2 above.

barmar@think.com (Barry Margolin) (04/18/91)

In article <2023@optima.cs.arizona.edu> gudeman@cs.arizona.edu (David Gudeman) writes:
>What can you say about errors that occur in  production code in
>dynamically typed languages?  First, they are not caused by the
>necessity of getting around the static type checker.  Second, they are
>almost never type errors of the sort that could have been caught by a
>static type checker.  Third, if they are type errors they cannot cause
>a core dump or other unpredictable behavior.

The above assumes that a dynamically typed language always does type
checking.  Common Lisp, for instance, does not require runtime type
checking.  The following has unpredictable behavior:

(rplaca (read) 3)

If the input does not contain the representation of a cons then it has
undefined behavior, and can core dump, corrupt the Lisp image, or launch a
nuke.

A static type checker could flag this as a potential type error, but it
shouldn't disallow the code.  If the input is coming from the user I'd
consider the above to be poor code, but if it is coming from a
machine-generated source that is "guaranteed" to have the representation of
a cons, then it is safe; for instance, the following is fine:

(with-input-from-string (*standard-input* (format nil "(~S)" object))
  (rplaca (read) 3))

is guaranteed to return the same thing as

(list 3)

or signal a read-error if OBJECT doesn't have a readable printed
representation.
--
Barry Margolin, Thinking Machines Corp.

barmar@think.com
{uunet,harvard}!think!barmar

barmar@think.com (Barry Margolin) (04/18/91)

In article <8189:Apr1719:16:2091@kramden.acf.nyu.edu> brnstnd@kramden.acf.nyu.edu (Dan Bernstein) writes:
>Goofus uses dynamic typing. Gallant uses static typing.
...
>Management fires Goofus.

>
>Goofus says static typing catches so few errors that they probably all
>_would_ be caught by testing. Gallant simply catches all the errors.
>
>Goofus finds 95% of the typing bugs during testing. Gallant finds 100%.
>

Goofus uses dynamic typing with automatic runtime type checking.  Gallant
uses static type checking, with no runtime type checking (except perhaps
that explicitly programmed by Gallant, e.g. variant record discriminant
examination).

Goofus says static typing catches very few errors that aren't caught by
runtime type checks during testing, and doesn't catch any of the errors
that are not detectable statically (e.g. misuse of unions and pointer
casting).

Goofus finds 95% of the statically detectable typing bugs, and 95% of the
non-statically detectable typing bugs.  Gallant finds 100% of the
statically detectable typing bugs, and 5% of the non-statically detectable
typing bugs (the ones that happen to result in core dumps).

The winner depends on the style of the program.  Users of statically-typed
languages often limit themselves (perhaps unconsciously) to using the
subset of the language that is easy to check statically.  Then they declare
that dynamic checking isn't needed, because static checking is able to find
all their type errors.

I feel similarly about array bounds checking.

Actually, I think a good solution is a compromise, a la Common Lisp and
some polymorphic functional languages.  The programmer may include optional
type declarations, and the compiler should do type propogation.  Where the
compiler can prove type safety, it doesn't need to insert runtime type
checks; when it can't, it puts in the checks.  I've read about similar
schemes for optimizing away most array bounds checks.

--
Barry Margolin, Thinking Machines Corp.

barmar@think.com
{uunet,harvard}!think!barmar

jls@rutabaga.Rational.COM (Jim Showalter) (04/18/91)

%>]>	Some info that I picked up from a company doing a lot of big
%>]>	project work with C++ (i.e. strong type checking language) was
%>]>	that postmortems on their project revealed only 10% of their
%>]>	errors were captured by strong type checking.
%>]...
%>]In any other field of engineering, a 10% increase in safety would always
%>]be applied.  To reject it would be grounds for lawsuits, at least.
%
%>How do you get a 10% increase in safety from the above?  If they
%>hadn't caught those errors by static typing they would have caught
%>them by testing -- the same way the found the other 90% of the errors.
%>This logic error combined with the strident tone of the article
%>suggests that your convictions are affecting your thinking.

Bzzzzzzt. Thank you for playing.

Testing is NOT guaranteed to catch these errors, whereas static typing
is, by its nature, guaranteed to do so. I defy you to come up with a
way to ensure that you have traversed all possible paths through a
dynamically-typed program that could conceivably send the wrong message
to the wrong recipient.

Last time I looked, this sort of coverage-with-range-tracking stuff
was NP-hard, which means for any program of reasonable size you'd have
to test it for more years than there are atoms in the universe to
exhaust all possible ways it could go bump.

Nobody is claiming that there are not ways to go bump in statically-typed
languages, only that the number of ways you can go bump is REDUCED. 
Such risk reduction seems to me to be a fine way to increase program
reliability, which seems like a noble goal. Why anybody would deliberately
choose to code in a language that is guaranteed to permit MORE errors
than another language is beyond me.
--
* The opinions expressed herein are my own, except in the realm of software *
* engineering, in which case I borrowed them from incredibly smart people.  *
*                                                                           *
* Rational: cutting-edge software engineering technology and services.      *

sakkinen@jyu.fi (Markku Sakkinen) (04/18/91)

In article <2023@optima.cs.arizona.edu> gudeman@cs.arizona.edu (David Gudeman) writes:
>In article  <1991Apr15.234559.16293@comp.vuw.ac.nz> Brian Boutel writes:
> ...
>]The claim that testing catches all errors is laughable.
>
>Sure is.  Did someone make that claim?

That is how I too have understood several of your postings.
You have even added "in a matter of minutes".
Glad to hear you don't really think so.

> ...
>]So what can be said about all the errors in the quoted C++ project that
>]were not caught by testing and are still in the production code? 
>]
>]I'll tell you. They are not type errors.
>
>Nope, you can't tell that.  C++ production code can core dump due to a
>type error.  That's because to implement something that really needs
>dynamic typing you often have to use tricks that leave all type
>checking up to the programmer.

C and C++ are a bad choice to represent statically typed languages
in these comparisons.  C descends from some truly _untyped_ languages
and that still shows.  It has only partial type checking at compile
time and none whatsoever at run time.  C++ is not essentially better.

>What can you say about errors that occur in  production code in
>dynamically typed languages?  First, they are not caused by the
>necessity of getting around the static type checker.  Second, they are
>almost never type errors of the sort that could have been caught by a
>static type checker.  Third, if they are type errors they cannot cause
>a core dump or other unpredictable behavior.

However, there are situations in which "Message not understood"
is not essentially better than a core dump.  Typically, embedded
systems running unoperated.

Markku Sakkinen
Department of Computer Science and Information Systems
University of Jyvaskyla (a's with umlauts)
PL 35
SF-40351 Jyvaskyla (umlauts again)
Finland
          SAKKINEN@FINJYU.bitnet (alternative network address)

cs450a03@uc780.umd.edu (04/18/91)

Jim Showalter writes:

>Bzzzzzzt. Thank you for playing.

*sigh* *yawn*

>I defy you to come up with a way to ensure that you have traversed
>all possible paths through a dynamically-typed program that could
>conceivably send the wrong message to the wrong recipient.

Design the program so that there are only a small number of paths in
any given module.  Test each module before using.

One of the advantages of dynamic typing is that it allows you to
represent information as data which, in a statically typed language,
can only be represented as control flow.

Raul Rockwell

gudeman@cs.arizona.edu (David Gudeman) (04/19/91)

In article  <jls.671951286@rutabaga> Jim Showalter writes:
]%
]%>...  If they
]%>hadn't caught those errors by static typing they would have caught
]%>them by testing -- the same way the found the other 90% of the errors.

]Bzzzzzzt. Thank you for playing.

Bzzzzzzt.  Thank you for being obnoxious.  That might have been funny
the first time it was done, but it is getting really old.  Especially
as it is usually done by people who demonstrably know no more than the
person they are replying to.

]Testing is NOT guaranteed to catch these errors, whereas static typing
]is, by its nature, guaranteed to do so.

No, it isn't guaranteed.  But it will catch most of them, and I claim
that the tradeoff (static typing _causes_ some errors that aren't
caught) is in favor of dynamic typing.

]I defy you to come up with a
]way to ensure that you have traversed all possible paths through a
]dynamically-typed program that could conceivably send the wrong message
]to the wrong recipient.

Ooh.  I've been defied.  Well lets see, this should take about 3
seconds...  OK, write a type inferencer.  (Not that I think this is
obviously worth doing.)

]Nobody is claiming that there are not ways to go bump in statically-typed
]languages, only that the number of ways you can go bump is REDUCED. 

People aren't just claiming that, they are assuming it.  I think they
need to question their assumptions.

]... Why anybody would deliberately
]choose to code in a language that is guaranteed to permit MORE errors
]than another language is beyond me.

See what I mean?  Assuming that dynamic typing has more errors.
Completely without evidence, I might add (or repeat actually).
--
					David Gudeman
gudeman@cs.arizona.edu
noao!arizona!gudeman

barmar@think.com (Barry Margolin) (04/19/91)

In article <2400043@otter.hpl.hp.com> sfk@otter.hpl.hp.com (Steve Knight) writes:
>Drew writes:
>> The only experience with dynamically-typed systems I have are Lisp-based
>> systems, just today I crashed one with a type error.  
>If your system crashed, it has a serious bug.  Type errors should be caught at
>run-time and put you into a debugging loop.

Not if you've turned off runtime type checking:

(proclaim '(optimize (speed 3) (safety 0)))

declares that the programmer wants the code to go as fast as possible, and
doesn't want any time wasted on unnecessary type checking ("necessary" type
checking is that which is needed to implement generic functions).

When you make such a declaration, type problems become the programmer's
problem, just like in statically-typed languages.
--
Barry Margolin, Thinking Machines Corp.

barmar@think.com
{uunet,harvard}!think!barmar

bertrand@eiffel.UUCP (Bertrand Meyer) (04/19/91)

From article <3865@ssc-bee.ssc-vax.UUCP>
by dmg@ssc-vax.uucp (David M Geary):

> 1)  Since dynamic typing allows more expressiveness and a greater degree
>     of reusability than a statically typed language, code in a statically 
>     typed language may be 10% (or more) larger than the "same" code in a 
>     dynamically typed language.  So, while some errors have been ironed out
>     before testing, testing may have more to test.

The premise of this sentence (a statically typed language is less
expressive) is, I believe, incorrect. With the appropriate
typing facilities (in particular genericity, constrained and
unconstrained) you lose absolutely nothing in expressiveness.
This has been discussed at length in this newsgroup before,
and in spite of many efforts and even ``challenges''
I don't think anyone has shown a convincing counter-example -
something that could be expressed conveniently in a dynamically
typed language but not in a statically typed language.
(Again, I am assuming a decent type system; if you don't have
genericity, or if arbitrary C-like casts are allowed,
this is all meaningless anyway.)

Mr. Geary is quite correct, however, that a statically typed language
will be slightly more verbose because of the need for declaring
entities (attributes, local variables, function results).
But this should have at most a negligible effect on testing.
Typically, one tests the executable parts of a software system
(the instructions), not the declarations. In fact, the presence
of declarations may be a significant help in eliminating useless
tests and making each individual test more productive.
(The same applies to assertions, but that's another story.)

It is true, of course, that as soon as you introduce more text
(such as declarations) you introduce more possibilities of error.
This is because static typing follows the time-honored method
of introducing redundancy to make it easier to catch errors.
The reason this is a good method is that more
redundancy means more likelihood that a conceptual error will
become an *inconsistency*, and inconsistencies are always
much easier to detect than more general types of errors.
The catch is that you may then detect inconsistencies which
are *not* original errors: for example, they may result from errors
in the elements that have been added for redundancy. (This occurs
for example when a variable is used properly, but its declaration
is ``wrong'' with respect to the programmer's intent.)

This means that when you add redundancy for security there is a risk
of false alarm (detecting an apparent error when the original was in
fact correct). If, as is the case in the practice of reasonable
statically typed languages, such false alarms are rare, the price
is acceptable, being justified by all the *real* alarms that
redundancy will enable you to catch.

As an analogy (offered for what it's worth), in some countries
elementary school students are (were?) taught to check
an arithmetic operation by ``crossing out nines'' (``preuve par 9''),
which is a quick modulo-9 check. Once in a while you will find
an inconsistency which results from an error in the crossing-out,
not in the original operation. However this occasional nuisance
- which merely forces you to redo the operation and the check -
is nothing to complain about when compared to all the
*real* errors that this type of checking catches.
-- 
-- Bertrand Meyer
Interactive Software Engineering Inc., Santa Barbara
bertrand@eiffel.uucp

new@ee.udel.edu (Darren New) (04/23/91)

In article <530@eiffel.UUCP> bertrand@eiffel.UUCP (Bertrand Meyer) writes:
>I don't think anyone has shown a convincing counter-example -
>something that could be expressed conveniently in a dynamically
>typed language but not in a statically typed language.

I think a "resource manager" is a good example.  By this, I mean a module
which accepts <name, object> pairs either to be stored or retrieved. 
Presumedly, the one storing and the one retrieving need to know what 
the type is, but the resource manager need not.  Sort of like an
environment-variable list, except that the right-hand side is not
restricted to single strings but rather can be any type in the
language.             - Darren
-- 
--- Darren New --- Grad Student --- CIS --- Univ. of Delaware ---
----- Network Protocols, Graphics, Programming Languages, FDTs -----
     +=+=+ My time is very valuable, but unfortunately only to me +=+=+
+=+ Nails work better than screws, when both are driven with screwdrivers +=+

diamond@jit345.swstokyo.dec.com (Norman Diamond) (04/24/91)

In article <1917@optima.cs.arizona.edu> gudeman@cs.arizona.edu (David Gudeman) writes:
>In article  <1991Apr15.065146.16680@tkou02.enet.dec.com> Norman Diamond writes:
>>In article <4243.2805b94a@iccgcc.decnet.ab.com> klimas@iccgcc.decnet.ab.com writes:
>>>	Some info that I picked up from a company doing a lot of big
>>>	project work with C++ (i.e. strong type checking language) was
>>>	that postmortems on their project revealed only 10% of their
>>>	errors were captured by strong type checking.
>>In any other field of engineering, a 10% increase in safety would always
>>be applied.  To reject it would be grounds for lawsuits, at least.
>How do you get a 10% increase in safety from the above?  If they
>hadn't caught those errors by static typing they would have caught
>them by testing -- the same way the found the other 90% of the errors.

Sorry for the delay -- I have just discovered that topics which I reply to
in comp.object somehow move into comp.lang.misc, sigh.

Anyway, 10% of the caught errors were type errors.
By grossly rough estimate, 10% of the uncaught errors are also type errors.
Strong typing would reduce the number of remaining, uncaught errors by 10%.
--
Norman Diamond       diamond@tkov50.enet.dec.com
If this were the company's opinion, I wouldn't be allowed to post it.

objtch@extro.ucc.su.oz.au (Peter Goodall) (04/24/91)

dnsurber@lescsse.jsc.nasa.gov (Douglas Surber) writes:

>In <1917@optima.cs.arizona.edu> gudeman@cs.arizona.edu (David Gudeman) writes:

>>If they
>>hadn't caught those errors by static typing they would have caught
>>them by testing -- the same way the found the other 90% of the errors.
>>This logic error combined with the strident tone of the article
>>suggests that your convictions are affecting your thinking.

>The real question is how long would it take them to catch those errors
>by testing?  The type checker catches them all first time every time.
>Admittedly not every error, but reducing the number of errors by 10%
>before you even begin testing sounds like a win to me.  

>Further, the type checker finds type errors even in code that, for one
>reason or another, is never excersized during testing.  Again not every
>error, but a 10% reduction is still something.

This mystical 10% says nothing about the other 90%, or what is responsible
for them. They could be caused by having to rewrite code because the static
typing did not provide enough reusability :-).

--
------------------------------------------------------------------------
Peter Goodall - Smalltalk Systems Consultant - objtch@extro.ucc.su.oz.au
      ObjecTech Pty. Ltd. - Software Tools, Training, and Advice
162 Burns Bay Rd, LANE COVE, NSW, AUSTRALIA. - Telephone: +61 2 418-7433

dlw@odi.com (Dan Weinreb) (04/28/91)

In article <530@eiffel.UUCP> bertrand@eiffel.UUCP (Bertrand Meyer) writes:

   The catch is that you may then detect inconsistencies which
   are *not* original errors: for example, they may result from errors
   in the elements that have been added for redundancy. (This occurs
   for example when a variable is used properly, but its declaration
   is ``wrong'' with respect to the programmer's intent.)

   This means that when you add redundancy for security there is a risk
   of false alarm (detecting an apparent error when the original was in
   fact correct). If, as is the case in the practice of reasonable
   statically typed languages, such false alarms are rare, the price
   is acceptable, being justified by all the *real* alarms that
   redundancy will enable you to catch.

Yes, I agree completely, and here's another analogy that people might
like.  Consider parity checking on computer main memory.  Adding
parity bits allows a consistency check, so that you can easily
recognize when one of the stored bits in memory is incorrect.  The
catch is that perhaps the parity bit itself might have the wrong
value!  In this case, you actually would have been better off without
the parity bit.  However, I think most people can see why, despite
this catch, there's quite a lot to be said for the parity bit, and
most people would rather have parity checking than no checking, and
are willing to pay for it.

This isn't a perfect analogy for all aspects of the issue under
discussion, of course.  I just think it's a good way to explain the
business about the "catch", and the risk of false alarm, and why that
might not be as bad some people might at first think.  Whether the
relative costs for various kinds of software construct are similar
to the relative costs for parity checking is not necessarily obvious,
but the qualitative nature of the tradeoff is similar in this way.

cpp@calmasd.Prime.COM (Chuck Peterson) (05/03/91)

I am glad Mr. Meyer and others admit there is a cost to redundancy.

Consider that many of the languages users prefer, from novice to theoretician,
from Basic to Lisp and Smalltalk, are dynamically typed.  Both because
it simplifies ad hoc experimenting, reduces code size and manifest
complexity, and makes polymorphism natural.

Has a cost of additional error and maintenance of dynamically vs.
statically typed programming really been proven?  No doubt perfectly
unmaintainable code has been written either way, and perhaps some of
the most challenging of smaller products, such as GNU emacs, have been
written with dynamically typed languages.  (Perhaps many formidable large
products as well.)  Even if there is an additional cost,  is it not usually
outweighed by productivity and inventiveness.

Perhaps statically typed languages should be used like assembly language,
where the added security (as with performance for assembly) justifies
recoding.  (Perhaps both security and performance in some measure.)

rockwell@socrates.umd.edu (Raul Rockwell) (05/03/91)

Chuck Peterson:
> Perhaps statically typed languages should be used like assembly
> language, where the added security (as with performance for
> assembly) justifies recoding.  (Perhaps both security and
> performance in some measure.)

Perhaps I'm missing some fundamental insight, but I fail to see how
static typing provides any sort of security.

Consider an arbitrary program, with (for example) several thousand
variables and a few dozen types.  If the program says:  f(x),
what does static typing buy you, security-wise?  You can only
misidentify x a few hundred ways instead of a few thousand?  (Awesome
reliability there).

Unless you are intending to allow only one instance of a type in any
lexical name-space?  That would be security!  [And you'd be writing
your program at least twice, once in "type expressions", and once in
"imperative expressions".  There's nothing quite like trying to keep
two versions of the same thing in sync to improve reliability...]

Performance, you can get, to the extent that your compiler
pre-computes the results of your program.  ("Static typing" means
evaluate "type expressions" at compile time).

Raul Rockwell

diamond@jit533.swstokyo.dec.com (Norman Diamond) (05/09/91)

In article <ROCKWELL.91May2222805@socrates.umd.edu> rockwell@socrates.umd.edu (Raul Rockwell) writes:
>Chuck Peterson:
>> Perhaps statically typed languages should be used like assembly
>> language, where the added security (as with performance for
>> assembly) justifies recoding.  (Perhaps both security and
>> performance in some measure.)
>Perhaps I'm missing some fundamental insight, but I fail to see how
>static typing provides any sort of security.
>Consider an arbitrary program, with (for example) several thousand
>variables and a few dozen types.  If the program says:  f(x),
>what does static typing buy you, security-wise?  You can only
>misidentify x a few hundred ways instead of a few thousand?  (Awesome
>reliability there).

Exactly.  In this scenario, if you misidentify x, there is a 90% chance
that your error will be caught.  This is 9 times as valuable as a case
previously discussed, where other fields of engineering value a 10%
improvement in safety.

>Unless you are intending to allow only one instance of a type in any
>lexical name-space?  That would be security!  [And you'd be writing
>your program at least twice, once in "type expressions", and once in
>"imperative expressions".  There's nothing quite like trying to keep
>two versions of the same thing in sync to improve reliability...]

Although excessive and rhetorical, this also has its value.
Why did businesses used to use "verifiers" on their punched cards for
data entry?  Why do fault-tolerant machines compute many results twice?
You'd better believe there is nothing like redundancy to improve reliability.

When mismatches are detected (which can benefit greatly from mechanical
assistance or compilers), they can be inspected manually to see if they are
really errors, or which part has the errors.  This is far better than skipping
half of them and letting 10% or 90% of the other half remain undetected.
--
Norman Diamond       diamond@tkov50.enet.dec.com
If this were the company's opinion, I wouldn't be allowed to post it.

rockwell@socrates.umd.edu (Raul Rockwell) (05/09/91)

me: Consider an arbitrary program, with (for example) several thousand
me: variables and a few dozen types.  If the program says: f(x), what
me: does static typing buy you, security-wise?  You can only
me: misidentify x a few hundred ways instead of a few thousand?
me: (Awesome reliability there).

Norman Diamond:
   Exactly.  In this scenario, if you misidentify x, there is a 90%
   chance that your error will be caught.  This is 9 times as valuable
   as a case previously discussed, where other fields of engineering
   value a 10% improvement in safety.

Enumerating the number of ways a variable may be misidentified does
not properly identify the chances that it will be caught.  Run-time
insanity checks can provide a degree of robustness that is not
available using only compile time checks.

Just because your language will not catch certain errors except with
static typing doesn't mean that my language has the same problem.
Also note that dynamic typing allows wonderful properties in a
debugger.

Note that I'm classifying as type checking: array bounds checking,
un-assigned variable checking, range checking, storage class checking
and exceptional values checking.  A compiler can catch some instances
of un-assigned variables, and some instances of storage class
checking.  It's not going to be able to check anything dynamic (except
the static references). A really good compiler might be able to catch
_some_ instances of value out of range.

Ultimately, making the programmer work to make sure that variable Quad
is not assigned to variable Tape is not going to do anything to make
sure that Quad is being used correctly -- it's just going to make sure
that one of many stupid mistakes is not made.

me: Unless you are intending to allow only one instance of a type in
me: any lexical name-space?  That would be security!  [And you'd be
me: writing your program at least twice, once in "type expressions",
me: and once in "imperative expressions".  There's nothing quite like
me: trying to keep two versions of the same thing in sync to improve
me: reliability...]

Norman Diamond:
   Although excessive and rhetorical, this also has its value.  Why
   did businesses used to use "verifiers" on their punched cards for
   data entry?  Why do fault-tolerant machines compute many results
   twice?  You'd better believe there is nothing like redundancy to
   improve reliability.

Why do some machines associate type with a variable at runtime?  Could
it be because there is nothing like redundancy to improve reliability?

(yeah, I'm leaving out an issue here:  additional features which come
with a typing system.  Some such features are gratuitous, some are so
significant that they make reliability arguments pale.)

Raul Rockwell

quale@khan.cs.wisc.edu (Douglas E. Quale) (05/13/91)

In article <1991May9.023313.28308@tkou02.enet.dec.com> diamond@jit533.enet@tkou02.enet.dec.com (Norman Diamond) writes:
>When mismatches are detected (which can benefit greatly from mechanical
>assistance or compilers), they can be inspected manually to see if they are
>really errors, or which part has the errors.  This is far better than skipping
>half of them and letting 10% or 90% of the other half remain undetected.
>--
>Norman Diamond       diamond@tkov50.enet.dec.com
>If this were the company's opinion, I wouldn't be allowed to post it.

This statement suggests that programs compiled in dynamic languages contain
more type errors than similar programs written in statically typed languages.
Unfortunately I don't know of any studies that have been made that could
definitively affirm or deny this.  In fact, I don't know of any evidence at
all that points one way or the other.  I think I understand why static
typing devotees believe that static typing is safer than dynamic typing,
but I think they're mistaken.  In considering the evidence presented by both
sides I show my bias by giving greater weight to those who have actually
used dynamically typed languages.  Simply stated, almost everyone experienced
in writing in dynamic languages also has experience with statically typed
languages.  The reverse is often (usually?) not the case.

Two particulars:

Dynamic typing makes program components easier to test interactively in
isolation; the components get tested again together.  You may say you do
all these tests in your statically typed language even though it's very
inconvenient.  I ask you, does everyone?  In either case I appreciate the
great saving in time and effort that an interpreter + compiler provide.
(Some statically typed languages provide interactive environments.  The
only examples that I'm familiar with are examples of type inferencing
systems much more advanced than Pascal/C/Ada type systems.  Many of these
are not as convenient to use as a dynamically typed system -- see for
instance a thread in comp.lang.functional about SML.)

Static type systems can actually be MUCH LESS SAFE than dynamic type
systems.  I mention this only because perhaps some folks haven't considered
it.  This is often a problem when a primitive static type system attempts
to emulate dynamic typing.  A specific example is provided by the X Toolkit
Intrisics.  To get information from a widget you use a call that looks like
this:

XtVaGetValues(widget, XtNwidth, &width, XtNheight, &height, NULL);

Unfortunately width and height are type XtArgVal which is a union which
completely defeats any typechecking.  The correct type is Dimension, should
I instead use int I get no help from the compiler, and my code has a type
error.  This kind of type error CANNOT occur in a dynamically typed
language.

-- Doug Quale
quale@khan.cs.wisc.edu

ajk@wren.cs.rmit.OZ.AU (Alan Kent) (05/13/91)

quale@khan.cs.wisc.edu (Douglas E. Quale) writes:

>This statement suggests that programs compiled in dynamic languages contain
>more type errors than similar programs written in statically typed languages.
>Unfortunately I don't know of any studies that have been made that could
>definitively affirm or deny this.  In fact, I don't know of any evidence at
>all that points one way or the other.

I also do not know of any studies, and have been a little swayed
towards dynamic typing by the arguments presented in this group.
However I still have some doubts. For example, what calibre are the
programmers who are using these dynamic languages? For large projects
out in industry, it is less likely that you will have a team of
hot-shot programmers. Good programmers are likely to do a good job in
any language. But what about the not so good programmers. Maybe they
need more hand-holding as provided by a static language. Are most of
the people using dynamic languages more research oriented (and so
hopefully better trained)?

The other argument I have for static typing is that it is a form of
very formal documentation. Using types specifies exactly how you expect
a piece of code to be used - exactly what values can be passed to a
function and what can be returned from a function. Sure, a clever
compiler may be able to determine these things. But that is not a form
of documentation for users. Sure, you could just use comments. But from
my limited experience in large projects, its very easy to get lazy and
leave some out. Most software does not just have to get written.
It has to be maintained later, often by someone who did not write
the original program.

My main complaint with static typing is that there are not many
languages that do a good job of it. C is pretty bad. Pascal is too
inflexible.  Eiffel, while it may have problems, is certainly a step
better. It seems to me it is not the concept of static typing which is
the problem, but rather that not many statically typed languages do a
very good job.

Oh well, that is my 2 cents worth, if its worth even that! :-)

Dr Alan Kent
Comp Sci, RMIT, Melbourne, Australia
ajk@goanna.cs.rmit.oz.au

diamond@jit533.swstokyo.dec.com (Norman Diamond) (05/13/91)

In article <1991May13.061520.11992@daffy.cs.wisc.edu> quale@khan.cs.wisc.edu (Douglas E. Quale) writes:
>In article <1991May9.023313.28308@tkou02.enet.dec.com> diamond@jit533.enet@tkou02.enet.dec.com (Norman Diamond) writes:
>>When mismatches are detected (which can benefit greatly from mechanical
>>assistance or compilers), they can be inspected manually to see if they are
>>really errors, or which part has the errors.  This is far better than skipping
>>half of them and letting 10% or 90% of the other half remain undetected.
>
>This statement suggests that programs compiled in dynamic languages contain
>more type errors than similar programs written in statically typed languages.

Try:  a smaller percentage of the type errors are caught.
To prove this would require translation from an untyped language to a typed
language, taking care to assign correct types but without correcting errors
that are caught at that stage -- then seeing what a compiler would catch.
I also don't know anyone who has done this kind of arcane study.

In general, some studies have shown that the ratio of bugs remaining in a
program (to the size of the program) tends to be proportional to the ratio
of bugs that have been caught and removed.  This relationship is not an
inverse one.  Of course, if the product lives long enough and newly caught
bugs decline in frequency, then one might expect remaining uncaught bugs
to have declined similarly.

I suggested as a rough approximation that each KIND of error shows similar
behavior, except when the compiler prevents certain kinds of errors.

>In considering the evidence presented by both
>sides I show my bias by giving greater weight to those who have actually
>used dynamically typed languages.

In chronological order, APL, Snobol, Unix shell and other command languages,
and Smalltalk.

>Dynamic typing makes program components easier to test interactively in
>isolation; the components get tested again together.  You may say you do
>all these tests in your statically typed language even though it's very
>inconvenient.

I didn't see it as inconvenient.  Now, some people see testing as
inconvenient, but I haven't heard typing asserted as the reason.

>I ask you, does everyone?

This is really an irrelevant question.  It is possible to code untested
programs in every language.  There will never be a language in which it
is impossible to write untested programs.

If a developer WANTS to be incompetent, maybe static vs. dynamic doesn't
matter -- well, maybe it does ... if a programmer wants to be incompetent
and lazy, then dynamic has its advantages.  (PLEASE NOTE:  this is not
intended as a slur against dynamic typing, or against valid arguments
for dynamic typing.  In fact, I also need dynamic typing on occasion.)

>Static type systems can actually be MUCH LESS SAFE than dynamic type systems.

Static typing is not less safe than dynamic typing.  Escapes from static
typing can certainly be unsafe, and incompetent escapes most likely are.

>A specific example is provided by the X Toolkit Intrisics.
>To get information from a widget you use a call that looks like this:
>  XtVaGetValues(widget, XtNwidth, &width, XtNheight, &height, NULL);
>Unfortunately width and height are type XtArgVal which is a union which
>completely defeats any typechecking.  The correct type is Dimension, should
>I instead use int I get no help from the compiler, and my code has a type
>error.  This kind of type error CANNOT occur in a dynamically typed
>language.

It could occur in a dynamically typed language, if the programmer can tell
the language to throw away tagging and type-checking.  A competent programmer
would put a tag in the union type.  (C is not an encouraging language for
this kind of practice, but it still could be done.)
--
Norman Diamond       diamond@tkov50.enet.dec.com
If this were the company's opinion, I wouldn't be allowed to post it.
Permission is granted to feel this signature, but not to look at it.

quale@saavik.cs.wisc.edu (Douglas E. Quale) (05/15/91)

In article <1991May13.082152.29294@tkou02.enet.dec.com> diamond@jit533.enet@tkou02.enet.dec.com (Norman Diamond) writes:
>In article <1991May13.061520.11992@daffy.cs.wisc.edu> quale@khan.cs.wisc.edu (Douglas E. Quale) writes:
>
>>A specific example is provided by the X Toolkit Intrisics.
>>To get information from a widget you use a call that looks like this:
>>  XtVaGetValues(widget, XtNwidth, &width, XtNheight, &height, NULL);
>>Unfortunately width and height are type XtArgVal which is a union which
>>completely defeats any typechecking.  The correct type is Dimension, should
>>I instead use int I get no help from the compiler, and my code has a type
>>error.  This kind of type error CANNOT occur in a dynamically typed
>>language.
>
>It could occur in a dynamically typed language, if the programmer can tell
>the language to throw away tagging and type-checking.  A competent programmer
>would put a tag in the union type.  (C is not an encouraging language for
>this kind of practice, but it still could be done.)
>

In lisp, the most ``common'' dynamically typed language, you cannot throw away
tagging.  You are, however, somewhat at the mercy of the compiler in regards
to type checking unless you explicitly make the checks yourself.

But the fact is, this particular error does NOT occur in a dynamically typed
language.  If you or anyone else reading this is familiar with the X
Toolkit Intrinsics and can see a way to avoid this problem I'd like to hear
about it.  The designers of the Intrinsics found that C's static typing
wasn't adequate for their needs and built an unchecked dynamic type system
on top of it.  Now C's type system is very primitive, but I'm not certain
that any static type system is flexible enough to handle this problem.  The
most promising approach might be a statically typed object oriented language.

-- Doug Quale
quale@saavik.cs.wisc.edu

olson@juliet.ll.mit.edu ( Steve Olson) (05/15/91)

In article <1991May15.141839.20603@daffy.cs.wisc.edu> quale@saavik.cs.wisc.edu (Douglas E. Quale) writes:
   In lisp, the most ``common'' dynamically typed language, you cannot throw away
   tagging.

Careful, there is no single thing called "lisp".  (Its more like a state
of mind :-) This is exactly what an optimizing Common Lisp compiler will do.
Spectacular errors (worthy of C :-) will result if you lie when you make
your type declarations.

--
-- Steve Olson
-- MIT Lincoln Laboratory
-- olson@juliet.ll.mit.edu
--

diamond@jit533.swstokyo.dec.com (Norman Diamond) (05/16/91)

In article <1991May15.141839.20603@daffy.cs.wisc.edu> quale@saavik.cs.wisc.edu (Douglas E. Quale) writes:
>In article <1991May13.082152.29294@tkou02.enet.dec.com> diamond@jit533.enet@tkou02.enet.dec.com (Norman Diamond) writes:
>>In article <1991May13.061520.11992@daffy.cs.wisc.edu> quale@khan.cs.wisc.edu (Douglas E. Quale) writes:
>>>  XtVaGetValues(widget, XtNwidth, &width, XtNheight, &height, NULL);
>>>Unfortunately width and height are type XtArgVal which is a union which
>>>completely defeats any typechecking.
>>>This kind of type error CANNOT occur in a dynamically typed language.
>>
>>It could occur in a dynamically typed language, if the programmer can tell
>>the language to throw away tagging and type-checking.  A competent programmer
>>would put a tag in the union type.
>
>In lisp, the most ``common'' dynamically typed language, you cannot throw away
>tagging.

I did say "if".

>But the fact is, this particular error does NOT occur in a dynamically typed
>language.

It could be made to occur.  (Not that I would recommend it, of course.)

>If you or anyone else reading this is familiar with the X Toolkit Intrinsics
>and can see a way to avoid this problem I'd like to hear about it.

As I said, C is not an encouraging language for it, but a competent programmer
could put a tag in.  (C allows a programmer to support dynamic typing, though
C itself does not support it, and in fact cumbersome syntax discourages it.)

typedef enum XtArgTag {
        XtArgCaddr,
        XtArgCharPtr,
        XtArgLong,
        XtArgIntPtr,
        XtArgProcPtr,
} XtArgTag;
typedef union XtArgUnion {
        caddr_t     Caddr;
        char       *CharPtr;
        long        Long;
        int        *IntPtr;
        proc       *ProcPtr;
} XtArgUnion;
typedef struct XtArgVal {
        XtArgTag    Tag;
        XtArgUnion  Val;
} XtArgVal;

(I have tried to maintain the same style as the typename "XtArgVal".
My personal naming style conflicts with a Posix standard.)

>The designers of the Intrinsics found that C's static typing wasn't adequate

Of course.

>and built an unchecked dynamic type system on top of it.

That's their problem.  And then they made life difficult for their clients.

>Now C's type system is very primitive, but I'm not certain
>that any static type system is flexible enough to handle this problem.

Pascal handled this problem -- and the syntax deliberately encouraged this
kind of checking.

>most promising approach might be a statically typed object oriented language.

Of course, object orientation is useful, but object orientation is really
orthogonal (or nearly so) to issues of static vs. dynamic typing.
--
Norman Diamond       diamond@tkov50.enet.dec.com
If this were the company's opinion, I wouldn't be allowed to post it.
Permission is granted to feel this signature, but not to look at it.

quale@saavik.cs.wisc.edu (Douglas E. Quale) (05/16/91)

One of the difficulties that proponents of static typing have in convincing
folks who enjoy the benefits of dynamic typing to switch is that the type
systems of the most popular statically typed languages are very primitive
and inflexible.  Compare C/Pascal/Ada to Common Lisp, for instance.
The claimed benefit of extra safety simply doesn't exist with most C
compilers.  ANSI C is a big improvement in this regard but backward
compatibility is a real hindrance to type safety.

Another issue is that _very_ often I don't care what type a value is.  It
may be the result of a library call and then simply be fed into another call.
For instance, I recently spent more than a couple of minutes figuring out
what was wrong with a C program with this fragment:

{
	...
	double f, foo;
	int i;

	...

	f = modf(foo, &i);
	...
	switch (i) { ... }
}

This is a type error that the $%#!!% compiler didn't catch (Yes, I did
include math.h).  All that was done with i was to use it in a switch
statement, and no runtime errors occured, it just gave the wrong answers
(i was always 0).  It wasn't immediately obvious that the modf call was to
blame, but once I got that far checking the man page I found that modf is
double modf(double, *double).  Changing i to double fixed the problem.
(This error occured because a certain useless pocket reference to the C
libraries said i should be int.  I'm glad I didn't pay anything for it,
or I'd be even more pissed.)

Ada would have caught this type error, but among other things I don't have
an Ada compiler.  (Even if I did, I'd have to learn Ada to be able to get
much use from it. :)

The point is, I don't care what numeric type i is.  In a dynamically typed
language this type error would not occur.  In dynamically typed languages
a lot of things that can be type errors in static languages simply *cannot*
cause errors.  I think much of the belief in the extra safety of statically
typed languages is based on experience with C, where uncaught type errors
are a common and serious problem.  C is a terrible example that gives all
of the disadvantages of static typing with very little safety in return.

-- Doug Quale
quale@saavik.cs.wisc.edu

quale@picard.cs.wisc.edu (Douglas E. Quale) (05/16/91)

In article <1991May16.011804.21042@tkou02.enet.dec.com> diamond@jit533.enet@tkou02.enet.dec.com (Norman Diamond) writes:
>
>typedef enum XtArgTag {
>        XtArgCaddr,
>        XtArgCharPtr,
>        XtArgLong,
>        XtArgIntPtr,
>        XtArgProcPtr,
>} XtArgTag;
>typedef union XtArgUnion {
>        caddr_t     Caddr;
>        char       *CharPtr;
>        long        Long;
>        int        *IntPtr;
>        proc       *ProcPtr;
>} XtArgUnion;
>typedef struct XtArgVal {
>        XtArgTag    Tag;
>        XtArgUnion  Val;
>} XtArgVal;
>

I think there are reasons why the X Toolkit Intrinsics doesn't do this.
One of the design goals was binary compatibility with past versions.

Notice that when you are forced to simulate dynamic typing in a statically
typed language you can't do it as well as a dynamically typed language can.
The explicit tags are bulky and slow compared to the representation tricks
used by lisp compilers.  Also the syntax is abominable, as you mentioned.

To boot, when (or in my case more likely if) I write my own widget and
introduce a new arg type I have to use Caddr, and type safety is shot to hell.
Dynamically typed languages are safer than primitive statically typed
languages such as C.

-- Doug Quale
quale@picard.cs.wisc.edu

quale@picard.cs.wisc.edu (Douglas E. Quale) (05/16/91)

In article <OLSON.91May15145347@goneril.juliet.ll.mit.edu> olson@juliet.ll.mit.edu ( Steve Olson) writes:
>In article <1991May15.141839.20603@daffy.cs.wisc.edu> quale@saavik.cs.wisc.edu (Douglas E. Quale) writes:
>   In lisp, the most ``common'' dynamically typed language, you cannot throw away
>   tagging.
>
>Careful, there is no single thing called "lisp".  (Its more like a state
>of mind :-) This is exactly what an optimizing Common Lisp compiler will do.
>Spectacular errors (worthy of C :-) will result if you lie when you make
>your type declarations.
>

Yes, this can happen but in general Common Lisp doesn't get rid of the tags
(because it can't).  Suppose foo takes a single integer argument.  If foo
is visible to the world the CL compiler can't find every invocation at
compile time and so the call (foo 0) will require 0 to be tagged.  It is
possible to give foo two entry points, one for regular boxed data calls and
another for unboxed data, but in general the tag will be there.  Usually
optimization tells the compiler not to look at the tags.  This can cause
spectacular failures.

You can shoot yourself in the foot in practically any language, but a
language like C encourages unportable, unsafe practices.  In Common Lisp
it's more work to write unsafe code than safe code.

-- Doug Quale
quale@picard.cs.wisc.edu

cok@islsun.Kodak.COM (David Cok) (05/17/91)

In article <1991May16.134106.24421@daffy.cs.wisc.edu> quale@saavik.cs.wisc.edu (Douglas E. Quale) writes:
>For instance, I recently spent more than a couple of minutes figuring out
>what was wrong with a C program with this fragment:
>
>{
>	...
>	double f, foo;
>	int i;
>
>	...
>
>	f = modf(foo, &i);
>	...
>	switch (i) { ... }
>}
>
>This is a type error that the $%#!!% compiler didn't catch (Yes, I did
>include math.h).  All that was done with i was to use it in a switch
>statement, and no runtime errors occured, it just gave the wrong answers
>(i was always 0).  It wasn't immediately obvious that the modf call was to
>blame, but once I got that far checking the man page I found that modf is
>double modf(double, *double).  Changing i to double fixed the problem.
>(This error occured because a certain useless pocket reference to the C
>libraries said i should be int.  I'm glad I didn't pay anything for it,
>or I'd be even more pissed.)

It is well(?) known that C does NO type checking on arguments to subroutines,
even with appropriate include files (prototypes weren't invented yet).  But
if you are using C you really ought to use lint to add to the rudimentary
type checking in the compiler.  That still leaves lint+C with both an
inadequate type system and inadequate type checking, but in some
environments we have to live with it.  It would have caught this error.

>
>The point is, I don't care what numeric type i is.  In a dynamically typed
>language this type error would not occur.  

It seems to me it still would:  if modf returned for you an i which was a double
and the switch statement knew only how to handle int's, you would get a
runtime error because of mismatched types.  (If you allow for smarter switch
statements or implicit conversion, you've changed the comparison to one of
flexibility of the type system rather than static vs. dynamic typing.)

> In dynamically typed languages
>a lot of things that can be type errors in static languages simply *cannot*
>cause errors.  
I believe that most dynamically typed languages have more flexible type
systems than most statically typed systems, but I need convincing on the 
point above.  Separate the issue of the type system from dynamic vs. static.
Suppose you had two languages with the same type system (e.g. same set of
functions and operators accepting the same sets of types of arguments), but
one was statically typed and the other dynamically typed.  The dynamically
typed language would catch all type errors at runtime, presumably with good
diagnostics about just what variables and types and functions were involved.
The statically typed system would catch some type errors at compile time.  
The ones it did not catch would cause difficult to find errors -- at least
they do in traditional C.  The same set of errors would occur.  So I do not
understand why "a lot of things that can be type errors in static languages 
simply *cannot* cause errors [in a dynamically typed language]".

The issue of uncaught type errors is actually because the statically typed
language is insufficiently *strong*, not because it is static.  There are
three variables that get confused in these discussions:
		expressiveness of the type system
		dynamic vs. static
		strong vs. weak

>I think much of the belief in the extra safety of statically
>typed languages is based on experience with C, where uncaught type errors
>are a common and serious problem.  C is a terrible example that gives all
>of the disadvantages of static typing with very little safety in return.

C is a prime example of static weak typing (weak in the sense that not all type
errors are caught at either run or compile time); I see no reason why any
current compiler should not be strong for an ANSI-C type language, barring
language allowed loopholes (like pointer casting).  I see no reason why anyone
would develop a dynamic weak language.  Newer static languages (e.g. C++) are
strong with loopholes ("segmentation fault" with termination being as bad for
the user as "message not understood" followed by termination).
Furthermore, if you count domain errors (such as divide by zero) as type errors,
no language is completely statically typed; it may still be strong if that error
is caught and reported at run-time; out-of-range errors are sometimes uncaught 
(e.g. C), and sometimes are caught at run-time (always or switchable at 
compile time).   So we are left with
		strong dynamic languages
		strong mostly static languages

With the *same type system* these will catch the *same* errors -- and I'd
rather catch as many at compile-time as possible.  BUT,  it seems to me that
it is much easier to provide a more flexible type system in a dynamically
typed environment; languages which have appeared to date bear this out (the
presence of type loopholes in at least some static languages is a tacit
admission that the type system is not flexible enough).  And
flexibility in typing is important for reuse of code and hence for quick
prototyping.  The challenge is to provide a flexible type system in which the 
compiler can do sufficient type analysis so that (a) as many type errors are 
reported by the compiler as is feasible (b) as many type tags are optimized 
away as is possible and (c) the rest of the potential errors are dynamically 
checked.  This would give a strong expressive language with a mixture of 
static and dynamic typing.  Maybe the compiler would even give switches -t0 .. 
-t9 trading off speed of compilation vs. amount of static/dynamic checking.

David R. Cok
Eastman Kodak Company
cok@Kodak.COM

olson@juliet.ll.mit.edu ( Steve Olson) (05/17/91)

In article <1991May16.134106.24421@daffy.cs.wisc.edu> quale@saavik.cs.wisc.edu (Douglas E. Quale) writes:
   The point is, I don't care what numeric type i is.  In a dynamically typed
   language this type error would not occur.  In dynamically typed languages
   a lot of things that can be type errors in static languages simply *cannot*
   cause errors.  I think much of the belief in the extra safety of statically
   typed languages is based on experience with C, where uncaught type errors
   are a common and serious problem.  C is a terrible example that gives all
   of the disadvantages of static typing with very little safety in return.

   -- Doug Quale
   quale@saavik.cs.wisc.edu


I agree with that, but as was pointed out to me previously, when we think we 
are debating static vs. dynamic typing, we sometimes are actually debating
abstract types vs. machine types.  I think that a lot of type errors come when
types that are conceptually very similar to a human are not at all the same to
the machine.  

I make an order of magnitude more type errors in Common Lisp when I insert 
type declarations for optimizing purposes.  All of a sudden the difference 
between int and float and whatever becomes very important, and it *is* hard 
to keep it all straight without help from the compiler.

--
-- Steve Olson
-- MIT Lincoln Laboratory
-- olson@juliet.ll.mit.edu
--

diamond@jit533.swstokyo.dec.com (Norman Diamond) (05/17/91)

In article <1991May16.153308.4054@spool.cs.wisc.edu> quale@picard.cs.wisc.edu (Douglas E. Quale) writes:
>In article <1991May16.011804.21042@tkou02.enet.dec.com> diamond@jit533.enet@tkou02.enet.dec.com (Norman Diamond) writes:
>>[ugly C syntax to do programmer-computed dynamic typing of some X11 values]
>I think there are reasons why the X Toolkit Intrinsics doesn't do this.
>One of the design goals was binary compatibility with past versions.

Uh, if you want binary compatibility between existing C code and a
dynamically-typed language, you'll have a lot of problems.  When we
were discussing whether static typing eliminates all possibility of
tagging and type-checking, I thought we were talking about programs
that might be written by competent programmers, not about binary
compatibility with random nonsense.

>Notice that when you are forced to simulate dynamic typing in a statically
>typed language you can't do it as well as a dynamically typed language can.

Of course.  I did say that when dynamic typing is necessary, the programmer
should be able to tell the compiler to do it; and when efficiency or strict
early static checking are necessary, the programmer should be able to tell
the compiler to do that too.  And that C is not an encouraging basis for
programmer simulation of either of these.

>The explicit tags are bulky and slow compared to the representation tricks
>used by lisp compilers.  Also the syntax is abominable, as you mentioned.

Yes.  And as I said, the Pascal language supported exactly this kind of tag,
and nothing would stop a Pascal compiler from using representation tricks.
(Although if the programmer for some reason demands ord() of the tag, then
there will have to be code to convert from the tricky representation.)

>Dynamically typed languages are safer than primitive statically typed
>languages such as C.

No.  Dynamically typed languages are safer than C.  Statically typed
languages are also safer than C.  Languages that permit the programmer to
specify each kind of typing when necessary will be safer than all of the above.
--
Norman Diamond       diamond@tkov50.enet.dec.com
If this were the company's opinion, I wouldn't be allowed to post it.
Permission is granted to feel this signature, but not to look at it.

quale@saavik.cs.wisc.edu (Douglas E. Quale) (05/17/91)

In article <1991May16.175340.25522@kodak.kodak.com> cok@islsun.Kodak.COM (David Cok) writes:
>In article <1991May16.134106.24421@daffy.cs.wisc.edu> quale@saavik.cs.wisc.edu (Douglas E. Quale) writes:
>>For instance, I recently spent more than a couple of minutes figuring out
>>what was wrong with a C program with this fragment:
>>
>>{
>>	...
>>	double f, foo;
>>	int i;
>>
>>	...
>>
>>	f = modf(foo, &i);
>>	...
>>	switch (i) { ... }
>>}
>>
>>The point is, I don't care what numeric type i is.  In a dynamically typed
>>language this type error would not occur.  
>
>It seems to me it still would:  if modf returned for you an i which was a double
>and the switch statement knew only how to handle int's, you would get a
>runtime error because of mismatched types.  (If you allow for smarter switch
>statements or implicit conversion, you've changed the comparison to one of
>flexibility of the type system rather than static vs. dynamic typing.)
>

Yes.  I'd get an error _and_an_error_message_ the first time I tried the code.
C gives me no error message, just the wrong answer.  IMHO the lisp behavior
is infinitely better in this regard.  Are you actually defending silently
giving the wrong answer to "Error in switch: expected an int but got a double"
??????  This bug gets fixed in a dynamic language in the first pass, utterly
trivially.  What makes you so certain that the programmer would even know
that the C version was bugged?  No error messages are given.  What if the
program just happens to give the expected answers on the test cases despite
the bug?  I'd consider this an overwhelming victory for the dynamically typed
language.  Guarranteed to find the bug, trivial to fix.  C, good luck.

>> In dynamically typed languages
>>a lot of things that can be type errors in static languages simply *cannot*
>>cause errors.  
>I believe that most dynamically typed languages have more flexible type
>systems than most statically typed systems, but I need convincing on the 
>point above.
>....  The same set of errors would occur.  So I do not
>understand why "a lot of things that can be type errors in static languages 
>simply *cannot* cause errors [in a dynamically typed language]".
>

This point is very simple, but perhaps not obvious.  For an example I'll use
the X Toolkit again.

	int width;
	XtVaGetValues(widget, XtNwidth, &width, NULL);

will compile fine and run without giving any error diagnostics.  (For those
not familiar with the X Toolkit, XtVaGetValues is a procedure that takes a
widget object and a varargs list of name, value pairs and retrieves those
properties from the widget.  In order to handle properties of many different
types, the values are type XtPointer which is (usually) caddr_t.  This is a
black hole as far as type checking goes.)  The problem is that the correct
type of an XtNwidth property is Dimension, (which is often) not int.  This
is a type error and C doesn't catch it.  In lisp there is no type error.
The type is latent, stored with width.  If the X Toolkit decided that XtNwidth
should be a bignum there would be *no* changes required to the lisp code.
In the unlikely case that other lisp code makes the erroneous assumption that
width is an int the error will be detected and I'll get an error message.

>one was statically typed and the other dynamically typed.  The dynamically
>typed language would catch all type errors at runtime, presumably with good
>diagnostics about just what variables and types and functions were involved.
>The statically typed system would catch some type errors at compile time.  
>The ones it did not catch would cause difficult to find errors -- at least
>they do in traditional C.  The same set of errors would occur.  So I do not
>understand why "a lot of things that can be type errors in static languages 
>simply *cannot* cause errors [in a dynamically typed language]".
>

See above.  The same set of errors do not occur.  And remember, many difficult
to find errors simply aren't found and the product goes out the door.

-- Doug Quale
quale@saavik.cs.wisc.edu

P.S.  The C compiler won't even give an error diagnostic if you omit the NULL
at the end of of the varargs argument list.  You do get the very handy
"Segmentation fault -- Core dumped" diagnostic when you try to run it.
A language with more flexible and robust argument passing is a lot nicer.

quale@saavik.cs.wisc.edu (Douglas E. Quale) (05/17/91)

In article <1991May17.011209.29486@tkou02.enet.dec.com> diamond@jit533.enet@tkou02.enet.dec.com (Norman Diamond) writes:
>In article <1991May16.153308.4054@spool.cs.wisc.edu> quale@picard.cs.wisc.edu (Douglas E. Quale) writes:
>>In article <1991May16.011804.21042@tkou02.enet.dec.com> diamond@jit533.enet@tkou02.enet.dec.com (Norman Diamond) writes:
>>>[ugly C syntax to do programmer-computed dynamic typing of some X11 values]
>>I think there are reasons why the X Toolkit Intrinsics doesn't do this.
>>One of the design goals was binary compatibility with past versions.
>
>Uh, if you want binary compatibility between existing C code and a
>dynamically-typed language, you'll have a lot of problems.  When we

No, I mean binary compatibility between different C versions of the Toolkit.
Adding new types messes up the tags.  In C the type union approach is awkward
as the X Toolkit resources have a lot of different types.  Unfortunately I
don't have a complete list here, but it includes int, String *, WidgetList,
Function, Cardinal, AcceleratorTable, Pixel, Pixmap, Colormap, Boolean, Screen,
TranslationTable, Dimension, Callback, Position, Visual, Atom, Widget,
InitialState, Window, CallbackList, Cursor, FontStruct*, Justify, ShapeStyle,
Orientation, float, BackingStore, TextPosition, TextResizeMode, TextScrollMode,
TextSelectType*, EditType, AsciiType, and WrapMode.  My apologies to anyone
I've inadvertantly overlooked, and I hope I didn't repeat any types.  These
are only the types of the resources for the Athena widgets, the Motif widget
resources have a *lot* more types, and it must be possible for any widget
writer to add still more.

>
>>Dynamically typed languages are safer than primitive statically typed
>>languages such as C.
>
>No.  Dynamically typed languages are safer than C.  Statically typed
      ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

I believe that's what I said.

-- Doug Quale
quale@saavik.cs.wisc.edu

Primitive statically typed languages are languages that practically force the
programmer to use an unchecked simulation of dynamic typing.  For a start
they generally lack polymorphism, and frequently have little support for
generics at all.  Sadly, a vast majority of the programs written in static
languages are in languages that have very primitive type systems.

cok@islsun.Kodak.COM (David Cok) (05/18/91)

In article <1991May17.045347.26722@daffy.cs.wisc.edu> quale@saavik.cs.wisc.edu (Douglas E. Quale) writes:
>In article <1991May16.175340.25522@kodak.kodak.com> cok@islsun.Kodak.COM (David Cok) writes:
>>In article <1991May16.134106.24421@daffy.cs.wisc.edu> quale@saavik.cs.wisc.edu (Douglas E. Quale) writes:
>>>For instance, I recently spent more than a couple of minutes figuring out
>>>what was wrong with a C program with this fragment:
>>>
>>>{
>>>	...
>>>	double f, foo;
>>>	int i;
>>>
>>>	...
>>>
>>>	f = modf(foo, &i);
>>>	...
>>>	switch (i) { ... }
>>>}
>>>
>>>The point is, I don't care what numeric type i is.  In a dynamically typed
>>>language this type error would not occur.  
>>
>>It seems to me it still would:  if modf returned for you an i which was a double
>>and the switch statement knew only how to handle int's, you would get a
>>runtime error because of mismatched types.  (If you allow for smarter switch
>>statements or implicit conversion, you've changed the comparison to one of
>>flexibility of the type system rather than static vs. dynamic typing.)
>>
>
>Yes.  I'd get an error _and_an_error_message_ the first time I tried the code.
>C gives me no error message, just the wrong answer.  IMHO the lisp behavior
>is infinitely better in this regard.  Are you actually defending silently
>giving the wrong answer to "Error in switch: expected an int but got a double"
>??????  This bug gets fixed in a dynamic language in the first pass, utterly
>trivially.  What makes you so certain that the programmer would even know
>that the C version was bugged?  No error messages are given.  What if the
>program just happens to give the expected answers on the test cases despite
>the bug?  I'd consider this an overwhelming victory for the dynamically typed
>language.  Guarranteed to find the bug, trivial to fix.  C, good luck.
>
No I am absolutely not defending the idea of  "silently
giving the wrong answer to "Error in switch: expected an int but got a double"'
Read a little more carefully.  C is a bad comparison, as I point out, because
it has so many holes in its type checking.  I was asking for a comparison 
between a dynamically typed language and a strongly statically typed language.
They both would show the error (one a run-time and one at compile-time).  I'm
not advocating one or the other here (and certainly not advocating C) --
just trying to understand the benefits and drawbacks of different systems.

>>> In dynamically typed languages
>>>a lot of things that can be type errors in static languages simply *cannot*
>>>cause errors.  
>>I believe that most dynamically typed languages have more flexible type
>>systems than most statically typed systems, but I need convincing on the 
>>point above.
>>....  The same set of errors would occur.  So I do not
>>understand why "a lot of things that can be type errors in static languages 
>>simply *cannot* cause errors [in a dynamically typed language]".
>>
>
>This point is very simple, but perhaps not obvious.  For an example I'll use
>the X Toolkit again.
>
>	int width;
>	XtVaGetValues(widget, XtNwidth, &width, NULL);
>
>will compile fine and run without giving any error diagnostics.  (For those
>not familiar with the X Toolkit, XtVaGetValues is a procedure that takes a
>widget object and a varargs list of name, value pairs and retrieves those
>properties from the widget.  In order to handle properties of many different
>types, the values are type XtPointer which is (usually) caddr_t.  This is a
>black hole as far as type checking goes.)  The problem is that the correct
>type of an XtNwidth property is Dimension, (which is often) not int.  This
>is a type error and C doesn't catch it.  In lisp there is no type error.
>The type is latent, stored with width.  If the X Toolkit decided that XtNwidth
>should be a bignum there would be *no* changes required to the lisp code.
>In the unlikely case that other lisp code makes the erroneous assumption that
>width is an int the error will be detected and I'll get an error message.
>

Again your example does not fill the bill -- you pick one where the type
checking is turned off, i.e. a weakly typed system.  If you want to argue
that no current statically typed language is flexible enough to do the
right things with variable length argument lists  -- you do not have to 
convince me.  But don't say that you do not get type errors in dynamically
typed languages -- you do; what you don't get are uncaught type errors -- that's
a plus -- at run-time -- that's a minus.

>>one was statically typed and the other dynamically typed.  The dynamically
>>typed language would catch all type errors at runtime, presumably with good
>>diagnostics about just what variables and types and functions were involved.
>>The statically typed system would catch some type errors at compile time.  
>>The ones it did not catch would cause difficult to find errors -- at least
>>they do in traditional C.  The same set of errors would occur.  So I do not
>>understand why "a lot of things that can be type errors in static languages 
>>simply *cannot* cause errors [in a dynamically typed language]".
>>
>
>See above.  The same set of errors do not occur.  And remember, many difficult
>to find errors simply aren't found and the product goes out the door.

The same set of errors do not occur in *C* -- no argument on the fact that C's
type checking is abysmal and type errors go out the door.

>
>-- Doug Quale
>quale@saavik.cs.wisc.edu
>
>P.S.  The C compiler won't even give an error diagnostic if you omit the NULL
>at the end of of the varargs argument list.  You do get the very handy
>"Segmentation fault -- Core dumped" diagnostic when you try to run it.
>A language with more flexible and robust argument passing is a lot nicer.

Agree with the last sentence -- just wondering how to make that language as
statically typed as possible without overly restricting flexibility.

David R. Cok
Eastman Kodak Company
cok@Kodak.COM

quale@khan.cs.wisc.edu (Douglas E. Quale) (05/18/91)

In article <1991May17.221539.22844@kodak.kodak.com> cok@islsun.Kodak.COM (David Cok) writes:
>In article <1991May17.045347.26722@daffy.cs.wisc.edu> quale@saavik.cs.wisc.edu (Douglas E. Quale) writes:
>>>> In dynamically typed languages
>>>>a lot of things that can be type errors in static languages simply *cannot*
>>>>cause errors.  
>>>....  The same set of errors would occur.  So I do not
>>>understand why "a lot of things that can be type errors in static languages 
>>>simply *cannot* cause errors [in a dynamically typed language]".
>>>
>>
>>This point is very simple, but perhaps not obvious.  For an example I'll use
>>the X Toolkit again.
>>
>>	int width;
>>	XtVaGetValues(widget, XtNwidth, &width, NULL);
>
>Again your example does not fill the bill -- you pick one where the type
>checking is turned off, i.e. a weakly typed system.  If you want to argue
>that no current statically typed language is flexible enough to do the
>right things with variable length argument lists  -- you do not have to 
>convince me.  But don't say that you do not get type errors in dynamically
>typed languages -- you do; what you don't get are uncaught type errors -- that's

I guess I should have gone a little farther with my example.
I have two widgets, widget1 and widget2.  I want to set widget2's width
to be the same as the width of widget1.

{
	int width;
	XtVaGetValues(widget1, XtNwidth, &width, NULL);
	XtVaSetValues(widget2, XtNwidth, width, NULL);
}

This code is bugged because I used the wrong type for width.  But I really,
really, really don't care what type width is.  In lisp,

	(let ((width (xtgetvalue widget1 xtnwidth)))
	  (xtsetvalue widget2 xtnwidth width))

has no type errors, and can't have any.  Not uncaught type errors,
no type errors, period.

-- Doug Quale
quale@saavik.cs.wisc.edu

diamond@jit533.swstokyo.dec.com (Norman Diamond) (05/20/91)

In article <1991May17.051840.26916@daffy.cs.wisc.edu> quale@saavik.cs.wisc.edu (Douglas E. Quale) writes:
>In article <1991May17.011209.29486@tkou02.enet.dec.com> diamond@jit533.enet@tkou02.enet.dec.com (Norman Diamond) writes:
>>In article <1991May16.153308.4054@spool.cs.wisc.edu> quale@picard.cs.wisc.edu (Douglas E. Quale) writes:
>>>Dynamically typed languages are safer than primitive statically typed
>>>languages such as C.
>>
>>No.  Dynamically typed languages are safer than C.  Statically typed
>      ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
>I believe that's what I said.

It isn't.  Your previous statement confused statically typed languages
with C.  Your example only proved that dynamically typed languages are
safer than one particular collection of C programs.  You didn't even
prove that dynamically typed languages are safer than C in general,
but I made this generalization intuitively.

Now, statically typed languages are also safer than C.  Your same
example (the X-window system), though it does not prove this statement,
remains a valid example.

You play word games in order to create the assertion that dynamically
typed languages are safer than statically typed languages.  If you have
a shred of evidence to support this assertion, it is better to show
your evidence than to play word games.
--
Norman Diamond       diamond@tkov50.enet.dec.com
If this were the company's opinion, I wouldn't be allowed to post it.
Permission is granted to feel this signature, but not to look at it.

chisnall@cosc.canterbury.ac.nz (The Technicolour Throw-up) (05/20/91)

From article <1991May13.061520.11992@daffy.cs.wisc.edu>, by quale@khan.cs.wisc.edu (Douglas E. Quale):
[...]
> This statement suggests that programs compiled in dynamic languages contain
> more type errors than similar programs written in statically typed languages.
> Unfortunately I don't know of any studies that have been made that could
> definitively affirm or deny this.  In fact, I don't know of any evidence at

You could try looking at:

"An experimental evaulation of data type conventions" by J.D. Gannon,
in Communications of the ACM 20 (8), august 1977, pp 584 - 595

which contains the results of a comparison between statically typed and
untyped languages.  Apparently the author had previously looked at the
question of static vs. dynamic typing (but, unfortunately, published it
in some obscure volume that we don't have here :-(. In the preamble to
the above mentioned paper he briefly summarises some of the results of
this prior comparison:

    "... This evidence indicated that the use of dynamically typed
    operands resulted in errors that remain in programs longer than
    the errors attributable to statically typed operands."

The above paper concludes, unsurprisingly, that static is better than
untyped.

I'm not too sure, however, whether his results are applicable to all
languages.  His experiments were performed with variations on algol-like
languages where the variations were in the type system.  I suspect that
some of his results may say just as much about his choice of the underlying
language as about the type systems imposed upon them.

I think that a distinction that it may be useful to draw here is between
declarative languages (e.g. prolog,ML,hope,miranda,smalltalk) which have
a highly modular structure, and non-declarative languages (such as pascal,
fortran,cobol,ada) which tend, imho, to give rise to rather long-winded
programs.  Gannon's results can be seen as a demonstration, then, that
static typing is definitely better for nd languages (which is his under-
lying model).

> Dynamic typing makes program components easier to test interactively in
> isolation; the components get tested again together.  You may say you do

I don't see this at all.  What has typing got to do with testing program
components.  It seems to me that far more important than the type system
is whether the language is declarative (i.e highly modular) or not.
Imagine a version of pascal with dynamic typing - is it easy to test
program components?  Contrast this with declarative languages with static
typing (e.g. ML,hope,miranda,..) - easy to test components? You bet.

> -- Doug Quale
> quale@khan.cs.wisc.edu

--
 "Merely corrobarative detail, intended to give artistic verisimilitude
 to an otherwise bald and unconvincing narrative"    -- W.S. Gilbert 

Name: Michael Chisnall		email: chisnall@cosc.canterbury.ac.nz

rockwell@socrates.umd.edu (Raul Rockwell) (05/20/91)

Douglas E. Quale:
   > This statement suggests that programs compiled in dynamic
   > languages contain more type errors than similar programs written
   > in statically typed languages.  ...

Michael Chisnall:
   [ see previous article for CACM reference and background ]

   I think that a distinction that it may be useful to draw here is
   between declarative languages (e.g.  prolog, ML, hope, miranda,
   smalltalk) which have a highly modular structure, and
   non-declarative languages (such as pascal, fortran, cobol, ada)
   which tend, imho, to give rise to rather long-winded programs.
   Gannon's results can be seen as a demonstration, then, that static
   typing is definitely better for nd languages (which is his under-
   lying model).

Hmm... yes.  The way I remember this group of threads started off with
the assertion that dynamically typed languages were less error prone
than statically typed languages because (paraphrasing slightly)
dynamically typed languages are more declarative in nature.

Maybe the emphasis should have been on declaratively typed languages
vs. non-declaratively typed languages :-)?

Douglas E. Quale:
   > Dynamic typing makes program components easier to test
   > interactively in isolation...


Michael Chisnall:
   I don't see this at all.  What has typing got to do with testing
   program components.  It seems to me that far more important than
   the type system is whether the language is declarative (i.e highly
   modular) or not.  ...

Am I the only one interested in pointing out that an infinite loop is
a type error?  Consider a recent bug I encountered:  a 'continue' inside
an 'if' statement which should have been outside -- without the
continue the code would occasionally fall through to a different state
and would never terminate.  Yeah, that was in C, but testing is the
only real good way to find statements out of order.

Raul Rockwell

sakkinen@jyu.fi (Markku Sakkinen) (05/21/91)

In article <ROCKWELL.91May20081117@socrates.umd.edu> rockwell@socrates.umd.edu (Raul Rockwell) writes:
> ...
>Michael Chisnall:
>   [ see previous article for CACM reference and background ]
>
>   I think that a distinction that it may be useful to draw here is
>   between declarative languages (e.g.  prolog, ML, hope, miranda,
>   smalltalk) which have a highly modular structure, and
>   non-declarative languages (such as pascal, fortran, cobol, ada)
>   ...

BTW, Smalltalk declarative?!
Prolog has a modular structure?!

>Hmm... yes.  The way I remember this group of threads started off with
>the assertion that dynamically typed languages were less error prone
>than statically typed languages because (paraphrasing slightly)
>dynamically typed languages are more declarative in nature.
>
>Maybe the emphasis should have been on declaratively typed languages
>vs. non-declaratively typed languages :-)?

"Started off" at some point of time?  Hasn't it been going on
since time immemorial? :-)  Anyway, it looks like a silly statement
(has anybody really made it?) that having to _declare_ the types
of variables would make a language _less_ declarative.

> ...
>Am I the only one interested in pointing out that an infinite loop is
>a type error?

So it seems:  you have probably insisted on that some 10 times
during the last few months without getting many supporters.
Most people evidently feel (like myself) that lumping too much
into the concept 'type error' would make it too general to be
interestingly different from 'error'.

>  Consider a recent bug I encountered:  a 'continue' inside
>an 'if' statement which should have been outside -- without the
>continue the code would occasionally fall through to a different state
>and would never terminate.  Yeah, that was in C, but testing is the
>only real good way to find statements out of order.

What is the connection of this programming mistake with _type_ errors?
And how does it confirm your opinion that "testing is the only real
good way"?

Markku Sakkinen
Department of Computer Science and Information Systems
University of Jyvaskyla (a's with umlauts)
PL 35
SF-40351 Jyvaskyla (umlauts again)
Finland
          SAKKINEN@FINJYU.bitnet (alternative network address)

strom@watson.ibm.com (Rob Strom) (05/21/91)

In article <1991May17.051840.26916@daffy.cs.wisc.edu>, quale@saavik.cs.wisc.edu (Douglas E. Quale) writes:
|> In article <1991May17.011209.29486@tkou02.enet.dec.com> diamond@jit533.enet@tkou02.enet.dec.com (Norman Diamond) writes:
|> >In article <1991May16.153308.4054@spool.cs.wisc.edu> quale@picard.cs.wisc.edu (Douglas E. Quale) writes:
|> >
|> >>Dynamically typed languages are safer than primitive statically typed
|> >>languages such as C.
|> >
|> >No.  Dynamically typed languages are safer than C.  Statically typed
|>       ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
|> 
|> I believe that's what I said.
|> 
|> 
|> Primitive statically typed languages are languages that practically force the
|> programmer to use an unchecked simulation of dynamic typing.  For a start
|> they generally lack polymorphism, and frequently have little support for
|> generics at all.  Sadly, a vast majority of the programs written in static
|> languages are in languages that have very primitive type systems.

Let me give you the perspective of someone who is designing and implementing
"better" statically typed languages.

If you want to argue that languages (like C) with incomplete static type
checking and no runtime checking are unsafe, you'll get enthusiastic
agreement from me.  

But if you want to argue that therefore there's something intrinsically
wrong with static typing and that we should use dynamic typing instead,
you'll get enthusiastic disagreement from me.

I agree with you that nonsensical operations should never be executed.
I believe that wherever possible nonsensical operations should be
avoided by static analysis, and where this is not possible, they
should be avoided by runtime checks.  

I have previously posted my opinion that most of the time, it is
reasonable to statically declare the types of values a variable
can hold.  For certain kinds of situations it is not.  That is
why our language makes static the usual case, and dynamic the unusual case,
while allowing both.
-- 
Rob Strom, strom@ibm.com, (914) 784-7641
IBM Research, 30 Saw Mill River Road, P.O. Box 704, Yorktown Heights, NY  10958

quale@saavik.cs.wisc.edu (Douglas E. Quale) (05/21/91)

In article <1991May20.005238.19244@tkou02.enet.dec.com> diamond@jit533.enet@tkou02.enet.dec.com (Norman Diamond) writes:
>>>In article <1991May16.153308.4054@spool.cs.wisc.edu> quale@picard.cs.wisc.edu (Douglas E. Quale) writes:
>>>>Dynamically typed languages are safer than primitive statically typed
>>>>languages such as C.
>>>
>>>No.  Dynamically typed languages are safer than C.  Statically typed
>>      ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
>>I believe that's what I said.
>
>It isn't.  Your previous statement confused statically typed languages
>with C.  Your example only proved that dynamically typed languages are
>safer than one particular collection of C programs.  You didn't even
>prove that dynamically typed languages are safer than C in general,
>but I made this generalization intuitively.
>
>You play word games in order to create the assertion that dynamically
>typed languages are safer than statically typed languages.  If you have
>a shred of evidence to support this assertion, it is better to show
>your evidence than to play word games.
>--

No.  I don't confuse statically typed languages with C.  I may confuse C
with a primitive statically typed language, but I don't think that's a
confusion.

Dynamically typed languages are safer than primitive statically typed
languages such as C, exactly as I said.  I should rather have said languages
with primitive static typing, but my original statement is close enough.

In primitive static typing systems type safety is thrown out the window to
provide genericity.  You can disagree with me, but this is not a word game.

SML has a much better type system than C, but I'm not certain how convenient
it would be for writing X programs.

-- Doug Quale
quale@saavik.cs.wisc.edu

rockwell@socrates.umd.edu (Raul Rockwell) (05/22/91)

Michael Chisnall:
   >   I think that a distinction that it may be useful to draw here
   >   is between declarative languages (e.g.  prolog, ML, hope,
   >   miranda, smalltalk) which have a highly modular structure, and
   >   non-declarative languages (such as pascal, fortran, cobol, ada)

Markku Sakkinen:
   BTW, Smalltalk declarative?!  Prolog has a modular structure?!

heh.  In the smalltalk group you've got languages which can declare
data to be some type.  In the pascal group you've got languages where
you declare variables to be some type and data is typed as a side
effect and you must handle type information with control flow (with
exceptions for language intrinsics).  The above grouping doesn't bear
too close examination however...

Markku Sakkinen:
   it looks like a silly statement (has anybody really made it?) that
   having to _declare_ the types of variables would make a language
   _less_ declarative.

Well, the idea is that you can't declare the type of a piece of data,
and must compensate by using some other mechanism to keep track of
this state information.

Me:
   >Am I the only one interested in pointing out that an infinite loop
   >is a type error?

Markku Sakkinen:
   So it seems: you have probably insisted on that some 10 times
   during the last few months without getting many supporters.  Most
   people evidently feel (like myself) that lumping too much into the
   concept 'type error' would make it too general to be interestingly
   different from 'error'.

Well, in the context of turing machines, the definition of a type
error is a combination of an algorithm and data which causes an
infinite loop.  But you're right, infinite loops are too general for
them all to be called type errors.

   >  Consider a recent bug I encountered:  ...

   What is the connection of this programming mistake with _type_
   errors?  And how does it confirm your opinion that "testing is the
   only real good way"?

Not "only real good way" but still essential.  Essential does not mean
that other techniques are not useful.

About the bug: (1) it appeared as an infinite loop which occurred only
for certain input values.  (2) it is possible to write that particular
algorithm (a string matching problem) without using loops.  (3) in C,
I had to represent some of the information as program state and write
loops, so this was a data representation problem.

I suppose I'm coming across as rather unreal, but I can't help it :-)
I rarely use loops or conditional expressions in my own code because I
rarely need to.  Control flow is just another way to represent data.

Raul Rockwell

quale@saavik.cs.wisc.edu (Douglas E. Quale) (05/22/91)

>|> >In article <1991May16.153308.4054@spool.cs.wisc.edu> quale@picard.cs.wisc.edu (Douglas E. Quale) writes:
>|> >
>|> >>Dynamically typed languages are safer than primitive statically typed
>|> >>languages such as C.
>|> >

I just realized that perhaps my use of the word primitive was taken as a
perjorative aimed at all statically typed languages, rather than only
statically typed languages with particularly poor type systems such as C as
I had intended.

About a year ago, I wrote a short nqueens function in Scheme

(define (nqueens n)
  (letrec ((try-col (lambda (c b)
		      (if (= c 0)
			  b
			  (try-row n c b))))
	   (try-row (lambda (r c b)
		      (cond ((= r 0) '())
			    ((safe? r b) (append (try-col (- c 1) (cons r b))
						 (try-row (- r 1) c b)))
			    (else (try-row (- r 1) c b)))))
	   (safe? (lambda (r b)
		    (let loop ((up (+ r 1))
			       (dn (- r 1))
			       (bd b))
			   (or (null? bd)
			       (and (not (= (car bd) r))
				    (not (= (car bd) up))
				    (not (= (car bd) dn))
				    (loop (+ up 1) (- dn 1) (cdr bd))))))))
	(try-col n '())))

A simple test,

> (nqueens 4)
(2 4 1 3 3 1 4 2)

and I see I made a type error.  Looking at the source it was trivial to
fix, but it occurred to me that the SML compiler would have rejected the
program immediately.  This shows that some programmers (namely me) can make
elementary type errors even in very, very simple programs.  Unfortunately
it's so trivial it doesn't show much else.

I looked up the CACM reference given re: type errors in statically vs
dynamically typed languages.  Unfortunately the experiment really dealt
only with an untyped language.  The only untyped languaged that I've used
is assembly language.  Bliss and BCPL are still in use, but in general I
wouldn't think that untyped languages have much to recommend them.  I still
have to treat the assertion that "programs in dynamically typed languages
have more type errors than those in statically typed languages" as having
no objective evidence either way.  (Remember, statically typed languages
give many more chances to make type errors.  Some programs with type errors
in a statically typed language cannot have type errors in a dynamically
typed language.)

Static typing brings up the problem of type equivalence:  should the
language use name equivalence or should it use structural equivalence?
This question got a lot of attention in Pascal, but I don't know that the C
community has given it thought.  C uses structural equivalence which can be
less than helpful, especially when a large program typedefs dozens of
different things to int.  As an example from X Windows again, Xlib typedefs
XID as unsigned long.  Window ids are XIDs, so the call

	XDestroyWindow(display, 2 + 2);

is accepted by the compiler even though this is really a type error.
Although this is rather unlikely, pixels in Xlib are also unsigned longs so

	XDestroyWindow(display, pixel);

is another type error that can't be caught.  In order to catch these errors
using structural equivalence XID would have to become a struct, and that is
a somewhat unpleasant arrangement.  In SML we could simply write

datatype Window = Win of int

fun XDestroyWindow (Display dpy) (Win win) = ....

and in fact although Window is a structure, the compiler doesn't need to
tag it so it gives the same efficiency as an int but is much safer.  You
can also greatly increase type safety --

datatype voltage = Volt of real
datatype current = Amp of real
datatype resistance = Ohm of real

fun ohms_law (Amp i) (Ohm r) = (Volt i * r)

(Pattern matching is great -- inside the function ohms_law we can treat i
and r as reals and use ordinary multiplication.  To the outside world they
look like structures.)

This is a big win for sophisticated static typing.  Dynamic typing can do
this, but it's slow and painful.  The types would have to be explicitly
tagged, untagging them before performing arithmetic etc.

Static + dynamic typing in the same language might overcome any problem.
Suppose however, that only static typing is supported.  How does Ada or SML
handle the resource problem in the X Toolkit?  It must be possible for a
widget writer to add arbitrary types to his widgets, and not have to
recompile libXt.a, the X Toolkit library.  The C implementation simply uses
unchecked dynamic typing.  This is much worse than using a language that
supports dynamic typing because no type checks are performed even at run
time.  Is there a better way?

-- Doug Quale
quale@saavik.cs.wisc.edu

doug@netcom.COM (Doug Merritt) (05/23/91)

In article <1991May20.172441.780@csc.canterbury.ac.nz> chisnall@cosc.canterbury.ac.nz writes:
>From article <1991May13.061520.11992@daffy.cs.wisc.edu>, by quale@khan.cs.wisc.edu (Douglas E. Quale):
>[...]
>> This statement suggests that programs compiled in dynamic languages contain
>> more type errors than similar programs written in statically typed languages.
>> Unfortunately I don't know of any studies that have been made that could
>> definitively affirm or deny this.  In fact, I don't know of any evidence at
>
>    "... This evidence indicated that the use of dynamically typed
>    operands resulted in errors that remain in programs longer than
>    the errors attributable to statically typed operands."

This is in fact trivially true. Errors caught at compile time are
guaranteed to be 100% detectable (and removable), whereas errors caught
at run time cannot be detected until just the right conditions occur,
so in arbitrary programs it will take arbitrarily long for all run time
errors to occur and be detected. Or even infinitely long, since proving
otherwise is the halting problem.

Therefore dynamically typed languages are inheritantly less safe than
strongly statically typed languages, because only in the latter can you
know that all type errors have been removed.

This is a minor point, though, because there are situations where dynamic
typing is a powerfully expressive and natural tool, and the desirability of 
such is (in some situations) far more important than merely increasing the
safety of type usage.

The reason why that is true is that if you really need dynamic typing,
but the language doesn't support it, then you have to implement it yourself
anyway (that's what e.g. C unions and Pascal variant records are *for*,
after all), and now you've written new code that may contain errors,
so you haven't really gained anything.

Note that this argument is algorithm-specific, and says nothing in support
of completely dynamically-typed languages over optionally dynamically-typed
languages.

BTW there's also an age-old esthetic criterion that "everything should be
a first class citizen", and it derives from the practical experience of
awkwardness of expression that results for certain applications if some
things are second class citizens. This is a somewhat stronger argument
than simply "dynamic typing"; it argues for dynamic typing of the strongest
sort, that allows type-values to manipulated via a calculus (ideally
via an algebra).

All in all, one really wants dynamic typing allowed, but optimized out
and statically checked where at all possible. This is a challenge for
language design (I'm working on it :-)
	Doug
-- 
Doug Merritt	doug@netcom.com		apple!netcom!doug

strom@watson.ibm.com (Rob Strom) (05/23/91)

In article <1991May22.161814.15196@daffy.cs.wisc.edu>, quale@saavik.cs.wisc.edu (Douglas E. Quale) writes:
 
|> Static + dynamic typing in the same language might overcome any problem.
|> Suppose however, that only static typing is supported.  How does Ada or SML
|> handle the resource problem in the X Toolkit?  It must be possible for a
|> widget writer to add arbitrary types to his widgets, and not have to
|> recompile libXt.a, the X Toolkit library.  The C implementation simply uses
|> unchecked dynamic typing.  This is much worse than using a language that
|> supports dynamic typing because no type checks are performed even at run
|> time.  Is there a better way?
|>
Hermes uses "polymorphs".  You apply the "wrap" operator to a value of any
data type.  The result is a value of type "polymorph", which can be stored
in a variable of type "polymorph".  It can be copied, stored, and passed
around, but no type-specific operations can be applied.  You must then
"unwrap" the polymorph into a variable of type T before applying operations
appropriate to type T.  A runtime check is made that the wrapped polymorph
value actually is of type T.

This allows dynamic typing where needed, while retaining static typing as
the more common situation.  Whether checked statically or dynamically,
type safety (plus "typestate safety" --- e.g., avoiding reading undefined data)
is always assured.
 
-- 
Rob Strom, strom@watson.ibm.com, (914) 784-7641
IBM Research, 30 Saw Mill River Road, P.O. Box 704, Yorktown Heights, NY  10598