[comp.object] A Hard Problem for Static Type Systems

craig@leland.Stanford.EDU (Craig Chambers) (04/20/91)

Here's a simple example that cannot be described by a static type
system in most statically-typed object-oriented languages.  I'm using
it to help me make sure that the static type system in my new OO
language is sufficiently powerful, but you could use it as another
example of a simple, useful program that is handled simply in a
dynamically-typed system that requires a lot of sophistication in a 
statically-typed system.

Consider the general min function (written in a dynamically-typed
C-like language):

    min(x, y) {
    	if (x < y) {
    	    return x;
    	} else {
    	    return y;
    	}
    }

Assume that we have two kinds of numbers in our language, integers and
floats, and that we've defined implementations of "<" for all four
combinations of integer and float arguments.  We define number as the
common supertype of both integers and floats; since we've defined all
possible combinations, "<" is defined over pairs of numbers.

We also have a collection hierarchy.  The "<" message is defined for
all collections of things that themselves understand "<" to do
lexicographic ordering of two collections.

Note that we do NOT have a "<" message that can take a number as one
argument and a collection as the other.

Here's the problem: we'd like to describe the type of the min
function, so that this one piece of source code can be used to compute
the minimum of two numbers or of two collections of numbers or of two
collections of collections of numbers, etc.

So here are some examples that should type-check:

    min(3, 4)
    min(3, 4.5)
    min({3,4}, {4,5,6})
    min({{3,4.5},{5},{6,8.9}}, {{1.2,4},{2}})

And here are some that shouldn't:

    min(3, {4})
    min({3,4}, {{4,6},{2,3.4,6}})

We're only allowed to use static type systems that actually make
static guarantees about no message-not-understood problems; otherwise,
we'd just be using dynamic typing.  This therefore disallows using
Eiffel's (old, currently implemented) type system based on covariant
typing.

I'm not saying that this cannot be done in a static type system; I'm
hoping that it can, in fact.  I will claim (with the hope of being
disproved) that no "practical" existing language can statically
type-check this example.

-- Craig Chambers

guido@cwi.nl (Guido van Rossum) (04/22/91)

craig@leland.Stanford.EDU (Craig Chambers) writes:

>Consider the general min function (written in a dynamically-typed
>C-like language):
>[...]
>Here's the problem: we'd like to describe the type of the min
>function, so that this one piece of source code can be used to compute
>the minimum of two numbers or of two collections of numbers or of two
>collections of collections of numbers, etc.

This can be done in ABC, a statically typed interpreted language
developed at CWI [1].  ABC knows a single type "number" which can hold
a float or arbitrary-precision rational, and "lists" that are
sorted collections of values with the same type; list items may be
lists if their types are the same, etc.; lists are sorted
lexicographically.  If you don't want the items sorted there is a
"table" type that lets you determine the order; tables are really
associative arrays.

The function you describe is written in ABC as follows:

	HOW TO RETURN min(x, y):
	    IF x < y: RETURN x
	    RETURN y

and now you will have min(3.14, 1) = 1, min({1;2;3}, {1;2}) = {1;2}, etc.,
and min(3.14, {}) will not type-check.  [2]

--Guido van Rossum, CWI, Amsterdam <guido@cwi.nl>
Founder of the Royal Society for Prevention of Cruelty to Amoebae

[1]
%T ABC Programmer's Handbook
%A Leo Geurts
%A Lambert Meertens
%A Steven Pemberton
%I Prentice-Hall
%C London
%D 1990
%O ISBN 0-13-000027-2

[2]
ABC is implemented; I typed the example in and here's the session log:

piring& abc

ABC Release 1.02.01.
Copyright (c) Stichting Mathematisch Centrum, Amsterdam, 1989.
Type '?' for help.
>first
>>> HOW TO RETURN min(x, y):
HOW TO RETURN min(x, y):
   IF x < y: RETURN x
   RETURN y

>>> WRITE min(3.14, 1)
1
>>> WRITE min({1; 2; 3}, {1; 2})
{1; 2}
>>> WRITE min(1, {})
*** Cannot reconcile the types in your command
    WRITE min(1, {})
*** The problem is: I found type EG (0, list or table)
where I expected (?, ?)
>>> QUIT
piring&

craig@leland.Stanford.EDU (Craig Chambers) (04/22/91)

In article <3378@charon.cwi.nl> guido@cwi.nl (Guido van Rossum) writes:
>craig@leland.Stanford.EDU (Craig Chambers) writes:
>
>>Consider the general min function (written in a dynamically-typed
>>C-like language):
>>[...]
>>Here's the problem: we'd like to describe the type of the min
>>function, so that this one piece of source code can be used to compute
>>the minimum of two numbers or of two collections of numbers or of two
>>collections of collections of numbers, etc.
>
>This can be done in ABC, a statically typed interpreted language
>developed at CWI [1].  ABC knows a single type "number" which can hold
>a float or arbitrary-precision rational, and "lists" that are
>sorted collections of values with the same type; list items may be
>lists if their types are the same, etc.; lists are sorted
>lexicographically.  If you don't want the items sorted there is a
>"table" type that lets you determine the order; tables are really
>associative arrays.

I may not have made myself clear with the problem.  I want a general
type system that will handle examples like the one I posted.  I am not
interested in languages or type systems that have built-in data
structures that solve this particular problem but do not generalize.
It sounds from your reply that ABC falls into the second category, but
since you didn't describe ABC's type system, I can't tell whether it
is a generally useful solution to a large class of problems.  What's
the type (inferred, I guess) of the arguments to the min function?
How does the compiler know that "<" in min will work?  Does the
compiler type-check each invocation of min separately (which would be
sidestepping the issue) or does it really infer a type for min that's
usable by all its callers?

Part of the description of the problem implies to me that the type
system should include some notion of subtyping and some notion of
parameterized types.  My (partially formed) solution also uses type
variables and some notion of a type pattern (I'm not sure whether or
not this is the same as a parameterized type; I'm leaning towards yes
if I can overcome some annoying technical obstacles).

If someone posts that some type system solves the problem, please also
post enough information to give people a feel for the facilities in
the type system that allow it to solve the problem.  Thanks.

-- Craig Chambers

dl@g.g.oswego.edu (Doug Lea) (04/22/91)

[Craig Chambers suggested I post this reply I sent him about the `min'
example for others to comment on. Here goes (in slightly edited form).]

Good example. Please bear with me while I meander toward one answer.

At first glance, you'd like to type min as something like

    fn min (x: TotalOrder, y: TotalOrder) -> TotalOrder

where

    type PartialOrder =
        op (x: PartialOrder) <= (y: PartialOrder) -> bool
        op (x: PartialOrder) == (y: PartialOrder) -> bool
    axioms
        forall x (x <= x);
        forall x, y ((x <= y & y <= x) => x == y);
        forall x, y, z ((x <= y | y <= z) => x <= z);

    type TotalOrder : subtype of PartialOrder =
        op (x: TotalOrder) <= (y: TotalOrder) -> bool
        op (x: TotalOrder) == (y: TotalOrder) -> bool
    axioms
        forall x, y (x <= y | y <= x);

followed by things like

    type Number : subtype of TotalOrder = ...
    type Float : subtype of Number = ...
    type Int : subtype of Number = ...

    type NumberCollection: subtype of TotalOrder = ...
    type NumberList : subtype of NumberCollection = ...

  Notes:
    * Most people would rename TotalOrder as `Comparable' or somesuch,
        and use it via `mixin-MI'. 
    * I'm using op<= rather than op< throughout just to make easier
        contact with well-known theory.
    * The `axioms' have little to do with all of this except to clarify
        the meanings of the relations involved. I mainly just wrote them 
        down for completeness.

But all of this is wrong. As you mention,

    * It allows things like 
        min(3, {3, 4})
      as well as 
         3 <= {3, 4}
      to typecheck.  

    * The signatures for TotalOrder ops (and other unstated ops) break 
        contravariance.

The problem lies in how PartialOrder/TotalOrder are defined and used.
In fact, declaring the signatures of op<= as I did doesn't even match
standard interpretations of orderings, since you don't want to say
that any op<= can necessarily accept two DIFFERENT TotalOrder subtype
objects, just two from the set under consideration.

So instead, you'd like to say that any type T is ordered if it obeys
the Partial (Total) ordering PROPERTY rather than is descended from a
PartialOrder supertype. This can be stated in a parametric
polymorphic (`generic') style via something like

    property PartialOrder [T] =
        op (x: T) <= (y: T) -> bool
        op (x: T) == (y: T) -> bool
    axioms ...

    property TotalOrder[T] : subproperty of PartialOrder [T] = ...

    type Number : obeys TotalOrder = ...
    type NumberCollection : obeys TotalOrder = ...

Here, the type Number obeys TotalOrder, even when extended to
subtypes, since you'd like to allow all possible Number subtypes to
compare, presumably via contravariance-maintaining multimethods. If
this were not so, then Int, say, might be claimed to obey TotalOrder,
but not Number. In other words, properties necessarily inherit.

The type NumberCollection should also obey TotalOrder, but the
union type Number-or-NumberCollection does not, and isn't claimed
to (unlike, implicitly, in the orginal declarations).

So it's not the case that Number ISA TotalOrder, instead Number OBEYS
TotalOrder, which means that
    fn min (x: TotalOrder, y: TotalOrder) -> TotalOrder
is senseless, since ordering is a property, not a type.
Instead, you need to say something like
    fn min (x: T obeys TotalOrder, y: T) -> T'
    {
       if (x <= y) return x; else return y;
    }
where everything typechecks if x and y are both of some (perhaps
distantly removed) supertype that obeys TotalOrder (i.e., a T'),
i.e., T' is the `meet' type of ExactTypeOf(x) and ExactTypeOf(y) 
obeying TotalOrder.

This is a form of MIXED universal and subtype polymorphism.

... which seems to solve the original problem. 

OBJ3/FOOPS has some constructs along these lines. But they are mired
in other things that make it hard to see how to nicely apply to a more
normal OO type system or language. It's worth serious exploration to
find ways that make these things easy for programmers to state and
use.

In fact, some of this is doable even in C++, assuming PT (Templates,
as in ch. 14 of the Ellis & Stroustrup C++ ARM). One way would be to

    1. Make class Number and subclasses, that implement (via double
        dispatching) op<= (Number&). (Along with workarounds for
        C++ contravariance-blindness, etc.)

    2. Similarly for NumberCollection, etc.

    3. Make min a template function
        template <class T> T& min(T& x, T& y) { ... }

What you miss is 

    * Code sharing of different versions of min. But this is a 
        compiler-smarts issue. Templates don't HAVE to be macro-expanded.

    * Correct dispatching and type matching. Without properties or
        whatever to guide it, and because argument matching is 
        done statically, C++ won't necessarily invoke the expected
        version of min. For example, 
            Int a; Float b; min(a, b)
        should invoke the min(Number, Number) version, which is not
        guaranteed by the rules in the ARM. 

-Doug

--
Doug Lea  dl@g.oswego.edu || dl@cat.syr.edu || (315)341-2688 || (315)443-1060
|| Computer Science Department, SUNY Oswego, Oswego, NY 13126 
|| Software Engineering Lab, NY CASE Center, Syracuse Univ., Syracuse NY 13244

rmf@cs.columbia.edu (Robert M. Fuhrer) (04/22/91)

Doug Lea (dl@g.oswego.edu) writes:
> At first glance, you'd like to type min as something like
> 
>     fn min (x: TotalOrder, y: TotalOrder) -> TotalOrder
> [...]
> But all of this is wrong.
> [...]
> In fact, declaring the signatures of op<= as I did doesn't even match
> standard interpretations of orderings, since you don't want to say
> that any op<= can necessarily accept two DIFFERENT TotalOrder subtype
> objects, just two from the set under consideration.
> [...]
> So instead, you'd like to say that any type T is ordered if it obeys
> the Partial (Total) ordering PROPERTY rather than is descended from a
> PartialOrder supertype. This can be stated in a parametric
> polymorphic (`generic') style via something like

Let me say that while I agree with the bulk of this posting, I think two
separate things are going on. First, there is the issue that single
inheritance makes harder certain kinds of sharing (like the "property" sharing
mentioned above). [It's not clear to me whether the sharing of important
object characteristics is made impossible or not -- I haven't thought about it
much, comments are welcome!.]

The second is that while we like the idea of the type hierarchy, we run into a
problem with enforcing constraints among the actual parameter types. In Mr.
Lea's near-solution of the example problem, a simple path to an acceptable
solution would be to enforce that the 2 parameters have the *same* type (or
compatible types, perhaps in the "property" sense).

So, my question is, given that the type hierarchy can be properly factored
such that the desired properties are represented by some class, does
existential quantification (well, bounded since we have a class structure)
(ala Cardelli & Wegner, ACM Surveys, 1985) solve the remaining problem?

  I.e., does a construct such as

     exists(T < TotalOrder) such that fn min(x: T, y: T) => T

  do the trick?

If so, is this an example of the "impractical" mechanism Mr. Chambers wants to
avoid? If not, what else is going on?
--

--------------------------
Robert M. Fuhrer
Computer Science Department
Columbia University
1117B Fairchild Building
Internet: rmf@cs.columbia.edu
UUCP:     ...!rutgers!cs.columbia.edu!rmf

sakkinen@jyu.fi (Markku Sakkinen) (04/22/91)

In article <1991Apr20.010347.28984@leland.Stanford.EDU> craig@self.stanford.edu writes:
>
>Here's a simple example that cannot be described by a static type
>system in most statically-typed object-oriented languages.  I'm using
>it to help me make sure that the static type system in my new OO
>language is sufficiently powerful, but you could use it as another
>example of a simple, useful program that is handled simply in a
                                     ^^^^^^^^^^^^^^^^^^^^^^^^^^^
>dynamically-typed system that requires a lot of sophistication in a 
 ^^^^^^^^^^^^^^^^^^^^^^^^
>statically-typed system.
> ...
>Assume that we have two kinds of numbers in our language, integers and
>floats, and that we've defined implementations of "<" for all four
>combinations of integer and float arguments.  We define number as the
>common supertype of both integers and floats; since we've defined all
>possible combinations, "<" is defined over pairs of numbers.
>
>We also have a collection hierarchy.  The "<" message is defined for
>all collections of things that themselves understand "<" to do
>lexicographic ordering of two collections.
>
>Note that we do NOT have a "<" message that can take a number as one
>argument and a collection as the other.
> ...

It appears to me that the given starting point for this problem
(although somewhat fuzzily defined) itself requires additional
work in a purely dynamically-typed system,
but is simple in a statically-typed system with the appropriate
features, i.e. first-class set types.  (I don't know about SETL
except that it's built mainly upon set handling; is it
statically typed?)

It seems that you require homogeneous sets, i.e. sets of numbers,
sets of sets of numbers, etc.  In a statically-typed language that
really supports sets of any order, you can get that homogeneity
automatically with the correct type definition.  If you then try
to add a NUMBER to a SET OF SET OF NUMBER you get a compile-time error.
In a dynamically-typed language, you have to program yourself
the run-time tests to check:

(1) when you try to add a new element to a non-empty set,
    that it is of the same "degree" as the previous elements

(2) when applying the '<' operator to two objects,
    that they are of the same "degree"

On the other hand, I don't know if any current statically-typed
language allows a convenient single recursive definition of '<'
for all such set types (some functional language perhaps?).
In a dynamically-typed object-oriented language, it would obviously
suffice to define a class StratifiedOrderedCollection,
which would have the "degree" as one instance variable.

P.S. The word 'degree' is in quotes above because I am uncertain
about the established term.  Is it 'order' (what an overloaded
word: no wonder that the misnomer 'sorting' is so commonly used
for 'ordering')?

Markku Sakkinen
Department of Computer Science and Information Systems
University of Jyvaskyla (a's with umlauts)
PL 35
SF-40351 Jyvaskyla (umlauts again)
Finland
          SAKKINEN@FINJYU.bitnet (alternative network address)

guido@cwi.nl (Guido van Rossum) (04/22/91)

>>	[me]
>	[craig]

craig@leland.Stanford.EDU (Craig Chambers) writes:

>In article <3378@charon.cwi.nl> guido@cwi.nl (Guido van Rossum) writes:
>>This can be done in ABC, a statically typed interpreted language
>>developed at CWI [1].  ABC knows a single type "number" which can hold
>>a float or arbitrary-precision rational, and "lists" that are
>>sorted collections of values with the same type; list items may be
>>lists if their types are the same, etc.; lists are sorted
>>lexicographically.  If you don't want the items sorted there is a
>>"table" type that lets you determine the order; tables are really
>>associative arrays.
>
>I may not have made myself clear with the problem.  I want a general
>type system that will handle examples like the one I posted.  I am not
>interested in languages or type systems that have built-in data
>structures that solve this particular problem but do not generalize.
>It sounds from your reply that ABC falls into the second category, but
>since you didn't describe ABC's type system, I can't tell whether it
>is a generally useful solution to a large class of problems.

I still don't quite understand your gripe, so here's an explanation of
ABC's type system.  It solves much more than the particular problem
you stated, and I believe that the type system could be extended to
handle cases currently not supported by the language.  Look for
yourself:

- basic types: numbers and strings
- type constructors:
	- tuple containing <type1>, <type2>, ...
	- list of <type1>
	- table with keys of <type1> and values of <type2>
- There are no pointers, nor function variables
- There are no unions
- There is no way to explicitly introduce a new type
  (there are infinitely many types possible through the type
  constructors though)
- Functions may be polymorphic
- The type inference system is "bootstrapped" with knowledge about
  the built-in operators; e.g., "+" requires two number arguments and
  returns a number, "^" operates on strings and returns a string, "#"
  (the length function) operates on strings, lists and tables, etc.

>What's the type (inferred, I guess) of the arguments to the min function?

It is (<alpha>, <alpha>), where <alpha> stands for any type but the two
occurrences must match.  This leaves the actual type unrestricted but
says that the two arguments must have the same type.

>How does the compiler know that "<" in min will work?

From the definition of min it concludes that the arguments must be
comparable to each other.  The definition of the built-in "<" operator
says that two types are comparable to each other if they are the same
type.  [As I said, ABC has no compiler since is interpreted, but
that's beside the point -- there is a static type checker that must
accept a program before it is ever passed to the interpreter.]

>Does the
>compiler type-check each invocation of min separately (which would be
>sidestepping the issue) or does it really infer a type for min that's
>usable by all its callers?

The latter.  Min's type would be "function with (<alpha>, <alpha>)
argument returning <alpha>".  ABC's type checker uses a unification
algorithm to combine the various things it learns about variables and
functions.

>Part of the description of the problem implies to me that the type
>system should include some notion of subtyping and some notion of
>parameterized types.  My (partially formed) solution also uses type
>variables and some notion of a type pattern (I'm not sure whether or
>not this is the same as a parameterized type; I'm leaning towards yes
>if I can overcome some annoying technical obstacles).

>If someone posts that some type system solves the problem, please also
>post enough information to give people a feel for the facilities in
>the type system that allow it to solve the problem.  Thanks.

I am not a type theorist, just a former member of the ABC
implementation group, so it may be difficult for me to explain ABC's
type system to you.  Hope this has helped.  If not, I'll ask its
designer to take over...

--Guido van Rossum, CWI, Amsterdam <guido@cwi.nl>
"If this is Bolton, I shall return to the pet shop"

craig@leland.Stanford.EDU (Craig Chambers) (04/23/91)

In article <3392@charon.cwi.nl>, guido@cwi.nl (Guido van Rossum) writes:
|> craig@leland.Stanford.EDU (Craig Chambers) writes:
|> >How does the compiler know that "<" in min will work?
|> 
|> From the definition of min it concludes that the arguments must be
|> comparable to each other.  The definition of the built-in "<" operator
|> says that two types are comparable to each other if they are the same
|> type.

The "<" message was supposed to be a user-defined message, implemented
differently by different objects/classes.  But since ABC has no
user-defined data types, this whole example doesn't make sense, and
ABC can't express the general problem I'm trying to type-check.

-- Craig Chambers

boehm@parc.xerox.com (Hans Boehm) (04/23/91)

Aside from syntactic issues, this doesn't seem very hard in languages like
Russell, Quest, or Poly.  I either need a built-in type number, or I need
to explicitly define something that is the tagged union of integers and floats.
(The tag may be a method suite.) I then define min to take a type (or algebra,
or your favorite term...) with a "<" operation and its two "real" arguments.
(When I apply min, the type argument can be inferred.)  This forces a static
check that the two argument types are the same.

This still involves at least some dynamic method selection.  But that seems
inherent in the problem.

Whether you consider these languages to be practical is a matter of taste.
They have all been used to build nontrivial programs.

Hans

euaabt@eua.ericsson.se (Anders.Bjornerstedt) (04/23/91)

I suspect that you could express this, or something very close to it, in 
the language CLU. The problem is I dont have the relevant references 
accessible, I am short of time, I am lazy, etc etc. So why do I write
this ? Well to encourage any person out there knowlegable in CLU to try!

--------------------------------------------
Anders Bjornerstedt
Software Development Environments
ELLEMTEL
Box 1505
S-125 25  Alvsjo
SWEDEN

Tel: +46-8-727 40 67
Fax: +46-8-647 82 76

E-mail: Anders.Bjornerstedt@eua.ericsson.se

new@ee.udel.edu (Darren New) (04/24/91)

In article <3378@charon.cwi.nl> guido@cwi.nl (Guido van Rossum) writes:
>This can be done in ABC, a statically typed interpreted language
>developed at CWI [1].  

From your example, it looks like ABC is dynamically typed (altho strongly
typed) to me.  Why do you say it is statically typed?  What are the types
of "x" and "y" in your declaration of "min"?         -- Darren

-- 
--- Darren New --- Grad Student --- CIS --- Univ. of Delaware ---
----- Network Protocols, Graphics, Programming Languages, FDTs -----
     +=+=+ My time is very valuable, but unfortunately only to me +=+=+
+=+ Nails work better than screws, when both are driven with screwdrivers +=+

craig@elaine35.Stanford.EDU (Craig Chambers) (04/24/91)

In article <1991Apr23.152110.6500@eua.ericsson.se> euaabt@eua.ericsson.se (Anders.Bjornerstedt) writes:
>I suspect that you could express this, or something very close to it, in 
>the language CLU. The problem is I dont have the relevant references 
>accessible, I am short of time, I am lazy, etc etc. So why do I write
>this ? Well to encourage any person out there knowlegable in CLU to try!

I used to work with the CLU people and have written a number of CLU
programs while a student at MIT.  CLU's where clauses provide some of
the solution (they act like the type patterns I alluded to in an
earlier message), but CLU has no subtyping, so it can't handle
comparing subtypes of number.

-- Craig Chambers

new@ee.udel.edu (Darren New) (04/24/91)

I think it is important to remember the differences between dynamic
typing and dynamic binding.  Many of the arguments in this thread get
confused because the distinction is not made clear and some systems
have one without the other while other systems intertwine both.  For
example, Hermies has dynamic typing (via polymorph) without dynamic
binding (in the normal OO sense).  C++ has some limited dynamic binding
without dynamic typing.  Smalltalk intermixes the dynamic typing and
the dynamic binding semantically, muddying the waters.

Personally, I think both are necessary, but for different tasks.
-- 
--- Darren New --- Grad Student --- CIS --- Univ. of Delaware ---
----- Network Protocols, Graphics, Programming Languages, FDTs -----
     +=+=+ My time is very valuable, but unfortunately only to me +=+=+
+=+ Nails work better than screws, when both are driven with screwdrivers +=+

duchier@cs.yale.edu (Denys Duchier) (04/24/91)

Haskell has the notion of classes, and below is the code taken
verbatim from the implementation of the Prelude. `instance (Ord a) =>
Ord [a] where ...' basically means, if `a' is a type of class Ord,
then `list of a' is also a type of class Ord, and the following
operations are defined on it...'.  Implementing this functionality
typically requires passing a dictionnary (as an additional argument)
that specifies the operations defined on type `a' as a member of class
Ord (something like this; I'm a little fuzzy on the exact details).

module PreludeListInst where

import PreludeRealCore

instance (Eq a) => Eq [a] where
  [] == [] = True
  (a:b) == (c:d) = a == c && b == d
  _ == _ = False

instance (Ord a) => Ord [a] where
  [] <= _ = True
  _ <= [] = False
  (a:b) <= (c:d) = a <= c || a == c && b <= d
  
  _ < [] = False
  [] < _ = True
  (a:b) < (c:d) = a < c || a == c && b < d

See "Report on the programming Language Haskell, A non-strict, Purely
Functional Language" (YALEU/DCS/RR-777).

Does this answer your question, or did I misunderstand the point you
were trying to make?

--Denys

craig@elaine35.Stanford.EDU (Craig Chambers) (04/24/91)

In article <DUCHIER.91Apr23205041@albania.cs.yale.edu> duchier-denys@cs.yale.edu writes:
>Haskell has the notion of classes, and below is the code taken
>verbatim from the implementation of the Prelude. `instance (Ord a) =>
>Ord [a] where ...' basically means, if `a' is a type of class Ord,
>then `list of a' is also a type of class Ord, and the following
>operations are defined on it...'.

Several people have pointed at Haskell as a language including a type
system that solves the problem (a lot of people have sent e-mail to me
directly; I was actually hoping to spur group discussion rather than
request information).  And Haskell's system might be a reasonable
answer.  I've read the Haskell report, but since I've never programmed
in Haskell, I can't really tell.  I wonder e.g. whether the type
variables in the Ord class allow subtypes or whether all "instances"
of a type variable must be the same type.  I also wonder whether type
classes are a compile-time macro expansion type of thing for
overloading or whether they support run-time dynamic dispatching like
a full-fledged OO language.  I'd appreciate answers from people
more knowledgable about Haskell than me (I?).

In any case, I'll take this opportunity to post my thoughts on a type
system that attempts to solve the problem.

The type of the arguments to the min function must be things that
understand "<".  The straightforward (and broken) approach would be to
define a supertype called Ordered that is the type of these arguments:

type Ordered
	Ordered "<" Ordered: bool
	...
end Ordered

min(x:Ordered, y:Ordered):Ordered {
	return x < y ? x : y;
}

Numbers and collections of Ordered things are then made subtypes of
Ordered:

type Number <= Ordered
	Number "<" Number: bool
	...
end Number

type Integer <= Number
	Integer "<" Integer: bool { ... }
	Integer "<" Float: bool { ... }
	...
end Integer

type Float <= Number
	Float "<" Integer: bool { ... }
	Float "<" Float: bool { ... }
	...
end Float

type Collection[S]
	...
end Collection

type Collection[T <= Ordered] <= Ordered
	Collection[T] "<" Collection[T]: bool { ... }
	...
end Collection

Unfortunately, this doesn't work since it allows calls of the form
	min(3, {3,4})
which are not implemented anywhere.  In other words, Number and
Collection are not legal subtypes of Ordered since they violate
contravariance.

What we need is some "type pattern" which describes types that support
"<" on their elements, or more precisely, a pattern of two types that
can be compared (the type of the first argument to < doesn't have to
be the same as the second type).  I'm thinking that parameterized
types are such pattern types.  I'm looking at solutions like the
following:

type Ordered[T]
	T "<" T: bool
	...
end Ordered

type Number <= Ordered[Number]
	...
end Number

...

type Collection[T <= Ordered[T]] <= Ordered[Collection[T]]
	...
end Collection

min(x:Ordered[T], y:Ordered[T]):T {
	...
}

Now Number is a legal subtype of the Ordered[Number] instance of the
Ordered parameterized type (check the signatures), and similarly for
the Collection type (although this case begins to look a little
hairy).  The interface to min states that both arguments should be
subtypes of some common instance of the Ordered parameterized type,
and that the result of min is (a subtype of) this instantiating type.

However, I don't completely understand this type system.  Are type
patterns really the same as parameterized types?  Sometimes I get
confused whether I should say Ordered[T] somewhere or just T.  For
instance, I wonder whether the type of y in the min function above
could just as easily be T.  How would such a change affect what kinds
of objects can be passed to min?  I'm also trying to get this type
system to work out in a multiply-dispatched OO language that I've been
working on, and that's making the problem (for me) even harder.

-- Craig Chambers

pwd@cl.cam.ac.uk (Peter Dickman) (04/24/91)

In article <1991Apr23.152110.6500@eua.ericsson.se> 
euaabt@eua.ericsson.se (Anders.Bjornerstedt) writes:
>I suspect that you could express this, or something very close to it, in 
>the language CLU.

You mean like this? (Please excuse the style... :-)
Please read carefully - including the notes below, before telling me that I 
haven't solved the problem.

min = PROC [T : TYPE] (x, y : T) RETURNS (T) 
      WHERE T HAS lt : PROCTYPE (T,T) RETURNS (bool)

  IF (x < y) 
  THEN 
    RETURN (x) 
  ELSE 
    RETURN (y) 
  ENDIF
END min

Note that both arguments must be the same type; but int & real aren't the same 
so you'd have to explicitly convert the int into a real first (*). Also, < is 
just syntactic sugar for an invocation of the lt function, so (x < y) is read 
by the compiler as type_of(x)$lt(x,y), in other words I could have put 
T$lt(x,y) instead of x < y. CLU doesn't distinguish between built-in & 
user-defined types - it treats them all the same. If the instantiating 
type for T doesn't support a lt operation there'll be a compile-time error.

Calls would then be things like:

   a : int  := min[int](5,6)
   b : list := min[list](c,d)   % where list is a user-defined cluster 
                                % (type) supporting an lt operation and 
				% c & d are of type list

   e : real := min[real](real$i2r(6), 2.0)    % convert an int to a real first

(the type declaration would, obviously, be omitted if the variables were 
previously declared).

Many other languages can be used in EXACTLY the same way. If it has constrained
genericity you can do this. All of the CLU family (CLU, CCLU, Argus, Troy etc)
for example. Sadly Eiffel has inheritance & unconstrained genericity but the 
constrained genericity was left out (I don't like Bertrand Meyer's 
justification for this I'm afraid - but accept that he had his reasons). 
Doesn't Ada have some form of generics too? 

Note that CLU is strongly typed. There is the possibility of over-riding the
static typing by use of the ANY type, however the only way you can apply an
operation to an ANY is by FORCEing it to a type (otherwise all you can do
is assign the ANY value to a variable of type ANY). And the only type that an
object can be forced to is the one it started as (otherwise there's a run-time
exception). Note also that because of the signal mechanism (and the fact that 
the CCLU compiler (& maybe others) warns you if you fail to catch all possible 
signals) you have no-one to blame but yourself if there is a run-time type 
failure which can't be handled by your code. The FORCE operation is defined as:

    FORCE = PROC [T: TYPE] (x : ANY) RETURNS (T) SIGNALS (wrong_type)

This solves the resource-manager problem that's being discussed in the 
"Run-time Type Errors in Smalltalk" thread.

----------------

(*) By the way - int and real AREN'T the same. I certainly don't want a 
compiler 'helping' me when I use 2 instead of 2.0 (or, worse still, 
'number_readings : int' instead of 'latest_reading : real'). 
It helps to keep the lid on bugs to be explicit about what you are doing - and 
helps the compiler to optimise things too.

Because of this lack of sub-typing the problem as originally posed cannot be
answered in CLU. Finding the min of a real and an int would be faulted.
What's the problem with an explicit coercion though? Is it merely a matter of
taste? Or down to the "minimal thought vs maximal confidence" religious war?

Those who believe this problem should be solvable as originally posed, please
read & reflect on the following:

Assume that A is of type M, B is of type N and both M & N are subtypes of
type P, which defines a < operation; it doesn't necessarily make sense to
assume automatically that A & B can be compared with <, since either M or
N may have redefined <. In particular, if <P is the P < operator and <M
is a reimplemented < operator for M. We could have A,C of type M and 
B of type N with A <P B, B <P C and C <M A; but, one hopes, A <P C in 
order to maintain transitivity of <P. So how would this hoped-for 
super-min function cope? Any one invocation might be reasonable but
a mutually inconsistent set of invocations could easily result.

The only way I can see to handle this is to say that the reimplemented <M 
must be consistent with the original <P function - I don't believe anyone 
knows how to ensure this sort of thing, let alone has actually built 
it into a real language. Besides, in this case such a constraint renders
<M redundant (unless it has side-effects :-).

-- Peter

boehm@parc.xerox.com (Hans Boehm) (04/25/91)

It seems to me that this problem fundamentally has very little to do with object
oriented programming.  Expressing things in terms of a subtype hierarchy
confuses the issue and contributes nothing.  In Russell, the min function
has signature:

func [x,y: val T; T: type { < }] val T

It's a function of two values of type T, and of the type T itself.  T is
expected to include a < operation.  (I get to cheat slightly, in that
operations like < have the "right" type by default.)  The third (type)
parameter will be inferred for each call, and is not explicitly needed,
except in the declaration of min.

It would be nice if I could annotate "<" with some properties (e.g. the fact
that it should be transitive).  Languages like IBM's Scratchpad have
mechanisms for doing that.

Cardelli's Quest language combines this kind of mechanism with inheritance.
(Quest and Russell differ in other profound respects.  But those have nothing
to do with this discussion.)

Hans
(boehm@xerox.com)

Usual disclaimers ...

gudeman@cs.arizona.edu (David Gudeman) (04/25/91)

In article  <51669@nigel.ee.udel.edu> Darren New writes:
]... Hermies has dynamic typing (via polymorph) without dynamic
]binding (in the normal OO sense).  C++ has some limited dynamic binding
]without dynamic typing.  Smalltalk intermixes the dynamic typing and
]the dynamic binding semantically, muddying the waters.

OK, I give up.  What is the difference between dynamic typing and
dynamic binding?
--
					David Gudeman
gudeman@cs.arizona.edu
noao!arizona!gudeman

new@ee.udel.edu (Darren New) (04/25/91)

In article <2392@optima.cs.arizona.edu> gudeman@cs.arizona.edu (David Gudeman) writes:
>OK, I give up.  What is the difference between dynamic typing and
>dynamic binding?

How about an example?  Hermes gives a declaration (polymorph) and two
operations  (wrap and unwrap) for dealing with it.  You can take a
typed value and wrap it into a polymorph. Later, you can take that
polymorph and unwrap it into a typed value (with appropriate runtime
checks).  However, there is nothing else you can do with the polymorph
(except ask its type and such) while it is wrapped: specifically, you
cannot operate on the wrapped typed value.  This feature allows
building of hetrogeneous lists, resource managers, dictionaries, and so
on. However, it seems to be a pain to build `objects' in Hermes because
each capability for sending a message must be unwrapped before use if
it is passed in a polymorph and must be predeclared of the right type
if it is not a polymorph.  Either each "object" will have a dictionary
of polymorphs which must be unwrapped or will have a separate
declaration for all possible combinations of messages handled (clearly
unfeasible).

C++ has static typing (in that it is illegal to perform an operation on
something that the declared type does not support), but it is possible
that the same call will invoke different methods for different values
of arguments. This I call dynamic binding.

Smalltalk mixes this up because the type of a value *always* determines
what method a message will be bound to, and thus people tend to confuse
these two properties.

-- 
--- Darren New --- Grad Student --- CIS --- Univ. of Delaware ---
----- Network Protocols, Graphics, Programming Languages, FDTs -----
+=+ Nails work better than screws, when both are driven with hammers +=+

wg@opal.cs.tu-berlin.de (Wolfgang Grieskamp) (04/25/91)

gudeman@cs.arizona.edu (David Gudeman) writes:

>In article  <51669@nigel.ee.udel.edu> Darren New writes:
>]... Hermies has dynamic typing (via polymorph) without dynamic
>]binding (in the normal OO sense).  C++ has some limited dynamic binding
>]without dynamic typing.  Smalltalk intermixes the dynamic typing and
>]the dynamic binding semantically, muddying the waters.

>OK, I give up.  What is the difference between dynamic typing and
>dynamic binding?

I guess the following:

C++ is "strongly typed" with dynamic binding (according to "virtuals"), 
since the error "message not understood" is not part of the languages 
operational semantics. Each message send to an object will be understood 
by one of its classes in the class hierarchy (however, the final 
superclass may implement the message by printing "message not 
understood" and just exit()...). 

Smalltalk is "dynamically typed" with dynamic binding, since the error
"message not understood" is part of the languages operational semantics.

--
Wolfgang Grieskamp 
wg@opal.cs.tu-berlin.de tub!tubopal!wg wg%opal@DB0TUI11.BITNET

craig@leland.Stanford.EDU (Craig Chambers) (04/25/91)

In article <boehm.672515148@siria>, boehm@parc.xerox.com (Hans Boehm) writes:
|> It seems to me that this problem fundamentally has very little to do with object
|> oriented programming.  Expressing things in terms of a subtype hierarchy
|> confuses the issue and contributes nothing.  In Russell, the min function
|> has signature:
|> 
|> func [x,y: val T; T: type { < }] val T
|> 
|> It's a function of two values of type T, and of the type T itself.  T is
|> expected to include a < operation.  (I get to cheat slightly, in that
|> operations like < have the "right" type by default.)  The third (type)
|> parameter will be inferred for each call, and is not explicitly needed,
|> except in the declaration of min.

Part of the subtype interaction is that I'd like to instantiate the type T
above to "Number", and then allow subtypes of Number as the actuals passed
to x and y.  So I don't think that subtyping is completely spurious.

But I believe that subtyping might be able to be "grafted on" to an
existing non-object-oriented language that handles these sorts of
things.  For example, I think that you could take CLU's form of the
parameterized min function with where clauses:

min = proc[T:type](x:T, y:T) returns(T)
	where T has <:proctype(T, T) returns bool
    ...
end min

...and then allow subtyping of arguments.  Then I could write:

min[number](3,4.5)

I could also write:

min[collection[number]]({3,4.5}, {2,6.2,-5})

The where clause above is similar to the type pattern idea I talked
about earlier and to properties in the Haskell type class system.  But
I'd like to find a solution that didn't have both a notion of type
that has subtypes and a notion of type property that has instances;
having both seems non-orthogonal in some way.

It seems to me that languages based (at least in spirit) on some form
of polymorphic lambda calculus, e.g. Russell, ML, and Haskell, can do
the part of the problem that involves type variables (i.e. handling
Num * Num and Collection * Collection), but none of them really handle
the part dealing with subtyping of various kinds of numbers.  And most
existing statically-typed object-oriented languages handle the
subtyping of numbers fine but fall down when faced with a need for
some form of type variables.

-- Craig Chambers

P.S. People might find it interesting that A. K. Wright sent me e-mail
saying that he felt that the min problem and similar problems cannot
be solved to my satisfaction, and references his SIGPLAN PLDI'90 paper
with G. V. Cormack on "Type-Dependent Parameter Inference" as
containing a number of linear algebra examples that pose similar
problems for static type checkers.

boehm@parc.xerox.com (Hans Boehm) (04/26/91)

It's not entirely clear to me what you're trying to achieve by viewing
integer and float as subtypes of number.  They presumably share no code,
hence code reuse is not the issue.  There is no real possibility of adding
further subtypes without modifying the existing ones.  (You would have to
add definitions of < for all newly introduced combinations.)  Thus it seems
that in this case subtyping is equivalent to viewing number as the union
of integer and float, with < suitably defined on the union.  Essentially all
of the languages under discussion can deal with that.

There are problems with this, but they are essentially syntactic.  I may
not want to write convert_to_number(3.5).  (If I care about "performance
transparency", I may want to write it, since convert_to_number(x) may
cost a lot more to evaluate than x.  I may have to convert a floating
point value in a register to a pointer to a tagged value in the heap.)
Thus there are sticky issues about inferring injections into a union.
In this case it doesn't seem hard.  But a general elegant solution probably
is hard, at least if it's also capable of inferring the type argument to min.

Hans
(boehm@xerox.com)

bertrand@eiffel.UUCP (Bertrand Meyer) (04/26/91)

From <1991Apr20.010347.28984@leland.Stanford.EDU>
by craig@leland.Stanford.EDU (Craig Chambers):
> 
> Here's a simple example that cannot be described by a static type
> system in most statically-typed object-oriented languages.  I'm using
> it to help me make sure that the static type system in my new OO
> language is sufficiently powerful.
> 
> Here's the problem: we'd like to describe the type of the min
> function, so that this one piece of source code can be used to compute
> the minimum of two numbers or of two collections of numbers or of two
> collections of collections of numbers, etc.
> 
> So here are some examples that should type-check:
> 
>     min(3, 4)
>     min(3, 4.5)
>     min({3,4}, {4,5,6})
>     min({{3,4.5},{5},{6,8.9}}, {{1.2,4},{2}})
> 
> And here are some that shouldn't:
> 
>     min(3, {4})
>     min({3,4}, {{4,6},{2,3.4,6}})

I don't know about ``most statically-typed object-oriented
languages'' but in Eiffel this does not appear particularly
difficult. Class COMPARABLE describes order relations;
one could define `min' in that class as

	min (other: like Current): like Current is
			-- Minimum of current element and `other'
		do
			if Current < other then
				Result := Current
			else
				Result := other
			end
		end -- min

In COMPARABLE, the operator "<" means a call to the following
function:

	infix "<" (other: like Current): BOOLEAN is
			-- Is current element less than or equal to `other'?
		deferred
		end -- "<"

Then INT, FLOAT and SORTED_LIST (a generic class) may inherit
from COMPARABLE and provide an effective declaration for infix "<". 

The ``like'' keyword used in these declarations is one of the key parts
of the type system, known as anchored declarations.
What it means is that `other' and the result
of `min' must be of the same type as Current (the current object),
even if redefined in a descendant class. In SORTED_LIST [X], for example,
the actual argument to both of the above functions must also
be of type SORTED_LIST [X], or a conforming (descendant) type.
Anchored declarations of this kind are what makes typing possible and
useful; they directly reflect the covariant rule, without
which typing, in our experience, would not work.


With the proper declarations for arguments a and b, the call
a.min (b) for the six examples given by Mr. Chambers will yield
the desired behavior: acceptance in the first four cases, rejection
in the last two. This assumes the following declarations (respectively):

1	a, b: INT
2	a: INT; b: FLOAT
3	a, b: SORTED_LIST [INT]
4	a, b: SORTED_LIST [SORTED_LIST [INT]]
 
5	a: INT; b: SORTED_LIST [INT]
6	a: SORTED_LIST [INT]; b: SORTED_LIST [SORTED_LIST [INT]]
	

In the last two cases, you can cheat the type system in Eiffel 2.3
by declaring for example a and b as being of type COMPARABLE,
and then assigning to them the values given in the corresponding examples.
This is because the detection of such erroneous cases requires
system-level checking (as opposed to class-level checking), which will
only be provided in Eiffel version 3. However such cases occur rarely except
if specially contrived.

Eiffel 3 will also have two properties which are relevant to this
discussion:
-  

	- It will be possible to anchor a function result to an argument
	of the function. In function `min', for example, it will be possible
	to declare the function result as being of type `like other',
	which provides more flexibility than `like Current'.
	(With the above declaration, if `n' is integer and `r' real,
	you have to write a call as r.min (n) rather than n.min (r);
	with the relaxed rule both are possible.

	- Support for manifest arrays makes it possible to write
	examples  such as min({3,4}, {{4,6},{2,3.4,6}}) in almost
	exactly this syntax, with << for the opening brace and
	>> for the closing brace. The rule is that <<a, b, ...>>
	conforms to ARRAY [T] for any T for which all of a, b, ...
	conform to T. This will mean that even without any
	entity declarations (of the forms numbered 1 to 6 above)
	the type checking will yield the desired effect. 

-- 
-- Bertrand Meyer
Interactive Software Engineering Inc., Santa Barbara
bertrand@eiffel.uucp

euaabt@eua.ericsson.se (Anders.Bjornerstedt) (04/26/91)

craig@elaine35.Stanford.EDU (Craig Chambers) writes:

>In article <1991Apr23.152110.6500@eua.ericsson.se> euaabt@eua.ericsson.se (Anders.Bjornerstedt) writes:
>>I suspect that you could express this, or something very close to it, in 
>>the language CLU. The problem is I dont have the relevant references 
>>accessible, I am short of time, I am lazy, etc etc. So why do I write
>>this ? Well to encourage any person out there knowlegable in CLU to try!

>I used to work with the CLU people and have written a number of CLU
>programs while a student at MIT.  CLU's where clauses provide some of
>the solution (they act like the type patterns I alluded to in an
>earlier message), but CLU has no subtyping, so it can't handle
>comparing subtypes of number.

>-- Craig Chambers

Yes, but CLU does have parameterized types. If I remeber correctly you
could specify the type parameters of a new type by requiring that the
parameter types "conform" by having one or more methods with a certain
signature. This might be more flexible than using inheritance (of 
specification), although less safe since signatures say very little
about semantics. On the other hand maybe the type parameters have to be bound
at compile time in CLU? in which case i guess CLU could not solve the problem.

--------------------------------------------
Anders Bjornerstedt
Software Development Environments
ELLEMTEL
Box 1505
S-125 25  Alvsjo
SWEDEN

Tel: +46-8-727 40 67
Fax: +46-8-647 82 76

E-mail: Anders.Bjornerstedt@eua.ericsson.se

rick@tetrauk.UUCP (Rick Jones) (04/26/91)

In article <1991Apr24.144714.17740@cl.cam.ac.uk> pwd@cl.cam.ac.uk (Peter Dickman) writes:
> [ ... ]  Sadly Eiffel has inheritance & unconstrained genericity but the 
>constrained genericity was left out (I don't like Bertrand Meyer's 
>justification for this I'm afraid - but accept that he had his reasons). 

Just to correct a misunderstanding, Eiffel DOES have constrained genericity.
This was introduced in version 2.2 of the language (released about 18 months
ago).  Dr. Meyer has revised a few of the opinions he expressed in his book
OOSC since it was first published, and Eiffel has evolved as a result.  It does
mean that OOSC is out of date as far as a true definition of the language is
concerned.

On this subject, the book "Eiffel: the Language" is due for publication very
soon, and will define version 3 of the language.  This addresses many issues,
large and small, which have been found in practice by users of the language,
and should prove very interesting reading for anyone concerned with programming
languages.

-- 
Rick Jones, Tetra Ltd.  Maidenhead, Berks, UK
rick@tetrauk.uucp

Any fool can provide a solution - the problem is to understand the problem

craig@leland.Stanford.EDU (Craig Chambers) (04/27/91)

In article <554@eiffel.UUCP>, bertrand@eiffel.UUCP (Bertrand Meyer) writes:
|> I don't know about ``most statically-typed object-oriented
|> languages'' but in Eiffel this does not appear particularly
|> difficult. Class COMPARABLE describes order relations;
|> one could define `min' in that class as
|> 
|> 	min (other: like Current): like Current is
|> 			-- Minimum of current element and `other'
|> 		do
|> 			if Current < other then
|> 				Result := Current
|> 			else
|> 				Result := other
|> 			end
|> 		end -- min
|> 
|> In COMPARABLE, the operator "<" means a call to the following
|> function:
|> 
|> 	infix "<" (other: like Current): BOOLEAN is
|> 			-- Is current element less than or equal to `other'?
|> 		deferred
|> 		end -- "<"
|> 
|> Then INT, FLOAT and SORTED_LIST (a generic class) may inherit
|> from COMPARABLE and provide an effective declaration for infix "<". 

Eiffel's rules allow this to be type checked by having broken type
checking rules, in my opinion.  Covariant type-checking rules do not
enure type safety statically (you mention this towards the end of your
message).  The newer proposed rules (as yet unimplemented, I believe)
do ensure type safety statically, but by effectively enforcing a
contravariant typing discipline which then prevents this example from
being type checked.  I suspect that the new type checking rules will
never actually be implemented and widely adopted since they will
disallow many existing Eiffel programs which have relied on the
covariant type checking rule.

The main purpose of my example was to convince fans of static type
checking as done in most existing OO languages that these type systems
are not powerful enough to describe relatively simple, useful programs
(and preserve static type safety), and that OO language designers
should incorporate more powerful type systems if they really want to
claim that their type systems do not reduce expressive power over what
exists naturally in dynamically-typed OO languages.

-- Craig Chambers

new@ee.udel.edu (Darren New) (04/27/91)

In article <554@eiffel.UUCP> bertrand@eiffel.UUCP (Bertrand Meyer) writes:
>by craig@leland.Stanford.EDU (Craig Chambers):
>> Here's the problem: we'd like to describe the type of the min
>> function
>one could define `min' in that class as
>	min (other: like Current): like Current is

Doesn't look like you've answered the question here.  What's the type of `min'?
All you've shown is how Eiffel can express restrictions on the patterns of
inputs that min can accept and the type that min will return given certain
input types.  You have not said what the type of min is.  In Smalltalk,
I can say

Class 		Mary 
method 		zelda: thing
code		^ thing

and say that the zelda: message will always return the same type as its
argument.  That doesn't make Smalltalk statically typed.

I'm not bashing Eiffel.  I don't even know Eiffel.  Maybe the response would
be obvious if I *did* know Eiffel.  But so far, it looks like `min' is
a dynamically-typed function.  (Either that, or it is a polymorphic function,
at which point the *declaration* is dynamically typed and the *application*
is statically typed; i.e. "min(other : like Current)" is dynamically typed,
but "min(3,5)" is statically typed as an integer and "min(<<3,4>>,<<5,6>>)" is
statically typed as a list.)
	   -- Darren

-- 
--- Darren New --- Grad Student --- CIS --- Univ. of Delaware ---
----- Network Protocols, Graphics, Programming Languages, FDTs -----
+=+ Nails work better than screws, when both are driven with hammers +=+

bertrand@eiffel.UUCP (Bertrand Meyer) (04/29/91)

In <554@eiffel.UUCP> I offered a straightforward Eiffel solution
to Craig Chambers's problem, supposedly intractable by typed
languages.

In <51986@nigel.ee.udel.edu>, new@ee.udel.edu (Darren New) replies: 
	
	[Quoting from my solution]:
	>> one could define `min' in that class as
	>> 	min (other: like Current): like Current is

	[His comment]:
	> Doesn't look like you've answered the question here.  What's the
	> type of `min'?

Now let us repeat patiently: the type of `min' is `like Current'.
`like' is an Eiffel keyword whose meaning was explained in my message,
and the corresponding typing mechanism (declaration by association)
is makes static typing possible in practice. Thank you for your attention.

	> I don't even know Eiffel. Maybe the response would
	> be obvious if I *did* know Eiffel.

It is generally considered preferable to know first and then criticize.
But it is never too late to know.

Here now is the response by Craig Chambers:
> 
> Eiffel's rules allow this to be type checked by having broken type
> checking rules, in my opinion.  Covariant type-checking rules do not
> enure type safety statically (you mention this towards the end of your
> message).  The newer proposed rules (as yet unimplemented, I believe)
> do ensure type safety statically, but by effectively enforcing a
> contravariant typing discipline which then prevents this example from
> being type checked.  I suspect that the new type checking rules will
> never actually be implemented and widely adopted since they will
> disallow many existing Eiffel programs which have relied on the
> covariant type checking rule.
> 
> The main purpose of my example was to convince fans of static type
> checking as done in most existing OO languages that these type systems
> are not powerful enough to describe relatively simple, useful programs
> (and preserve static type safety), and that OO language designers
> should incorporate more powerful type systems if they really want to
> claim that their type systems do not reduce expressive power over what
> exists naturally in dynamically-typed OO languages.

I have quoted this text in full because I can't repress a feeling of
admiration for the skill it takes to accumulate so many
misrepresentations in so few lines. To call Eiffel's rules
``contravariant'', for example, is a quite remarkable achievement.

This forum has seen the same claims made time and again, and time and
again rebuked, but whenever you cut the dragon's head a new one
grows back. Rational debate is useless; dynamic typing is good,
and static typing is at once bad, useless, and impossible.

I have always felt sympathy towards the biologists who accept
to debate creationists. Now I also understand them better;
one can fight opinions, not articles of faith.

Not having the infinite amount of both time and patience which
it would take to continue, I quit, declaring total rhetorical
defeat.
-- 
-- Bertrand Meyer
Interactive Software Engineering Inc., Santa Barbara
bertrand@eiffel.com

rick@tetrauk.UUCP (Rick Jones) (04/29/91)

In article <556@eiffel.UUCP> bertrand@eiffel.UUCP (Bertrand Meyer) writes:
BM>In <554@eiffel.UUCP> I offered a straightforward Eiffel solution to
BM>Craig Chambers's problem, supposedly intractable by typed languages.
BM>
BM>Here now is the response by Craig Chambers:

CC> Eiffel's rules allow this to be type checked by having broken type
CC> checking rules, in my opinion.  Covariant type-checking rules do not
CC> enure type safety statically (you mention this towards the end of your
CC> message).  The newer proposed rules (as yet unimplemented, I believe)
CC> do ensure type safety statically, but by effectively enforcing a
CC> contravariant typing discipline which then prevents this example from
    ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ !?!
CC> being type checked.  I suspect that the new type checking rules will
CC> never actually be implemented and widely adopted since they will
CC> disallow many existing Eiffel programs which have relied on the
    ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ !?!
CC> covariant type checking rule.
CC> 
CC> The main purpose of my example was to convince fans of static type
CC> checking as done in most existing OO languages that these type systems
CC> are not powerful enough to describe relatively simple, useful programs
CC> (and preserve static type safety), and that OO language designers
CC> should incorporate more powerful type systems if they really want to
CC> claim that their type systems do not reduce expressive power over what
CC> exists naturally in dynamically-typed OO languages.

BM>I have quoted this text in full because I can't repress a feeling of
BM>admiration for the skill it takes to accumulate so many
BM>misrepresentations in so few lines. To call Eiffel's rules
BM>``contravariant'', for example, is a quite remarkable achievement.
BM>
BM>Not having the infinite amount of both time and patience which
BM>it would take to continue, I quit, declaring total rhetorical
BM>defeat.

I have avoided getting embroiled in this debate as I feel it has been rather
academic, and, like Bertrand, I don't have infinite time & patience.

However, the above attack on Eiffel's new type-checking system embodies so many
misunderstandings I feel compelled to take up the cause.  There have been a
number of passing criticisms of the concept, all of which have shown a lack of
understanding, but failed to discuss the issue in any depth.

Although I am usually averse to acronymns, I don't want to write "extended type
checking" or the like any more than I have to, so for now I shall christen it
"Global Type Analysis", or GTA for short (ETC sounds too trivial :-).  The
following explains why it can in fact work.


Fact:	A program is type-safe if no feature is ever called on an object which
	does not support that feature.

The problem is how to establish the truth of this.  The extremes are either a
completely unbreakable, simple static type system, or dynamic typing.  The
former is so strict it is unusable for non-trivial applications.  The latter
requires 100% branch coverage testing, or total static data-flow analysis, both
of which are either impossible or impractical.  They also require an embedded
run-time error trap system.  The compromise is GTA.

To understand what GTA does, it is useful to consider the idea of a partial
type.  Static type systems are always discussed in terms of complete types -
i.e. the type of an object is defined by the total set of features which it
supports.  However, an object can also be considered as having a number of
partial types, represented by all the possible subsets of its features.  Each
of these partial types is clearly a supertype of the complete type, or if you
prefer the other way, the complete type conforms to each of the partial types.

E.g. if object A supports features F, G, & H, then we can draw an implicit
partial type tree as follows:

		F         G         H
		|\       /|\       /|
		| \     / | \     / |
		|   F,G   |   G,H   |
		|    |    |    |    |
		|____|____|____|____|
			  |
			F,G,H

The total number of partial types is a function of the total number of
features, using the formulae from combination theory (which off the top of my
head I forget :-).

If we now consider a class variable in a program (i.e. one which can refer to
an object, whose actual type is dynamic), that variable has an IMPLICIT type.
The implicit type is defined by the set of features which are actually coded as
called from that variable.  This is true regardless of whether the language is
statically or dynamically typed.  If the language is statically typed, then the
variable also has an EXPLICIT type;  this is the complete type of the class for
which it is declared.  Simple static type checking will guarantee that the
implicit type is a supertype of the explicit type.

E.g.	if class A exports the features F, G, & H, then the declaration
	(Eiffel style):

	var: A

	defines var with an explicit type F,G,H.  If the only code which uses
	var if of the form:

	var.F; var.G;

	i.e. there is no occurrence of var.H, then the implicit type of var is
	F,G.

By contrast, the whole basis of a dynamically typed language is that all
variables are implicitly typed by their usage - there are no explicit types.

The "holes" in Eiffel's current type system, pricipally resulting from
covariance, are that it may allow var to refer to an object which is a
supertype of A.  However, var's implicit type is also a supertype of A.
Provided no object is ever attached to var which is a supertype of var's
IMPLICIT type, then the system is type-safe.  This analysis is statically
feasible, and is what GTA sets out to do.

The total effect is not a lot different from running a test with 100% branch
coverage.  Suppose in the above example, as part of a complete program, GTA
detected that an object could be assigned to var which did not support feature
G - it would generate a compile-time error.  The same scenario would only be
safe in a dynamically typed language if the code guaranteed never to call G at
the times when var referred to the offending object.  This implies that there
would be code which tested that object's actual type, and behaved accordingly.
This is contrary to all the principles of object oriented design, and suggests
that the program should be re-written.

Note that it does NOT mean that feature G can never be called on any object
which may at some time be attached to var.  It simply means that feature G must
be applied to those objects which support it via some other variable which
never references objects which don't support G.

It should also be noted that GTA resolves the infamous polygon/rectangle
problem.  Here we have a class POLYGON, whose number of vertices can be
altered.  A descendant of POLYGON is RECTANGLE, which is intuitively a
sub-type, but clearly must have a fixed number of vertices.  Thus RECTANGLE
inherits POLYGON, but does not export the add_vertex feature (and probably some
other non-applicable ones as well).  The effect in terms of types is that
RECTANGLE is not a subtype of POLYGON is terms of complete types, but it is a
subtype of a particular set of POLYGON partial types - those which exclude the
features inapplicable to rectangles.  Thus a variable of type POLYGON may
safely have a RECTANGLE object assigned to it provided the implicit type of the
variable is a supertype of RECTANGLE.  A program which includes an assignment
of a RECTANGLE object to a POLYGON variable of an invalid implicit type will be
caught be the GTA system.

This quite clearly does not "reduce Eiffel's type system to contravariance",
but allows covariance to be used completely safely.  It is worth noting that
Craig Chambers' "hard problem" is only irresolvable in a static type system if
you demand contravariance (which he did in his initial posting) - but then most
problems are irresolvable in a useful way with contravariance.  His point
really seems to be that static type checking with covariance isn't type
checking at all, so dynamic typing is better.  However, GTA allows covariant
flexibility combined with static checking, which in fact checks exactly the
same things that a dynamically typed language checks at run-time.

I do not wish to claim that "static typing is better than dynamic typing",
since such wars are futile - both forms have a useful place.  However, such
debate should focus on useful differences, not on prejudice and dogma.

As a footnote, I have written a lot of code in Eiffel over the last year or so,
with extensive use of genericity and polymorphism, and I have NEVER encountered
a covariance-induced type failure.  I would be extremely surprised if GTA
objected to any existing, reliably working Eiffel programs - if it did, they
would have shown up run-time failures by now.
-- 
Rick Jones, Tetra Ltd.  Maidenhead, Berks, UK
rick@tetrauk.uucp

Any fool can provide a solution - the problem is to understand the problem

dl@g.g.oswego.edu (Doug Lea) (04/29/91)

[I attempted to post this 3 months ago in reply to a similar
posting about contravariance, etc., but apparently the posting
never made it out. Now seems as good a time as any to try again.]

Bertrand Meyer wrote:
> The question represents an attempt on my part to understand
> how the contravariant rule (which may at first be theoretically
>  appealing because it makes type checking easier) can be made to
> work at all in practice. 

I don't think the solution is all that complicated or even controversial.

> Assume the following situation

[Example recast in a C++-ish form -- Sorry (especially since C++
doesn't have any any useful rules about contra- or co- variant
arguments), but I don't know Eiffel syntax well enough.  I also gave
`Register' a return value to make it easier to distinguish the cases.]

The contravariance-breaking declarations look like:

    class Driver { ... };
    class Professional_Driver : public Driver {...};

    class Vehicle
    {
       virtual int  Register(Driver& d) { return 0; }
    };

    class Truck : public Vehicle
    {
      virtual int Register(Professional_Driver& p) { return 1; }
    };

The first question to ask in finding a contravariance-conforming
strategy is what behavior you want in each of the following
situations, assuming Driver d, Professional_Driver p, Vehicle v, and
Truck t:

    [1] v.Register(d);
    [2] v.Register(p);
    [3] t.Register(d);
    [4] t.Register(p);

Most likely, you want cases [1], [2], and [3] to invoke
Vehicle::Register, and case [4] to invoke Truck::Register.

Since this dispatch pattern depends on the types of two kinds of
objects, the way to express it is through some form of multiple
dispatch. In a language directly supporting multiple dispatch (e.g.,
CLOS), it might be stated in this way:

    class Driver { ... };
    class Professional_Driver : public Driver {...};

    class Vehicle {...};
    class Truck : public Vehicle {...};

    int Register(Vehicle& v, Driver& d)            { return 0; }
    int Register(Truck& t, Professional_Driver& p) { return 1; }

This would be handled in the intended manner by CLOS-type resolution
and dispatch rules (which are implictly contravariance maintaining
when the functions are of this form.)

(Note: this is valid in C++ too, but overload resolution is only
done statically, so it doesn't always have the desired effect.)

But this resolution strategy can also be obtained with `manual' double
dispatch in other languages (including, finally, C++ and Eiffel), to
look something like

    class Driver
    {
      virtual int RegisterVehicle(Vehicle& v) { return 0; }
      virtual int RegisterTruck(Truck& t)     { return RegisterVehicle(t); }
    };

    class Professional_Driver : public Driver
    {
      virtual int RegisterTruck(Truck& t) { return 1; }
    };

    class Vehicle
    {
      virtual int Register(Driver& d) { return d.RegisterVehicle(*this); }
    };

    class Truck : public Vehicle
    {
      virtual int Register(Driver& d) { return d.RegisterTruck(*this); }
    };

which is legal, does what you want, and obeys contravariance. You
can always do this conversion mechanically (algorithmically).

A perfectly valid objection is that people don't want to have to do
conversion into double dispatch themselves, especially since the
definition of one special case involves 3 other classes besides the
one programmers have in mind.

I agree with this objection. Languages and their compilers should help
automate this. The CLOS generic function approach is one attractive
method to do this in C++-like and Eiffel-like langauges.

--
Doug Lea  dl@g.oswego.edu || dl@cat.syr.edu || (315)341-2688 || (315)443-1060
|| Computer Science Department, SUNY Oswego, Oswego, NY 13126 
|| Software Engineering Lab, NY CASE Center, Syracuse Univ., Syracuse NY 13244

eliot@cs.qmw.ac.uk (Eliot Miranda) (04/29/91)

In article <boehm.672608274@siria> you write:
>It's not entirely clear to me what you're trying to achieve by viewing
>integer and float as subtypes of number.  They presumably share no code,
>hence code reuse is not the issue.  
They most certainly can.  In Smalltalk-80's arithmetic implementation
class Number provides a number(sorry) of arithmetic operators defined in terms
of other (presumed more primitve operators).
e.g.
Number abs
	^self < 0 ifTrue: [self negated] ifFalse: [self]

e.g.  quo: (rounding division towards zero),  // (rounding division towards -ve
infinity) are implemented in Number in terms of / (exact division) floor
& ceiling.

Number provides the specification & implementation of
the coercion system which allows mixed arithmetic.

Number provides generic implementations of many functions implemented by
float, e.g.
Number cos
	^self asFloat cos

Number provides enumeration & interval creation: to: to:by: to:by:do: etc

At least in Smalltalk, using a class hierarchy that requires subclasses to
implement a certain subset of operators & then implementing other operators
implemented upon them higher in the hierarchy is a common & powerful technique.
The canonical example is the implementation of the relational operators
> >= <= ~= min: max: between:and: etc
in Magnitude in terms of = < and not.
-- 
Eliot Miranda			email:	eliot@dcs.qmw.ac.uk
Dept of Computer Science	ARPA:	eliot%dcs.qmw.ac.uk@nsf.ac.uk
Queen Mary Westfield College	UUCP:	eliot@qmw-dcs.uucp
Mile End Road			Fax:	081 980 6533 (+44 81 980 6533)
LONDON E1 4NS			Tel:	071 975 5229 (+44 71 975 5229)

boehm@parc.xerox.com (Hans Boehm) (04/30/91)

eliot@cs.qmw.ac.uk (Eliot Miranda) writes:

>In article <boehm.672608274@siria> you write:
>>It's not entirely clear to me what you're trying to achieve by viewing
>>integer and float as subtypes of number.  They presumably share no code,
>>hence code reuse is not the issue.  
>They most certainly can.  In Smalltalk-80's arithmetic implementation
>class Number provides a number(sorry) of arithmetic operators defined in terms
>of other (presumed more primitve operators).
>e.g.
>Number abs
>	^self < 0 ifTrue: [self negated] ifFalse: [self]

> ...

Certainly it is possible for them to share code using inheritance, once you
augment the example with other functions.  However the original < and min
example also point out why you might not want to do this.  Implementing min
as a method of number means that ir only applies to numbers, and not to other
objects understanding <.

This argument is perhaps less convincing for abs, since there are fewer kinds
of objects that understand 0, <, and negation.  Other operations like integer
exponentiation again have much wider applicability.

The point is that inheritance is sometimes the right tool, but it often is not.
For the original problem, I believe it is not.  For some expansions of the
original problem, it probably is.  But even then there are static type systems
(e.g.  Cardelli's Quest) that deal with both parametrization and subtyping.

Hans
boehm@xerox.com

Standard disclaimers ...

new@ee.udel.edu (Darren New) (04/30/91)

In article <556@eiffel.UUCP> bertrand@eiffel.UUCP (Bertrand Meyer) writes:
>	> Doesn't look like you've answered the question here.  What's the
>	> type of `min'?
>
>Now let us repeat patiently: the type of `min' is `like Current'.
>`like' is an Eiffel keyword whose meaning was explained in my message,
>and the corresponding typing mechanism (declaration by association)
>is makes static typing possible in practice. Thank you for your attention.

Yup.  I caught that.  What's the type of Current?  If "Current" might
be an integer when the call is "min(4,6)" and "Current" may be a list
when the call is "min(<<3,4>>,<<5,6>>)" then it looks to me like the
single definition of "min" specifices that "min" shall return different
types on different calls.  Whether you call this "dynamic typing" or
"static typing" or "automatic polymorphism" or "generics" or what is
beside the point.  In the definition most of us have been using,
"dynamic typing" means that syntactic elements don't have types. 
Here, it seems to me that the syntactic element "min" has a type which
changes depending on the arguments on the rest of the line of text
representing the functional application.  It looks to me as tho
the *declaration* is dynamically typed and the *application* is
statically typed.  This is one of those inbetweens for which I have
not heard a good buzzword.

>	> I don't even know Eiffel. Maybe the response would
>	> be obvious if I *did* know Eiffel.
>It is generally considered preferable to know first and then criticize.

What criticism?  I merely asked you to clarify the type of "min".
To define a type by saying it is "like current" and then define
"like" but not "current" does not illuminate me to the point where
I fully understand the ramifiations of why it *isn't* dynamic typing.
I really don't care whether Eiffel is dynamically or statically typed.

>But it is never too late to know.

That is certainly true.

>This forum has seen the same claims made time and again, and time and
>again rebuked, but whenever you cut the dragon's head a new one
>grows back. Rational debate is useless; dynamic typing is good,
>and static typing is at once bad, useless, and impossible.

You mean, debate that disagrees with you.  I think that most of the
discussion has been beneficial at least to me.  Just beause you claim
that static typing is all you ever need doesn't mean it is so. Just
because somebody claims that dynamic typing is better doesn't make it
so.  "Rebukes" are generally ineffective when the pperson rebuking
is in a different situation than the person whose beliefs are being
rebuked.  (See, for example, the "formal semantics" thread; people
who communicate with others a lot tend to like formal semantics
more than those who just try to get something out. Formalisms
themselves are neither good nor bad in and of themselves.)

Besides, I still haven't seen a good example of how to do hetrogeneous
lists in a statically-typed language.

>I have always felt sympathy towards the biologists who accept
>to debate creationists. Now I also understand them better;
>one can fight opinions, not articles of faith.

>Not having the infinite amount of both time and patience which
>it would take to continue, I quit, declaring total rhetorical
>defeat.

That's funny.  I don't argue to "win".  I argue to "learn", whether
it be to learn what your faith is, or to learn how you came to a
rational decision.  Even when I finish an argument I've "won" or
"lost" or didn't really finish at all, I usually manage to keep an
open enough mind that I might admit that the other side *could*
have *some* validity to their claims, and thereby expand my own
horizons. By not assuming that since the other side doesn't agree
then they must be working on blind faith, I find that I often manage
to converse long enough to get something out of the conversation,
if only a better understanding of how to deal with people stuck 
in a blind-faith trap.

Of course, we are all busy.  I have no time to learn Eiffel right now
(especially as it does not seem to present any truely novel-to-me
concepts), and you may not have time to continue this argument.
I accept and respect that.  Have fun!
		     -- Darren

-- 
--- Darren New --- Grad Student --- CIS --- Univ. of Delaware ---
----- Network Protocols, Graphics, Programming Languages, FDTs -----
+=+ Nails work better than screws, when both are driven with hammers +=+

rockwell@socrates.umd.edu (Raul Rockwell) (04/30/91)

Darren New:
>  Besides, I still haven't seen a good example of how to do
>  hetrogeneous lists in a statically-typed language.

Sorry, I can't resist (-:

struct dumb { char c; int i; double* d; };

Raul

rick@tetrauk.UUCP (Rick Jones) (04/30/91)

new@ee.udel.edu (Darren New) writes:
> Doesn't look like you've answered the question here.  What's the
> type of `min'?

I feel this does deserve a bit of expansion.  To explain the implication of
"Current" in Eiffel, it is first necessary to point out that routines in Eiffel
can only exist as features of classes.  A class is the only form of code module
in Eiffel, and everything is written within the boundaries of some class.  Thus
there is no such thing as "the" min function - many classes may provide a "min"
function.  A function can only be invoked via a non-void reference to an object
of that class.

A function such as "min" would in fact be written as a function of say a number
class, and would return either the object on which it is called or its
argument, whichever is the smaller.  I.e. (fragment code of class NUMBER)

class NUMBER
export min
feature
	min (other: like Current): like Current is
	do
		if other < Current then
			Result := other
		else
			Result := Current
		end
	end
end

"Current" is the current object, and "like Current" is the same type as the
current object.  So in this example "like Current" is the same as NUMBER.

In use:

	num1, num2, num3: NUMBER ;
	-- ... values assigned to num1 & num2 ...
	num3 := num1.min (num2)

Now if a class INTEGER is a descendant of NUMBER, it inherits "min", but the
types of the argument and return value automatically change without explicit
redefinition.  So:

class INTEGER
inherit NUMBER
end

will give INTEGER a "min" function whose type is INTEGER, and whose argument
must also be (or conform to) an INTEGER.  So if I have:

	num1, num2, num3: NUMBER ;
	int1, int2, int3: NUMBER ;

The following are legal:

	int3 := int1.min (int2)		-- all types the same
	num3 := int1.min (int2)		-- conforming result type
	num3 := num1.min (int2)		-- conforming argument type

but the following aren't:

	int3 := num1.min (num2)		-- wrong result type
	int3 := int1.min (num2)		-- wrong argument type

I hope this helps.

-- 
Rick Jones, Tetra Ltd.  Maidenhead, Berks, UK
rick@tetrauk.uucp

Any fool can provide a solution - the problem is to understand the problem

boehm@parc.xerox.com (Hans Boehm) (05/01/91)

rick@tetrauk.UUCP (Rick Jones) writes:
>...  It is worth noting that
>Craig Chambers' "hard problem" is only irresolvable in a static type system if
>you demand contravariance (which he did in his initial posting) - but then most
>problems are irresolvable in a useful way with contravariance...

Craig Chambers' problem is resolvable in at least some statically typed
languages, as is the Polygon-Rectangle example. There are elegant solutions
that don't involve inheritance at all.  See the discussion in comp.object.

(There may also be some that do use inheritance.
I'm less of an expert on those. But I haven't seen any discussion of the
more aggressive type systems that incorporate inheritance, but obey the
contravariance rule,  e.g. Cardelli's Quest, or Cook's system.) 

The original sentence strikes me as a substantial overgeneralization.  The
problem appears to be unsolvable in a straightforward way using a version of
Eiffel's type system that enforces contravariance.  Nothing else has been
established.

Hans
(boehm@xerox.com)

Usual disclaimers ...

craig@leland.Stanford.EDU (Craig Chambers) (05/01/91)

In article <1146@tetrauk.UUCP>, rick@tetrauk.UUCP (Rick Jones) writes:
|> The "holes" in Eiffel's current type system, pricipally resulting from
|> covariance, are that it may allow var to refer to an object which is a
|> supertype of A.  However, var's implicit type is also a supertype of A.
|> Provided no object is ever attached to var which is a supertype of var's
|> IMPLICIT type, then the system is type-safe.  This analysis is statically
|> feasible, and is what GTA sets out to do.

I don't think you are describing the proposal to fix Eiffel's typing
rules, at least as I understand it.  Bertrand has explicitly stated
that he isn't proposing complete flow analysis of the program to
detect type errors.  If there is a call to feature F *anywhere in the
program* then the type checker assumes that feature F will be called
for any variable declared to be a type that includes F.  This then
prevents the program you are describing to type-check (assuming that
all three features of the declared type in your example are called
somewhere in the program).

Put another way, the type system you describe completely ignores the
declared types of variables, using interprocedural flow analysis to
compute what you call the implicit type of a variable; see the above
quote for an example (the explicit type of "var" is never used).
Although I like the results of this kind of type system (nearly all
legal programs will type check with little effort on the part of the
programmer), I don't believe that the necessary flow analysis is
particularly feasible.  The Typed Smalltalk people do include a
type-checking system that uses flow-sensitive analysis like you
describe (abstract interpretation of the program in the type domain),
but the algorithm is exponential (double-exponential?) in the worst
case.

|> It should also be noted that GTA resolves the infamous polygon/rectangle
|> problem.  Here we have a class POLYGON, whose number of vertices can be
|> altered.  A descendant of POLYGON is RECTANGLE, which is intuitively a
|> sub-type, but clearly must have a fixed number of vertices.  Thus RECTANGLE
|> inherits POLYGON, but does not export the add_vertex feature (and probably some
|> other non-applicable ones as well).  The effect in terms of types is that
|> RECTANGLE is not a subtype of POLYGON is terms of complete types, but it is a
|> subtype of a particular set of POLYGON partial types - those which exclude the
|> features inapplicable to rectangles.  Thus a variable of type POLYGON may
|> safely have a RECTANGLE object assigned to it provided the implicit type of the
|> variable is a supertype of RECTANGLE.  A program which includes an assignment
|> of a RECTANGLE object to a POLYGON variable of an invalid implicit type will be
|> caught be the GTA system.

No.  If there is a call to add_vertex *anywhere in the program*, then
*all* assignments of rectangles to polygons will be declared illegal
by Eiffel's proposed type system.

|> This quite clearly does not "reduce Eiffel's type system to contravariance",
|> but allows covariance to be used completely safely.  It is worth noting that
|> Craig Chambers' "hard problem" is only irresolvable in a static type system if
|> you demand contravariance (which he did in his initial posting) - but then most
|> problems are irresolvable in a useful way with contravariance.

I admit I was a bit hasty in my implication that Eiffel's new rules
are nothing more than requiring contravariance.  I should have said
that the following rules will type-check the same programs, no more
and no less, as Eiffel's new rules:

1) Remove all features that are never invoked in the program.

2) Construct a type hierarchy from the class hierarchy such that one
class is a subtype of another iff it obeys the normal subtype
conformance rules using contravariance.

3) Disallow all assignments in the program where an expression of one
class is being assigned to a variable (or passed as a parameter) that
is declared to be a class that's not a legal supertype.

This phrasing of the rules make it easier to compare Eiffel's new type
system with those of other languages, and highlights the fact that
assignments from one type to another can only take place where normal
contravariant subtyping rules would allow.  This is only natural,
since some form of these rules is necessary to allow static type
safety.  But it does pose problems for existing Eiffel programs that
rely on covariant type checking (more akin to the implicit type
checking that you describe).

If I have misunderstood Eiffel's new rules (again), I'd appreciate
being set straight.  The easiest way to do that is to post an example
program that will type-check under Eiffel's new rules that won't under
the rules I've listed above.

|> His point
|> really seems to be that static type checking with covariance isn't type
|> checking at all, so dynamic typing is better.  However, GTA allows covariant
|> flexibility combined with static checking, which in fact checks exactly the
|> same things that a dynamically typed language checks at run-time.

No, I'm saying that covariance (by itself) isn't statically type safe,
so a better type system is needed.  Some fairly powerful type systems
have been developed for more theoretical languages and for functional
languages, and I'm sure that one could be developed to handle the
"min" example I posed.  The main point is that current popular OO
languages are a far cry from these type systems.

-- Craig Chambers

rick@tetrauk.UUCP (Rick Jones) (05/01/91)

In article <1991Apr30.213115.9990@leland.Stanford.EDU> craig@self.stanford.edu writes:
} In article <1146@tetrauk.UUCP>, rick@tetrauk.UUCP (Rick Jones) writes:
} |> The "holes" in Eiffel's current type system, pricipally resulting from
} |> covariance, are that it may allow var to refer to an object which is a
} |> supertype of A.  However, var's implicit type is also a supertype of A.
} |> Provided no object is ever attached to var which is a supertype of var's
} |> IMPLICIT type, then the system is type-safe.  This analysis is statically
} |> feasible, and is what GTA sets out to do.
} 
} I don't think you are describing the proposal to fix Eiffel's typing
} rules, at least as I understand it.

We clearly have a different interpretation of the proposal.

} Bertrand has explicitly stated
} that he isn't proposing complete flow analysis of the program to
} detect type errors.  If there is a call to feature F *anywhere in the
} program* then the type checker assumes that feature F will be called
} for any variable declared to be a type that includes F.

This is where we disagree.  The proposal as I read it (I have just gone
through my copy again) clearly relates the application of a feature to the
variable used to apply it.  Your interpretation would be no more complex to
implement than the existing checker, just a lot more restrictive.  If this were
the case, I would also consider it to be unusable.

} Put another way, the type system you describe completely ignores the
} declared types of variables, using interprocedural flow analysis to
} compute what you call the implicit type of a variable; see the above
} quote for an example (the explicit type of "var" is never used).

Not quite.  The use of simple type conformance based on inheritance as a
starting point enables the full type checking to be done without actually
indulging in flow analysis.  I will not attempt to formally justify this, as I
am not a language theoretician;  I am merely explaining my understanding of
Bertrand Meyer's description of how it will work.

} Although I like the results of this kind of type system (nearly all
} legal programs will type check with little effort on the part of the
} programmer), I don't believe that the necessary flow analysis is
} particularly feasible.  The Typed Smalltalk people do include a
} type-checking system that uses flow-sensitive analysis like you
} describe (abstract interpretation of the program in the type domain),
} but the algorithm is exponential (double-exponential?) in the worst
} case.

This I can believe, but since Smalltalk starts off with no explicit types or
simple static type conformance rules, this is the only way to work it out.
I believe it _can_ be done in Eiffel, and without flow analysis per-se.

Since we seem to be debating different understandings of Bertrand Meyer's
description, perhaps the simplest thing would be for Bertrand to comment on
which of us (if either!) has got it right.
-- 
Rick Jones, Tetra Ltd.  Maidenhead, Berks, UK
rick@tetrauk.uucp

Any fool can provide a solution - the problem is to understand the problem

anw@maths.nott.ac.uk (Dr A. N. Walker) (05/01/91)

In article <52166@nigel.ee.udel.edu> new@ee.udel.edu (Darren New) writes:
>Besides, I still haven't seen a good example of how to do hetrogeneous
>lists in a statically-typed language.

	Perhaps I might have a go if I understood the question better.

(a) There is no difficulty [in the right language!] in writing a generic
    list constructor, so that given a type T you could instantiate lists
    of T.  This could be a library routine, that needn't know in advance
    what T might be.  This would give a homogeneous list, but different
    calls of the same routine would deliver lists of different types.

(b) Given a set of types, say "student", "room", "teacher", "course",
    there is no difficulty [in languages that don't necessarily have
    generics but do have unions and pointers] in writing routines for
    lists of "union (student, room, teacher, course)".  This would
    give a homogeneous list, but the base contents could be as
    heterogeneous as you [statically] liked.

(c) Given a collection of ways of constructing types, say "array of",
    "pointer to", "procedure with parameters ... returning ...", etc.,
    there is no difficulty [in suitable languages!] in constructing an
    ADT (perhaps some tree structure) which represents arbitrary types
    and building lists of this ADT (and/or of instantiated objects of
    such types).  Compilers do this all the time.

(d) Given a generic pointer, in languages that assume rightly or
    wrongly that all pointers are the same "shape", there is no
    difficulty in building a list of pointers to arbitrary objects.
    Storage allocators do this all the time.

	Are you thinking of something else?  Or is one of (a-d) what
you are after?

	[I have no axe to grind on the general static/dynamic debate.
Dynamic typing is often more convenient, but it is, I still maintain,
*necessary* only when you, the programmer, don't know what types of
objects your program is going to have to deal with or what operations
are going to be effected on them.  That situation might be acceptable
for a quick prototype, but not for a finished system.  That is why it
is easy to describe *situations* in which dynamic typing is desirable,
but we have not seen a completely specified *problem* for which it is
needed.]

-- 
Andy Walker, Maths Dept., Nott'm Univ., UK.
anw@maths.nott.ac.uk

mario@cs.man.ac.uk (Mario Wolczko) (05/02/91)

In article <1146@tetrauk.UUCP>, rick@tetrauk.UUCP (Rick Jones) writes:
> Fact:	A program is type-safe if no feature is ever called on an object which
> 	does not support that feature.

[description of GTA with example deleted]

> supertype of A.  However, var's implicit type is also a supertype of A.
> Provided no object is ever attached to var which is a supertype of var's
> IMPLICIT type, then the system is type-safe.  

> This analysis is statically feasible, and is what GTA sets out to do.
  ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

Nonsense.  It is easy to construct a program which is type-safe by
your definition but cannot be verified as type-correct by static
analysis.  If you could verify all such programs, you would have
solved the Halting Problem.

Consider this program in a dynamically-typed language:
	var X;
	if predicate1
	then X := L1;  /* L1 is an object that has feature F */
	else X := L2;  /* L2 is an object that does not have feature F */
	fi
	X.F;

There is no way at compile time to tell whether this program is
type-safe or not.  Predicate1 could invoke an artbirary amount of
computation, but always result in "true".  It could be based on user
input (isodd(readnum())), which cannot be foreseen.

There will always be programs that are type-safe, but cannot be
verified as type-safe by a static type system.

Mario Wolczko

   ______      Dept. of Computer Science   Internet:      mario@cs.man.ac.uk
 /~      ~\    The University              uucp:      mcsun!ukc!man.cs!mario
(    __    )   Manchester M13 9PL          JANET:         mario@uk.ac.man.cs
 `-':  :`-'    U.K.                        Tel: +44-61-275 6146  (FAX: 6236)
____;  ;_____________the mushroom project___________________________________

craig@leland.Stanford.EDU (Craig Chambers) (05/02/91)

In article <1150@tetrauk.UUCP>, rick@tetrauk.UUCP (Rick Jones) writes:
|> We clearly have a different interpretation of the proposal.
|> 
|> In article <1991Apr30.213115.9990@leland.Stanford.EDU> craig@self.stanford.edu writes:
|> } Bertrand has explicitly stated
|> } that he isn't proposing complete flow analysis of the program to
|> } detect type errors.  If there is a call to feature F *anywhere in the
|> } program* then the type checker assumes that feature F will be called
|> } for any variable declared to be a type that includes F.
|> 
|> This is where we disagree.  The proposal as I read it (I have just gone
|> through my copy again) clearly relates the application of a feature to the
|> variable used to apply it.  Your interpretation would be no more complex to
|> implement than the existing checker, just a lot more restrictive.  If this were
|> the case, I would also consider it to be unusable.

I just finished reading through my copy again, and I think I have to
retract what I said about Eiffel's new rules.  There is some sort of
analysis more precise than "any assignment in the program" that
Bertrand calls "alias analysis" in the proposal.  This analysis tracks
which variables have been assigned to which other variables or formal
parameters, and then does checking of feature application based on the
statically declared types of any variable/parameter which could be
assigned/passed to the receiver of the message and the parameter to
the message.  It's not as precise as the Typed Smalltalk type
inference and checking algorithm, though.  I don't know if your GTA is
closer to (my new interpretation of) Bertrand's proposal or to the
Typed Smalltalk approach.

It's hard for me to guess what the results of Bertrand's proposal are
likely to be in practice.  Will most Eiffel programs type-check, or
will most not type-check (in the interesting cases)?  It's hard to
say.  It's also hard for me to say how long the alias analysis will
take in practice.  Since these rules are much more complex than my
earlier interpretation of them, I'm no longer sure that they guarantee
static type safety, either.

|> Since we seem to be debating different understandings of Bertrand Meyer's
|> description, perhaps the simplest thing would be for Bertrand to comment on
|> which of us (if either!) has got it right.

I think that's probably a wise idea.  But I wish there were some more
illustrative discussion of these new typing rules, complete with
examples.  Perhaps this will be included in the new Eiffel language
book?

To relate these last few messages then to the subject line, I'm no
longer sure whether Eiffel's new rules will handle the "min" example.

-- Craig Chambers

new@ee.udel.edu (Darren New) (05/02/91)

In article <1991May1.143831.2065@maths.nott.ac.uk> anw@maths.nott.ac.uk (Dr A. N. Walker) writes:
>In article <52166@nigel.ee.udel.edu> new@ee.udel.edu (Darren New) writes:
>>Besides, I still haven't seen a good example of how to do hetrogeneous
>>lists in a statically-typed language.

(a) gives you a bunch of homogenous lists.

(b) gives you a list of unions (which can only contain types you consider when
    you write the program).

>(c) Given a collection of ways of constructing types, say "array of",
>    "pointer to", "procedure with parameters ... returning ...", etc.,
>    there is no difficulty [in suitable languages!] in constructing an
>    ADT (perhaps some tree structure) which represents arbitrary types
>    and building lists of this ADT (and/or of instantiated objects of
>    such types).  Compilers do this all the time.

Getting closer.  However, you have to then run this through the
compiler.  (Hermes can let you do this.)  Or are you talking about
something like XDR or a routine to do ASN.1/BER encoding or something?
Then you are implementing dynamic typing in order to obtain hetro
lists.

>(d) Given a generic pointer, in languages that assume rightly or
>    wrongly that all pointers are the same "shape", there is no
>    difficulty in building a list of pointers to arbitrary objects.
>    Storage allocators do this all the time.

But doing this makes it difficult to actually *do* something with
the values you are pointing to.

>	Are you thinking of something else?  Or is one of (a-d) what
>you are after?

I think the canonical example (for me) is something like a resource manager.
I want to be able to write a routine (once) and compile it (once) at
the beginning of my work. This routine should accept pairs of
<name, value> where the value may be anything the language can handle.
It should be able to add and find and remove tuples, and it should be
able to iterate through the list, handing me each tuple (for example,
to find out the total number of bytes used by values). In case (c),
you have to write code for each type to break it into the values you are 
looking for, interpret the tags, etc.  In case (d), you have to already
know the type of the object associated with each name before you can
do anything with it (such as find it's size) or else it turns into case (c).

>	[I have no axe to grind on the general static/dynamic debate.
>Dynamic typing is often more convenient, but it is, I still maintain,
>*necessary* only when you, the programmer, don't know what types of
>objects your program is going to have to deal with or what operations
>are going to be effected on them.  

Right.

>That situation might be acceptable
>for a quick prototype, but not for a finished system.  

Here I must disagree.  Building libraries for other people to use, and
building these libraries such that the library can evolve without
invalidating programs built with previous versions of the library and
vica versa, is an important use for dynamically-typed systems.  I don't
think that your (d) is as safe as a dynamic type system.

I'm starting to think that "dynamic typing" might be somewhat akin to
"object oriented programming" in the sense that it is more a
organization/management technique than an actual programming language
issue.  Just like you can do object-oriented programming in COBOL (but
it isn't very pretty), you can "do" dynamic typing in a static language
(but it isn't very pretty).  If you knew at the start exactly how the
program would work, knew all the programs it would ever have to
interface to, and knew the complete requirements, then you could do it
with static typing.  However, few projects I've ever worked on (and
none I didn't spec by myself for myself) have ever met these
requirements.
	       -- Darren

(Followups via email -- It'll expire before I have a chance to read
news again.)
-- 
--- Darren New --- Grad Student --- CIS --- Univ. of Delaware ---
----- Network Protocols, Graphics, Programming Languages, FDTs -----
+=+ Nails work better than screws, when both are driven with hammers +=+

bertrand@eiffel.UUCP (Bertrand Meyer) (05/02/91)

First I would like to thank Rick Jones for a very clear expose
of system-level type rules, and Craig Chambers for not overreacting
to my somewhat overreacting response to his message. That was
very classy of him. I forgot my Net Rule #1 -
if you are going to post anything negative, let
a night pass first. True, I was rather irritated by Mr. Chambers's
message, since I had the impression that it ignored my earlier answer
and was starting on the old route again. But then Rule #2 says
either you don't join the net or you have to accept that
repeating things is part of the game.
I apologize for the heated reaction. (What worries me is that I did
not receive the usual hate mail this time, only one gentle note
of reproach; am I doing something wrong?)

Now about message <1991May1.194620.1141@leland.Stanford.EDU>
by craig@leland.Stanford.EDU (Craig Chambers):

[!! indicates Mr. Chambers's quotations from message by Rick Jones.
Question numbers in square brackets added by BM.]

> It's hard for me to guess what the results of [system-level checking] are
> going to be in practice. [1]  Will most Eiffel programs type-check, or
> will most not type-check (in the interesting cases)?  It's hard to
> say. [2]  It's also hard for me to say how long the alias analysis will
> take in practice.  Since these rules are much more complex than my
> earlier interpretation of them, I'm no longer sure that they guarantee
> static type safety, either.

> !! [3] Since we seem to be debating different understandings of Bertrand Meyer's
> !! description, perhaps the simplest thing would be for Bertrand to comment on
> !! which of us (if either!) has got it right.

> I think that's probably a wise idea. [4]  But I wish there were some more
> illustrative discussion of these new typing rules, complete with
> examples. [5] Perhaps this will be included in the new Eiffel language
> book?

> [6] To relate these last few messages then to the subject line, I'm no
> longer sure whether Eiffel's new rules will handle the "min" example.

Some partial answers:

[1] I am convinced that 99.5% of Eiffel systems will typecheck
under the complete rules. (I don`t like to talk about the ``new
rules'' because for me they were always there implicitly,
although not completely stated.) The 0.5% that will not
typecheck will be rejected because of actual inadequacies
which could have led to incorrect situations at run time;
full checking will thus be beneficial in this case.

[2] Clearly we are working hard to make the full type analysis
very efficient. I'd rather refrain from any more boasting until
we have actual timing figures to announce, but I am very optimistic.

[3] I think Rick Jones has presented a quite clear picture
of the system-level type validity rules.

[4] Yes, there is a need for more examples. I have tried to
be as clear as possible in the chapter on Type Checking in the
forthcoming revised version of ``Eiffel: The Language''.
I'd like to be able to extract a subset of that chapter
and post it on comp.lang.eiffel, but frankly I don't see
how I can find the time to do this in the next few weeks.
On the other hand, the paper ``Static Typing for Eiffel'',
which was posted twice (either on comp.object or comp.lang.eiffel,
I forgot), presents a reasonably coherent view in spite of a few
material errors. (I have not published that paper in a widely
available printed form because I couldn't think of a publication
or conference that would have accepted it. It is part of a book
that our company distributes, ``An Eiffel Collection''.)

[5] Yes, to some extent. See above.

[6] The complete rules certainly handle the `min' example.
Don't forget, they are the same as the old rules; they simply exclude
certain erroneous cases which would have escaped the incomplete
rules. But they certainly don't limit the expressive power
of the language.

Let me expand. (Since my earlier message conceded defeat,
just consider this as just playing for sheer fun once the ball
game is over.) My fundamental disagreement with Mr. Chambers
is that I do *not* think the problem of static vs. dynamic
typing is one of expressive power. If the type system has
been designed properly then types help you, rather than
constraining you. They make your software much more clear
(through declarations, i.e. useful redundancy); they help the compiler
generate good code; and they enable a static checker (usually
a part of the compiler) to catch errors early rather than late in the
development cycle.

This is only true, of course, if the type system is complete enough;
this means the presence of genericity, constrained and
unconstrained, of the reverse assignment attempt for
forcing a type on a known object,
and of anchored declarations (the `like something'
type construct). Without these mechanisms, static typing in
an object-oriented language is, as believe,
either impossible or simply a joke (as in C extensions when you spend
your time casting back and forth between pointer types).
Furthermore, although this is more controversial, I am convinced by
my experience, confirmed by that of many Eiffel users
and by the absence of any practical argument to the contrary,
that for typing to work in practice with inheritance requires a 
covariant redefinition policy. If the mathematical models for
contravariance are simpler, then that's too bad for mathematics.
Denotational semantics 0, software engineering 1.
(By the way I love denotational semantics, even wrote a book on it,
but I believe that scientists should build theories to fit the practice,
not the other way around.)

As a consequence I believe that *conceptually* a good statically typed
O-O languages is *always* better than a dynamically typed one,
because you don't lose anything: if what you want is a fast,
non-type-checking interpreter or compiler, then you can always build one
for a statically typed languages; programmers then won't lose anything as
compared to Smalltalk or CLOS, save for the effort needed to write a
few declarations, which they'll probably find helpful anyway.
But the reverse is not true: if you have a dynamically typed language,
you will *never* be able to write a type checker for it because it
would lack the necessary information. (ML fans might disagree here,
but we'll have to wait until they have produced OOML.)

Does this mean that static typing is always good and dynamic
typing always bad? (I can hear the rumblings: who is the creationist
here?) The answer would be yes except for one strong argument in
favor of dynamically typed languages: they can be processed very
quickly, enabling developers to try out new ideas without the
interference of a static type checker, which may take some time to
perform its duties. If speed of development is more important than
reliability and efficiency of the resulting product, this makes
dynamically typed languages attractive if they are backed by tools
ensuring a fast turnaround.

In other words, I don't think, as Mr. Chambers does, that the static
vs. dynamic debate is a conceptual discussion at all.
Conceptually, static wins hands down every time.
What the debate is about is much more mundane:

	It's purely a question of implementation.

If we were able to build static checkers that were totally
unobtrusive performance-wise, and did their work in - say -
ten seconds after a comparatively small change even to a very large
system, then who in the world would forsake the extra benefits of type
checking?

Solving this problem - that is to say, a Very Fast Reexecution Cycle,
comparable to the change-to-reexecute cycle of the best interpreters,
without sacrificing any of the fantastic advantages of full type checking -
has been our obsession for several years. We are convinced we now have
the technology to do it, but no one has to believe this until the day
it's out on the desks of Eiffel users. You can count on us for
not sparing our time to make this happen as quickly as possible,
and for not being too shy about it then.
-- 
-- Bertrand Meyer
Interactive Software Engineering Inc., Santa Barbara
bertrand@eiffel.uucp

rick@tetrauk.UUCP (Rick Jones) (05/02/91)

In article <boehm.673043278@siria> boehm@parc.xerox.com (Hans Boehm) writes:
} rick@tetrauk.UUCP (Rick Jones) writes:
} >...  It is worth noting that
} >Craig Chambers' "hard problem" is only irresolvable in a static type system if
} >you demand contravariance (which he did in his initial posting) - but then most
} >problems are irresolvable in a useful way with contravariance...

} The original sentence strikes me as a substantial overgeneralization.  The
} problem appears to be unsolvable in a straightforward way using a version of
} Eiffel's type system that enforces contravariance.  Nothing else has been
} established.

Criticism accepted - I was assuming a context of types related by inheritance,
which was the general context of the discussion.  The statement was not
intended to be as sweeping as it might sound.

-- 
Rick Jones, Tetra Ltd.  Maidenhead, Berks, UK
rick@tetrauk.uucp

Any fool can provide a solution - the problem is to understand the problem

rick@tetrauk.UUCP (Rick Jones) (05/02/91)

In article <1991May1.143831.2065@maths.nott.ac.uk> anw@maths.nott.ac.uk (Dr A. N. Walker) writes:
]  [ ... ]
] Dynamic typing is often more convenient, but it is, I still maintain,
] *necessary* only when you, the programmer, don't know what types of
] objects your program is going to have to deal with or what operations
] are going to be effected on them.  That situation might be acceptable
] for a quick prototype, but not for a finished system.  That is why it
] is easy to describe *situations* in which dynamic typing is desirable,
] but we have not seen a completely specified *problem* for which it is
] needed.

This has to be the most succinct description of the relative advantages of
static and dynamic typing that I have seen - thank you, I agree entirely.

-- 
Rick Jones, Tetra Ltd.  Maidenhead, Berks, UK
rick@tetrauk.uucp

Any fool can provide a solution - the problem is to understand the problem

rick@tetrauk.UUCP (Rick Jones) (05/02/91)

In article <1991May1.194620.1141@leland.Stanford.EDU> craig@self.stanford.edu writes:
> [ ... ]
>I wish there were some more
>illustrative discussion of these new typing rules, complete with
>examples.  Perhaps this will be included in the new Eiffel language
>book?

I can't say for certain, but I have just received some personal mail from
Bertrand Meyer in which he mentions that he has been extremely busy just
recently finishing the book (which probably explains his lack of patience!).

It should be available very soon now - there are apparently a lot of advance
orders for it.  In the meantime, I shall try to compose an illustrative example
which will explain the solution as I see it.

>To relate these last few messages then to the subject line, I'm no
>longer sure whether Eiffel's new rules will handle the "min" example.

I think it can, but I guess we shall have to wait and see.

-- 
Rick Jones, Tetra Ltd.  Maidenhead, Berks, UK
rick@tetrauk.uucp

Any fool can provide a solution - the problem is to understand the problem

cjeffery@optima.UUCP (Clinton Jeffery) (05/02/91)

From article <566@eiffel.UUCP>, by bertrand@eiffel.UUCP (Bertrand Meyer):
> If we were able to build static checkers that were totally
> unobtrusive performance-wise, and did their work in - say -
> ten seconds after a comparatively small change even to a very large
> system, then who in the world would forsake the extra benefits of type
> checking?

I would.  I am not willing to type one keystroke (e.g. type declarations)
more than I have to in order to satisfy your need for everyone to do so.
What is this sweeping generalization doing here after your very nice
concession to dynamically typed languages earlier in your post?

The closest I am willing to come to your cumbersome world is to let my
type inference system insert comments into my code when it notices I am
using a variable for different types of values at different times.

Keystrokes.  How many keystrokes does it take me to solve my problem?

rockwell@socrates.umd.edu (Raul Rockwell) (05/02/91)

Bertrand Meyer:
> But the reverse is not true: if you have a dynamically typed language,
> you will *never* be able to write a type checker for it because it
> would lack the necessary information. 
...
> In other words, I don't think, as Mr. Chambers does, that the static
> vs. dynamic debate is a conceptual discussion at all.
> Conceptually, static wins hands down every time.

I disagree.  Completely.

If the language includes the empty function (one which can not be
closed -- any value applied results in a "domain error" or "message
not understood"), you have a very clear mechanism to express type
errors.  (any predicate which allows application of that function...)

The main issue I see between static typing and dynamic typing is that
static typing is, well.. static.  The difference between static typing
and dynamic typing is very analogous to the difference between
evaluation to a constant, and evaluation to a function.

Of course, what I call "dynamic typing" others may call "static
typing" -- I'm still pondering the question of user defined types.
[With sufficiently powerful primitive types, and sufficiently powerful
function re-write capability, what does user defined typing buy you?]

[[[ Actually, I'm struggling with some specific models for typing.
Basically, I just haven't found a way of expressing user types that is
as expressive as I'd like.  Yet. ]]]

Raul Rockwell

olson@juliet.ll.mit.edu ( Steve Olson) (05/02/91)

In article <2672@optima.cs.arizona.edu> cjeffery@optima.UUCP (Clinton Jeffery) writes:
   From article <566@eiffel.UUCP>, by bertrand@eiffel.UUCP (Bertrand Meyer):
   > If we were able to build static checkers that were totally
   > unobtrusive performance-wise, and did their work in - say -
   > ten seconds after a comparatively small change even to a very large
   > system, then who in the world would forsake the extra benefits of type
   > checking?

Because static type checking mostly solves problems introduced by ... 
static typing.  Now, if I'm forced to use static typing, there is no
question that I would also want a good type checker.  And I have no
problem with the notion that as static type checkers go, Eiffel has
a very good one.

   I would.  I am not willing to type one keystroke (e.g. type declarations)
   more than I have to in order to satisfy your need for everyone to do so.
   What is this sweeping generalization doing here after your very nice
   concession to dynamically typed languages earlier in your post?

Um, well, in my opinion, the extra keystrokes involved are the weakest of
the arguments against static typing.  I mean, the botom line isn't keystrokes,
its total programmer effort.  And of course, in many applications, machine
effort must be considered also.  I don't mind typing the declarations so much
as I mind being forced to specify my data objects at the bits 'n' bytes level.
("32 or 64 bits, pal, thats all we got here, and you better not get 'em
mixed up either!")

--
-- Steve Olson
-- MIT Lincoln Laboratory
-- olson@juliet.ll.mit.edu
--

guest@alfrat.uucp (Mr. Guest User) (05/03/91)

This is actually a slight detour from the main thread of this
discussion but C'est la vie.

In article <566@eiffel.UUCP> bertrand@eiffel.UUCP (Bertrand Meyer) writes:

>covariant redefinition policy. If the mathematical models for
>contravariance are simpler, then that's too bad for mathematics.
>Denotational semantics 0, software engineering 1.
>(By the way I love denotational semantics, even wrote a book on it,
>but I believe that scientists should build theories to fit the practice,
>not the other way around.)
>

Does this mean that there exists no formal description of Eiffel
in terms of Denotational Semantics? If this is the case, what
method is used to define the semantics of Eiffel (including
the semantics of the type checking)? I have heard (correct me
if I'm wrong) that the syntax of Eiffel is now Public Domain
and assume that other vendors must be currently developing
their own compilers. How can we be sure that implementations
of Eiffel comply to Bertrand Meyers definition, I would hate
to see Eiffel take the same road as PASCAL.

For the record, I too am a fan of Denotational Semantics and
also of Software Engineering, I'd like to see the score as
Denotational Semantics 1, Software Engineering 1.

Dave Cullen                       ...!unido!alfrat.uucp!dave

rick@tetrauk.UUCP (Rick Jones) (05/03/91)

In article <2479@m1.cs.man.ac.uk> mario@cs.man.ac.uk (Mario Wolczko) writes:
>In article <1146@tetrauk.UUCP>, rick@tetrauk.UUCP (Rick Jones) writes:
>> [ ... ]  However, var's implicit type is also a supertype of A.
>> Provided no object is ever attached to var which is a supertype of var's
>> IMPLICIT type, then the system is type-safe.  
>
>> This analysis is statically feasible, and is what GTA sets out to do.
>  ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
>
>Nonsense.  It is easy to construct a program which is type-safe by
>your definition but cannot be verified as type-correct by static
>analysis.  If you could verify all such programs, you would have
>solved the Halting Problem.

This is true, and static analysis will always be pessimistic.  What GTA does do
is to establish whether there are any assigments of objects in the whole
program, which could (directly or indirectly) result in an object which does
not support some feature being attached to a variable which calls the feature.
This ignores run-time branches, so there is an implicit assumption that every
possible branch combination may at some time be executed.  Thus your example:

>	var X;
>	if predicate1
>	then X := L1;  /* L1 is an object that has feature F */
>	else X := L2;  /* L2 is an object that does not have feature F */
>	fi
>	X.F;

will be rejected.  However, for normal, well-designed programs, this is not a
problem.  A program such as this example is inherently unreliable, even though
thorough testing and use may never have produced the conditions required for
failure.  A safer version would be:

>	var X;
>	if predicate1
>	then X := L1;  /* L1 is an object that has feature F */
>	else X := L2;  /* L2 is an object that does not have feature F */
>	fi
>	if predicate1
>	then X.F;
>	fi

but this is not good object oriented design.  The program is making branch
decisions based (indirectly) on the object's actual type.  Good OO design uses
abstraction and polymorphism to produce code which is independent of the
object's actual type.  A checkably safe version is:

>	var X, Y;
>	if predicate1
>	then Y := L1;  /* L1 is an object that has feature F */
>	     Y.F;
>	     X := Y;
>	else X := L2;  /* L2 is an object that does not have feature F */
>	fi

Feature F has been safely applied to object L1 via Y, subsequently feature F
may not be applied via X, since X may or may not refer to object L1.  Although
this appears to make a simple program more complex, the example is artificial.
It is more likely that the equivalent assignments are across different modules,
where there would have to be separate declarations of X and Y anyway.  If the
situation of the first example existed, but distributed across different source
modules, the unreliability would go unnoticed.  If the second existed in the
same way, it would be very clumsy code.  GTA would pick up both these,
suggesting a more robust or cleaner design should be used.

I maintain that a well designed program which avoids branch decisions based on
type will check correctly under the rules of GTA.  I do not believe even the
pessimistic checking of GTA is possible in a dynamically typed language - it
requires explicit type declarations, and a conventional (e.g. inheritance
based) mechanism for defining basic type conformance as a starting point.  I
cannot produce a rigorous justification of this viewpoint, though.

The theory of this type-checking system was discussed quite thoroughly by
Bertrand Meyer in the article he posted last year.  If you don't have it and
would like a copy (it's about 1500 lines), I can email it to you.

-- 
Rick Jones, Tetra Ltd.  Maidenhead, Berks, UK
rick@tetrauk.uucp

Any fool can provide a solution - the problem is to understand the problem

adam@visix.com (05/04/91)

In article <1991May1.143831.2065@maths.nott.ac.uk>, anw@maths.nott.ac.uk (Dr A. N. Walker) writes:

|> 	[I have no axe to grind on the general static/dynamic debate.
|> Dynamic typing is often more convenient, but it is, I still maintain,
|> *necessary* only when you, the programmer, don't know what types of
|> objects your program is going to have to deal with or what operations
|> are going to be effected on them.  That situation might be acceptable
|> for a quick prototype, but not for a finished system.  That is why it
|> is easy to describe *situations* in which dynamic typing is desirable,
|> but we have not seen a completely specified *problem* for which it is
       ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
|> needed.]

Yes, a completely specified problem is, by definition, one in which
you know all the objects and operations, ergo there is no way to
completely specify a problem that requires dynamic typing.

However, I think you overemphasize the importance of completely
specified problems.  In my view, underspecified problems are not
just quick prototypes.  System browsers are underspecified.
AI programs are underspecified.

Learning starts with an underspecified state.  A system that learns,
or helps you learn, must be underspecified.

You learn how to completely specify a problem by attempting various
underspecified solutions.

I see this as the fundamental difference between computer science and
software engineering.

|> -- 
|> Andy Walker, Maths Dept., Nott'm Univ., UK.
|> anw@maths.nott.ac.uk

Adam

amanda@visix.com (Amanda Walker) (05/04/91)

In article <1991May3.184332.28319@visix.com> adam@visix.com writes:

   However, I think you overemphasize the importance of completely
   specified problems.  In my view, underspecified problems are not
   just quick prototypes.  System browsers are underspecified.
   AI programs are underspecified.

Indeed.  In fact, one of the advantages I have found in OO design and
implementation is that it makes it possible to attack problems over
incompletely specified domains.  In particular, dynamic type systems
allow existing code to work over domains which were not originally
specified, as long as those domains have the necessary
characteristics.  A trivial example of this idea is that a heapsort
should operate on any set of objects for which I can provide
comparison and exchange operations.

One all too common example of this situation is using a precompiled
module with a new data type (i.e., code reuse at the binary level),
which we can expect to happen more and more often as OO concepts
become more "mainstream."
--
Amanda Walker						      amanda@visix.com
Visix Software Inc.					...!uunet!visix!amanda
-- 
"It's not the tragedies that kill us, it's the messes."	   --Dorothy Parker

mario@cs.man.ac.uk (Mario Wolczko) (05/04/91)

In article <1991May3.184332.28319@visix.com>, adam@visix.com writes:
> |> That is why it
> |> is easy to describe *situations* in which dynamic typing is desirable,
> |> but we have not seen a completely specified *problem* for which it is
>        ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
> |> needed.]
> 
> Yes, a completely specified problem is, by definition, one in which
> you know all the objects and operations, ergo there is no way to
> completely specify a problem that requires dynamic typing.

Indeed.  One of the main ideas behind inheritance is that you leave
some things open so that classes may be reused in unanticipated ways,
ie you are building components to assist in the solution of problems
you do not even know about yet!

Mario Wolczko

   ______      Dept. of Computer Science   Internet:      mario@cs.man.ac.uk
 /~      ~\    The University              uucp:      mcsun!ukc!man.cs!mario
(    __    )   Manchester M13 9PL          JANET:         mario@uk.ac.man.cs
 `-':  :`-'    U.K.                        Tel: +44-61-275 6146  (FAX: 6236)
____;  ;_____________the mushroom project___________________________________

davidm@uunet.UU.NET (David S. Masterson) (05/06/91)

>>>>> On 4 May 91 15:39:18 GMT, mario@cs.man.ac.uk (Mario Wolczko) said:

Mario> Indeed.  One of the main ideas behind inheritance is that you leave
Mario> some things open so that classes may be reused in unanticipated ways,
Mario> ie you are building components to assist in the solution of problems
Mario> you do not even know about yet!

Hmmm.  Interesting way of looking at inheritance and nothing wrong with it.
Just curious, though (and this may be very related to the question at hand),
do most people use inheritance to build down from some underspecified level to
something more specific to the problem at hand or do most people build up from
some highly specific to something that can be more easily generalized to later
problems?  If you say either-or (depending on the circumstances), what are the
circumstances that make you choose one over the other?

I think people will find that the answers to this will show the differences in
how people attack problems and that these differences have led to static
versus dynamic specification of problems.
--
====================================================================
David Masterson					Consilium, Inc.
(415) 691-6311					640 Clyde Ct.
uunet!cimshop!davidm				Mtn. View, CA  94043
====================================================================
"If someone thinks they know what I said, then I didn't say it!"

rick@tetrauk.UUCP (Rick Jones) (05/07/91)

In article <1513@fang.dsto.oz> dch@aeg.dsto.oz.au (Dave Hanslip) writes:
| rick@tetrauk.UUCP (Rick Jones) writes:
| 
| >anw@maths.nott.ac.uk (Dr A. N. Walker) writes:
| >]  [ ... ]
| >] That is why it
| >] is easy to describe *situations* in which dynamic typing is desirable,
| >] but we have not seen a completely specified *problem* for which it is
| >] needed.
| 
| >This has to be the most succinct description of the relative advantages of
| >static and dynamic typing that I have seen - thank you, I agree entirely.
| 
| 
| Agree you may, but very few real-world, complex problems are ever completely 
| specified. Even if the specification process is rigorous, the customer never 
| knows completely what he/she wants and over the life of software the
| requirements will inevitably change. That is why it's not only acceptable in
| a finished system, but desirable.

Note that I didn't say that static typing is always better, but that this is a
very good definition of the strengths of each approach.  I know I have been
arguing the case for Eiffel's static type checking mechanisms - that's because
I am an Eiffel user, and some of the type checking concepts were clearly
mis-understood by some contributors to this newsgroup.

The system I am developing is in fact a two-tier model.  In simple terms, there
is a statically typed, hard-coded lower level written by application
programmers (in Eiffel), which supports a higher level, dynamic model which
will be configurable by users.  The high level model is in effect dynamically
typed (although it isn't a "programming language" in the conventional sense).
This enables the dynamic model to be configured very freely, but without any
danger of breaking the static model.

In the context of Dr. Walker's statement, the low level model has a behaviour
which IS completely specified - the system would not be reliable if it wasn't.
The high level model is not completely specified, and so is open to change and
is dynamic.
-- 
Rick Jones, Tetra Ltd.  Maidenhead, Berks, UK
rick@tetrauk.uucp

Any fool can provide a solution - the problem is to understand the problem

mario@cs.man.ac.uk (Mario Wolczko) (05/08/91)

In article <CIMSHOP!DAVIDM.91May5120037@uunet.UU.NET>, cimshop!davidm@uunet.UU.NET (David S. Masterson) writes:
    do most people use inheritance to build down from some
    underspecified level to something more specific to the problem at
    hand or do most people build up from some highly specific to
    something that can be more easily generalized to later problems?
    If you say either-or (depending on the circumstances), what are
    the circumstances that make you choose one over the other?

I can only speak for myself.  When trying to solve a specific problem
I will try to identify aspects of that problem which are of a general
nature, and which could be reused.  (Example: in writing a compiler, I
have developed classes to support directed acyclic graphs in a general
way).  In some cases, there will be parts of a class that are very
specific to the problem at hand, and then I will try to factor the
class into two classes -- an "abstract", reusable superclass, and a
"concrete" subclass.  Sometimes this factoring can be repeated,
leading to a hierarchy of abstract classes, with concrete classes at
the leaves.

Sometimes the abstract class will already be there, and I just have to
provide a concrete subclass, which is usually easy.  (I do all my
serious OO programming in Smalltalk, which has lots of abstract
classes.)  Occasionally I find a concrete class which does part of
what I want, but perhaps something else which I don't, and then I try
to split that class into abstract and concrete pieces, such that I can
inherit from the abstract part to get what I want.  (Splitting an
existing class while remaining compatible can be very hard.)  In any
case I try to avoid having one concrete class inherit from another
concrete class.

Mario Wolczko

   ______      Dept. of Computer Science   Internet:      mario@cs.man.ac.uk
 /~      ~\    The University              uucp:      mcsun!ukc!man.cs!mario
(    __    )   Manchester M13 9PL          JANET:         mario@uk.ac.man.cs
 `-':  :`-'    U.K.                        Tel: +44-61-275 6146  (FAX: 6236)
____;  ;_____________the mushroom project___________________________________

mario@cs.man.ac.uk (Mario Wolczko) (05/08/91)

In article <1158@tetrauk.UUCP>, rick@tetrauk.UUCP (Rick Jones) writes:
[long defence of global type analysis, followed by critique of my 6
line program fragment]

But now you're moving the goalposts.  Your original posting stated
quite unequivocally that global type analysis only rejected programs
that were not type-safe, and my example showed that this was false.
(The example was not meant to be particularly meaningful, only to make
the point.)

Describing such a program as "inherently unreliable" is similarly
bogus --- by appropriate substitution I could make it into a program
that was provably correct.

Having retreated from the earlier claim, you now claim that only
well-designed programs will be amenable to global type analysis, a
much more subjective claim.  

I feel sure that we could argue until blue in the face as to what
constitutes a well-designed program.  My guess is that your bottom
line will be that any program which is not amenable to global type
analysis is not well-designed!

I will make only two further observations:
1. Eiffel's "reverse assignment attempt" is a run-time type test
   anyway, and
2. The most common "type error" in Smalltalk is sending a message to
   nil, ie forgetting to initialize a variable.  Static typing doesn't
   help here at all.

Mario Wolczko

   ______      Dept. of Computer Science   Internet:      mario@cs.man.ac.uk
 /~      ~\    The University              uucp:      mcsun!ukc!man.cs!mario
(    __    )   Manchester M13 9PL          JANET:         mario@uk.ac.man.cs
 `-':  :`-'    U.K.                        Tel: +44-61-275 6146  (FAX: 6236)
____;  ;_____________the mushroom project___________________________________

sakkinen@jyu.fi (Markku Sakkinen) (05/09/91)

In article <2500@m1.cs.man.ac.uk> mario@cs.man.ac.uk (Mario Wolczko) writes:
> ...
>I will make only two further observations:
> ...
>2. The most common "type error" in Smalltalk is sending a message to
>   nil, ie forgetting to initialize a variable.  Static typing doesn't
>   help here at all.

Right, with type systems defined as they usually are, trying to access
'nil' cannot be caught as a type error even in statically typed languages.
The story becomes different if pointer types as such are defined as "strict",
i.e. 'nil' not allowed, and conventional (possibly 'nil') pointers are
then regarded as union types.  Of course, one could not very well build
typical dynamic, recursive data structures using only strict pointers.

Markku Sakkinen
Department of Computer Science and Information Systems
University of Jyvaskyla (a's with umlauts)
PL 35
SF-40351 Jyvaskyla (umlauts again)
Finland
          SAKKINEN@FINJYU.bitnet (alternative network address)

rick@tetrauk.UUCP (Rick Jones) (05/10/91)

In article <2500@m1.cs.man.ac.uk> mario@cs.man.ac.uk (Mario Wolczko) writes:
>In article <1158@tetrauk.UUCP>, rick@tetrauk.UUCP (Rick Jones) writes:
>[long defence of global type analysis, followed by critique of my 6
>line program fragment]
>
>But now you're moving the goalposts.  Your original posting stated
>quite unequivocally that global type analysis only rejected programs
>that were not type-safe, and my example showed that this was false.

Well, I didn't think I was moving them - but I accept that I probably
didn't make clear enough where I was planting them to start with.

>I feel sure that we could argue until blue in the face as to what
>constitutes a well-designed program.  My guess is that your bottom
>line will be that any program which is not amenable to global type
>analysis is not well-designed!

You could be right - how one approaches design is to some degree a subjective
matter.  The constraints of Eiffel's type system, including GTA (which Bertrand
Meyer commented has always been present as implicit rules anyway), do not
present themselves to me as impediments to program design - if anything, I find
them to be aids.  I say this from practical experience, not just ideology.

I have not used Smalltalk for any serious work, only played with it, but my
feeling is that if I were to tackle a project with it, I would design things in
a different way.  I can't say exactly how it would be different, I would need
some experience to say any more.

I tend to get the feeling that a lot of the "static v. dynamic" debate is in
fact a debate between programmers of the procedural-based school (typified by
Eiffel and C++), and those of the Smalltalk school.  The underlying styles are
so different that the typing issue is really only one part of the distinction.
A design which works well in Eiffel may not transfer directly to Smalltalk, nor
vice-versa.  However, the same _problem_ can be solved equally well in either
environment, if given to designers/programmers with equivalent familiarity in
the respective systems;  they will just build it differently.

Since everyone works best in the environment in which they feel most familiar,
they naturally tend also to feel it is the best environment.  There is in fact
no absolute criteria of "best" - it is in the end all subjective.

>I will make only two further observations:
>1. Eiffel's "reverse assignment attempt" is a run-time type test
>   anyway, and

True, but being an assignment between types it operates within the type system
rather than against it, which I find works very cleanly.

>2. The most common "type error" in Smalltalk is sending a message to
>   nil, ie forgetting to initialize a variable.  Static typing doesn't
>   help here at all.

And also, of course, in Eiffel (trappable by an implicit assertion).  Unless
there are no void references permitted - which introduces other problems -
there seems to be no practical way around this.  Now if this could be deduced
statically ...! but I guess that takes us into formal proofs, at which point I
exit stage left and leave it to others.

-- 
Rick Jones, Tetra Ltd.  Maidenhead, Berks, UK
rick@tetrauk.uucp

Any fool can provide a solution - the problem is to understand the problem

preece@urbana.mcd.mot.com (Scott E. Preece) (05/16/91)

In article <1991May3.212005.29453@visix.com> amanda@visix.com (Amanda Walker) writes:
|
|   ...  A trivial example of this idea is that a heapsort
|   should operate on any set of objects for which I can provide
|   comparison and exchange operations.
---
OK, let me expose my ignorance[1].  I'd like to see how object-oriented
design works.  Would you care to sketch the design for this?  My naive
model would be that a heap is an object and that it has add and remove
methods to take objects of unspecified type and stick them into the heap
in the right place.  So where do the comparison and exchange procedures
go?  I see two alternatives[2]: (1) the heap object has compare and
exchange methods that "know about" all of the types, figure out what's
involved, and operate on the member objects or (2) the objects have
methods that report their "comparable value" and size in canonical forms
that a generic routine in the heap object can operate on.  Neither of
these makes me awfully happy.  In case (1) the heap has to know much too
much about the objects, making it hard to believe that the desired
level of reusability can be reached.  In case (2) we have removed the
heap's ability to choose what attribute of the object is used for
sorting, since the object owns production of the sort key.

So where is "the right place" to put the comparison and exchange
knowledge, in the heap, in the objects living in the heap, or somewhere
else that I missed entirely?

NOTES:
[1] I first used SIMULA in 1973, but the way I used it didn't look much
like what people seem to be saying about o-o today.  I have a little
trouble with the notion that process should go away and design should be
strictly composition of objects (I've seen several statements along
those lines).  At any rate, I'm being quite honest here in asking for
examples of how an experienced O-O designer would think about the problem.
[2] Well, actually, of course, I see a lot of other alternatives, but
they mostly could be characterized as one of these, with embroidery.

--
scott preece
motorola/mcg urbana design center	1101 e. university, urbana, il   61801
uucp:	uunet!uiucuxc!udc!preece,	 arpa:	preece@urbana.mcd.mot.com
phone:	217-384-8589			  fax:	217-384-8550