[comp.ai.philosophy] Machine learning

jones@amarna.gsfc.nasa.gov (JONES, THOMAS) (12/11/90)

Dear comp.ai.philosophy,

The question has been raised as to whether or not we could put a learning or
"reinforcement" algorithm into a machine, perhaps along the lines of Skinner's
concepts, and make the machine *learn* all sorts of neat things without their
having to be programmed in by humans.  This is the oldest, worst idea in AI.
Dozens of attempts have been made to carry this effort through (I myself have
made a dozen or so), essentially without success.  The problem is that *all*
theories of learning in psychology are *unsound.*  For example, Skinner
would have us believe that, if an organism is rewarded for doing a certain
action in a certain situation, then he/she/it will become more likely to
perform the action in question in "similar" situations.  Horsefeathers!  What
is a similar action?  What is a similar situation?  All sorts of heavy
machinery has been swept under the rug and labeled "similarity."
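
To make the objection concrete, here is a minimal sketch (purely my own
illustration; the feature sets, learning rate, and the similarity measure are
all made up) of the kind of update a Skinnerian account suggests.  Notice that
every hard question has been pushed into the hand-written similarity function:

from collections import defaultdict

def similarity(situation_a, situation_b):
    """Hand-coded overlap between two situations, each a set of features.
    This function is exactly the 'heavy machinery' in question."""
    if not situation_a or not situation_b:
        return 0.0
    return len(situation_a & situation_b) / len(situation_a | situation_b)

# propensity[(situation, action)] -> tendency to emit that action there
propensity = defaultdict(float)

def reinforce(situation, action, reward, known_situations, rate=0.1):
    """Skinner-style update: spread the reward to all 'similar' situations."""
    for other in known_situations:
        propensity[(other, action)] += rate * reward * similarity(situation, other)

# Illustrative only: reward in one situation raises the propensity in a
# partially overlapping one, purely because similarity() says they overlap.
s1 = frozenset({"light_on", "tone", "hungry"})
s2 = frozenset({"light_on", "hungry"})
reinforce(s1, "press_lever", reward=1.0, known_situations=[s1, s2])
print(propensity[(s2, "press_lever")])

The update itself is trivial; deciding what similarity() ought to compute is
the whole problem.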

    From the above it might be concluded that I am opposed to machine learning
in general.  On the contrary, I consider it one of the most important areas
of AI.  One bad habit which afflicts learning research is the failure to
distinguish between that which the machine can legitimately learn for itself,
and that which the human programmers jolly well better program in by hand.
For example, I doubt very much if a machine could do more than a few 
rudimentary things without the concept of a *subroutine hierarchy* (or the
related GPS goal tree).  Hence I believe that the machine should have
software for building up, testing, and using subroutine hierarchies on
its own.  But can the machine *invent* subroutine hierarchies?  I doubt it.
Much of the *nerve net* research is marred by the failure to make this
distinction.  My experience with learning codes is that you start by working
out just how the performance program is to look and to operate.
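
To be concrete, here is a toy sketch (my own illustration, not code from the
references below) of the sort of hand-built subroutine hierarchy or GPS-style
goal tree I have in mind.  The programmer writes the decomposition; the
machine's job is to build up, test, and run it:

class Goal:
    def __init__(self, name, subgoals=None, action=None):
        self.name = name
        self.subgoals = subgoals or []   # ordered child goals
        self.action = action             # primitive routine, if this is a leaf

    def achieve(self):
        """Depth-first execution: satisfy each subgoal, then run own action."""
        for sub in self.subgoals:
            sub.achieve()
        if self.action is not None:
            self.action()

# Hypothetical example hierarchy, written by hand rather than learned:
make_tea = Goal("make tea", subgoals=[
    Goal("boil water", action=lambda: print("boiling water")),
    Goal("steep leaves", subgoals=[
        Goal("add leaves", action=lambda: print("adding leaves")),
        Goal("wait", action=lambda: print("waiting")),
    ]),
])

make_tea.achieve()

The open question above is whether a machine could invent such a decomposition
for itself, rather than merely execute one supplied by the programmer.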

References:

Jones, Thomas L., "A Computer Model of Simple Forms of Learning," MIT Ph.D.
thesis, September, 1970.

Jones, Thomas L., "A Computer Model of Simple Forms of Learning in Infants,"
Proc. AFIPS 1972 Spring Joint Computer Conference.

Tom Jones

All opinions are my own.

greenba@gambia.crd.ge.com (ben a green) (12/12/90)

In article <4158@dftsrv.gsfc.nasa.gov> jones@amarna.gsfc.nasa.gov (JONES, THOMAS) writes:
   The question has been raised as to whether or not we could put a learning or
   "reinforcement" algorithm into a machine, perhaps along the lines of Skinner's
   concepts, and make the machine *learn* all sorts of neat things without their
   having to be programmed in by humans.  This is the oldest, worst idea in AI.
   Dozens of attempts have been made to carry this effort through (I myself have
   made a dozen or so), essentially without success.  The problem is that *all*
   theories of learning in psychology are *unsound.*  For example, Skinner
   would have us believe that, if an organism is rewarded for doing a certain
   action in a certain situation, then he/she/it will become more likely to
   perform the action in question in "similar" situations.  Horsefeathers!  What
   is a similar action?  What is a similar situation?  All sorts of heavy
   machinery has been swept under the rug and labeled "similarity."

An excellent point, but I haven't given up trying yet. 

First, we have to choose the right level of description. Subroutines (mentioned
by Tom later in his post) are too low. My choice is based on a robot with, say,
a 0.1 sec clock cycle, which emits behavior (motor control signals) at that rate.

The problem of similarity is, indeed, swept under the rug by Skinner and all
exponents of his ideas I have read. My robot will deal with it in the following
way. First, invert the concept to that of _dissimilarity_. We need a map
from a pair of environmental vectors to a scalar dissimilarity, which can
be thought of as a metric in environmental space.

It will have to be a plastic map, since discrimination training increases
dissimilarity between previously similar environments. Dissimilarity
increases with differential reinforcement. (Think of a Spanish speaker
learning to distinguish "b" and "v".)

My suggestion: Initialize the mapping to near zero and stretch the space
as a result of reinforcement. The volume of discriminable dissimilarity
starts out near zero and expands rapidly in the first hours of the
robot's life.

It's the big bang theory of perception.
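
For the concretely minded, a rough sketch (illustrative code only; the feature
vectors, the learning rate, and the particular weighted-distance form are my
own assumptions) of the plastic metric:

import math

class PlasticMetric:
    def __init__(self, n_features, initial_weight=1e-3):
        # Near-zero weights: at "birth" almost everything looks the same.
        self.weights = [initial_weight] * n_features

    def dissimilarity(self, env_a, env_b):
        """Scalar dissimilarity between two environmental vectors."""
        return math.sqrt(sum(w * (a - b) ** 2
                             for w, a, b in zip(self.weights, env_a, env_b)))

    def discriminate(self, env_a, env_b, reward_gap, rate=0.5):
        """Differential reinforcement (different outcomes in env_a vs. env_b)
        stretches the space along the dimensions where the two differ."""
        for i, (a, b) in enumerate(zip(env_a, env_b)):
            self.weights[i] += rate * abs(reward_gap) * (a - b) ** 2

# Hypothetical usage: the 'b'/'v' example -- two acoustic patterns that start
# out nearly indistinguishable become separated after differential reward.
metric = PlasticMetric(n_features=3)
b_sound, v_sound = [1.0, 0.2, 0.0], [1.0, 0.8, 0.0]
print(metric.dissimilarity(b_sound, v_sound))   # tiny at first
for _ in range(20):
    metric.discriminate(b_sound, v_sound, reward_gap=1.0)
print(metric.dissimilarity(b_sound, v_sound))   # much larger after training

The particular stretching rule is arbitrary; the point is only that the
metric itself, not just the behavior, is what gets trained.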

--
Ben A. Green, Jr.              
greenba@crd.ge.com
  Speaking only for myself, of course.

smoliar@vaxa.isi.edu (Stephen Smoliar) (12/12/90)

In article <4158@dftsrv.gsfc.nasa.gov> jones@amarna.gsfc.nasa.gov writes:
>  One bad habit which afflicts learning research is the failure to
>distinguish between that which the machine can legitimately learn for itself,
>and that which the human programmers jolly well better program in by hand.

There is an even worse habit which Minsky discusses in THE SOCIETY OF MIND:

		The problem is that we use the single word "learning"
	to cover too diverse a society of ideas.  Such a word can be
	useful in the title of a book, or in the name of an institution.
	But when it comes to studying the subject itself, we need more
	distinctive terms for important, different ways to learn.

Minsky then goes on to propose some of these terms, not all of which I am sure
I agree with;  and I suspect I could think up some more given the time.  The
point is that, like intelligence itself, we assume that anything that can be
captured in a single word can, somehow or another, be implemented in code.
Anything which counts as a result in machine learning has involved results
in a very narrow, highly specific scope.  Unfortunately, rather than trying
to explore the nature of that scope (let alone consider how it might interact
with other, equally narrow scopes), researchers are forever tempted to
advertise their results as advances in "machine learning," a claim which
lends little to our understanding of just what they have achieved.  If we
had less inflation of accomplishment, we might discover that our achievements
are not as weak as they tend to appear.

=========================================================================

USPS:	Stephen Smoliar
	5000 Centinela Avenue  #129
	Los Angeles, California  90066

Internet:  smoliar@vaxa.isi.edu

"It's only words . . . unless they're true."--David Mamet

powers@uklirb.informatik.uni-kl.de (David Powers ) (12/13/90)

Wow!  I don't know where to start and I haven't got the time for a
full analysis of Machine Learning vis-a-vis Human Learning
vis-a-vis Hand Coding.  But there's more to learning than meets the
eye - that much is agreed.

>In article <4158@dftsrv.gsfc.nasa.gov> jones@amarna.gsfc.nasa.gov writes:

>The question has been raised as to whether or not we could put a learning or
>"reinforcement" algorithm into a machine, perhaps along the lines of Skinner's
>concepts, and make the machine *learn* all sorts of neat things without their
>having to be programmed in by humans.  This is the oldest, worst idea in AI.
>Dozens of attempts have been made to carry this effort through (I myself have
>made a dozen or so), essentially without success.  The problem is that *all*
>theories of learning in psychology are *unsound.*  For example, Skinner
>would have us believe that, if an organism is rewarded for doing a certain
>action in a certain situation, then he/she/it will become more likely to
>perform the action in question in "similar" situations.  Horsefeathers!  What
>is a similar action?  What is a similar situation?  All sorts of heavy
>machinery has been swept under the rug and labeled "similarity."

Similarity (or Metaphor) is one of the most important concepts in Learning,
and in Science for that matter.  Nothing is ever the same as anything else.
Even perceptions of the same object at different times are different.
So classification of similar things is the first and major step in much of
Machine Learning, and metaphor is actually the outworking of the same
ubiquitous phenomenon in our use of language.
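
As a toy illustration of that first step (a sketch of my own, with made-up
numbers and a crude threshold rule), even a trivial cut-off on a dissimilarity
measure will lump non-identical perceptions into classes:

def distance(x, y):
    return sum((a - b) ** 2 for a, b in zip(x, y)) ** 0.5

def classify(perceptions, threshold=0.5):
    """Assign each perception to the first existing class whose prototype
    is within `threshold`; otherwise start a new class."""
    prototypes = []          # one representative perception per class
    labels = []
    for p in perceptions:
        for i, proto in enumerate(prototypes):
            if distance(p, proto) < threshold:
                labels.append(i)
                break
        else:
            prototypes.append(p)
            labels.append(len(prototypes) - 1)
    return labels

# Three slightly different views of "the same" object, plus one different one:
views = [(1.0, 1.0), (1.1, 0.9), (0.95, 1.05), (5.0, 5.0)]
print(classify(views))   # [0, 0, 0, 1] -- similar views share a class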

Theories of learning which are precomputational are not intended to be
complete in the sense that they dot every i and cross every t necessary to
code them into a learning program.  But they can be used, and the empirical
work lying behind them can be reinterpreted and used, to guide and inspire
computational theories of learning.  I personally agree that some aspects
of the dogmatics of certain of the greats of psychology, linguistics and
psycholinguistics are misdirected.  But who's perfect?  The real problem
lies with the blind followers who recognize the fundamental truths their
mentors exposed, but swallow blindly the inessential baggage as well.
No wonder we get indigestion when we try to do some useful work!

>    From the above it might be concluded that I am opposed to machine learning
>in general.  On the contrary, I consider it one of the most important areas
>of AI.  One bad habit which afflicts learning research is the failure to
>distinguish between that which the machine can legitimately learn for itself,
>and that which the human programmers jolly well better program in by hand.
>For example, I doubt very much if a machine could do more than a few 
>rudimentary things without the concept of a *subroutine hierarchy* (or the
>related GPS goal tree).  Hence I believe that the machine should have
>software for building up, testing, and using subroutine hierarchies on
>its own.  But can the machine *invent* subroutine hierarchies?  I doubt it.
>Much of the *nerve net* research is marred by the failure to make this
>distinction.  My experience with learning codes is that you start by working
>out just how the performance program is to look and to operate.

"the failure to distinguish" is actually a consequence of "a failure to
examine".  The old maxim "you can only learn what you almost already know"
is really fundamental.  And as your system bootstraps itself from one level
to the next - which may not be very far away - A. you need to look for the
right techniques and the correct characterization of the prerequisites for
this learning (including teacher - what sort of examples, critic - what
sort of feedback, input - what sort of features, ...), and B. you must
expect that you will only achieve bootstrapping to a level which is not
that far removed from where you started.  After all, "you can only learn..."
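
As a purely schematic sketch of those prerequisites (my own wiring, with
invented toy data, not an algorithm from my book), consider a learner that
moves only one small step beyond what its teacher, critic and feature
extractor already supply:

def teacher():
    """What sort of examples: (raw input, intended label) pairs."""
    yield ("big red ball", "ball")
    yield ("small red ball", "ball")
    yield ("big red box", "box")

def features(raw):
    """What sort of features: here, simply the words."""
    return set(raw.split())

def critic(guess, intended):
    """What sort of feedback: right or wrong, nothing richer."""
    return guess == intended

def bootstrap():
    evidence = {}   # label -> features seen alongside it so far
    for raw, intended in teacher():
        feats = features(raw)
        # Guess the known label whose evidence overlaps these features most.
        guess = max(evidence, key=lambda lbl: len(evidence[lbl] & feats),
                    default=None)
        if not critic(guess, intended):
            # One small step beyond what was almost already known.
            evidence.setdefault(intended, set()).update(feats)
    return evidence

print(bootstrap())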


smoliar@vaxa.isi.edu (Stephen Smoliar) writes:

>There is an even worse habit which Minsky discusses in THE SOCIETY OF MIND:

>		The problem is that we use the single word "learning"
>	to cover too diverse a society of ideas.  Such a word can be
>	useful in the title of a book, or in the name of an institution.
>	But when it comes to studying the subject itself, we need more
>	distinctive terms for important, different ways to learn.

>Minsky then goes on to propose some of these terms, not all of which I am sure
>I agree with;  and I suspect I could think up some more given the time.  The
>point is that, like intelligence itself, we assume that anything that can be
>captured in a single word can, somehow or another, be implemented in code.
>Anything which counts as a result in machine learning has involved results
>in a very narrow, highly specific scope.  Unfortunately, rather than trying
>to explore the nature of that scope (let alone consider how it might interact
>with other, equally narrow scopes), researchers are forever tempted to
>advertise their results as advances in "machine learning," a claim which
>lends little to our understanding of just what they have achieved.  If we
>had less inflation of accomplishment, we might discover that our achievements
>are not as weak as they tend to appear.

Here I tend to agree more with the spirit of the comment.  And not only may
our "weak" achievements be more significant than they appear; in recent times
people have also tended to apply the "strongest" techniques they can to the
learning problem, in an attempt to make the "strongest" achievement, or
"biggest" jump in the level of complexity.  In fact, it is helpful to consider
the inherent structure of what we are learning and what is the weakest form of
learning we can use.  Some of the classical results about what cannot be
learnt are applicable to classes far more general and less restricted than
those where we actually want to learn.  We need to understand the
restrictions, and relate them to the observed phenomena, classes, learning
paradigms, etc.

We do, of course, have names for some different types of learning.  No doubt
we need to develop and characterize more.  And I have had some success in
machine learning in several domains, and in relating the restrictions of
the "language" to be learned, the appropriate learning algorithm, the
"psychological" correlates, and the "linguistic" classes and rules.

I won't list my book and other references here, but feel free to write for
a bibliography and/or a paper.

David Powers
------------------------------------------------------------------------
David Powers		 +49-631/205-3449 (Uni);  +49-631/205-3200 (Fax)
FB Informatik		powers@informatik.uni-kl.de; +49-631/13786 (Prv)
Univ Kaiserslautern	 * COMPULOG - Language and Logic
6750 KAISERSLAUTERN	 * MARPIA   - Concurrent Logic Programming
WEST GERMANY		 * STANLIE  - Natural Language Learning

Riddle:		What is the difference between the university and me.
Disclaimer:	My opinion.