[net.ai] Taxonomy Assistant

MANN@USC-ISIB.ARPA (08/17/84)

From:  Bill Mann <MANN@USC-ISIB.ARPA>

I would like to have some sort of computational aid for creating taxonomies.

In trying to understand a collection of objects or data, often one of
the most helpful things to do is to create a taxonomy of it.  Comparing
and classifying things makes one think about their attributes and how
they relate.  It also helps identify potential varieties of objects that
are "missing."

Often several attempts are required before a satisfactory result is
achieved, which can involve a lot of bookkeeping and an overwhelming
amount of detail, so much that significant patterns are missed.

Also, there are skills for doing taxonomies, and I don't have them all.

For all these reasons, it would be good to embed a lot of the support
operations for creating a taxonomy in a program, one that would let the
machine do bookkeeping, systematic evocation of data, consistency
checking and some pattern identification, but still leave me in charge.
(Perhaps it's already been done.)

What sorts of tools are out there?  Is this already embedded in some
collection of intellectual prosthetics?  Where should I look for such
programs?

Bill Mann

[There are indeed tools for creating numerical taxonomies --- see the
documentation for cluster analysis programs in statistical packages
such as BMD, SPSS, SAS, etc.  For other leads I would suggest the
Pattern Recognition journal, the seven (massive) IEEE conferences on
pattern recognition, and the Classification Society (c/o Dr. George
W. Furnas, Room 2C-572, Bell Communications Research, Inc., Murray
Hill, NJ 07974).  Can anyone suggest available software for nonnumeric
taxonomy construction or for handling the associated bookkeeping?  -- KIL]

MDC.WAYNE%MIT-OZ@MIT-MC.ARPA (08/25/84)

From:  Wayne McGuire <MDC.WAYNE%MIT-OZ@MIT-MC.ARPA>

     The problems of systematically representing the conceptual
relations among a set of abstract objects in any knowledge domain go
right to the heart of much leading-edge AI research. All inferencing
is based, among other things, on implicit taxonomic understanding.

     It seems to me that in the knowledgebase management systems which
I hope we will see developed in the near future will be embedded rich
resources for evoking and representing taxonomies. Semantic nets
provide an ideal scheme with which to do just that.

     The most useful thinking about taxonomies and classification
theory appears not in the computer science literature, but at the
interface of library science, information science, and philosophy. The
leading journal in the field is _International Classification_ (which
should be available in any world class humanities library). It is
published (as I recall) three times a year, and is chockfull of
pointers to articles, books, dissertations, etc. in the world
literature on all aspects of classification theory.

     You might want to scan the following subject headings in some
recent editions of the index _Library Literature_ (published by H. W.
Wilson): Classification analysis, Subject headings, Thesauri. File 57
(Philosopher's Index) and File 61 (LISA -- Library and Information
Science Abstracts) on Dialog are also fertile sources of information
on the literature about taxonomies and classification theory. There
are many insights in the theoretical writings on classification theory
in the library science literature which could be handily transferred
to advanced AI research and systems.

     It occurs to me that what we need is a _meta-taxonomy_, that is,
a thorough inventory of all the fundamental conceptual structures by
which objects in _any_ domain can be taxonomically related.

     One way a taxonomy assistant might operate is to combine each and
every significant term in a knowledge domain with every other term,
and offer a list of possible relations with which to tag each offered
matching set. Someday, perhaps, we will be able to buy off the shelf
"taxonomy packs" (dynamic thesauri) in many domains.

-- Wayne --