cjoslyn@bingvaxu.cc.binghamton.edu (Cliff Joslyn) (01/06/90)
I have in mind something very similar to what is being discussed here. The general need is obvious, although we all have our special purposes. The following is mine. I'm about ready to kludge something together with email, TeX macros, and shell scripts, but perhaps we can all come up with something better. Contact me for more information. _THE NECESSITY OF A NEW TOOL FOR PHILOSOPHICAL DEVELOPMENT_ Cliff Joslyn Systems Science SUNY Binghamton Copyright 1990 Cliff Joslyn Past development of philosophical texts and philosophical systems rested on existing technology: paper and ink. This medium was conducive to the construction of linear documents, written and read from the begining to the end. While the size of these linear forms could be greater or lesser, from long treatises to short paragraphs or sections [e.g. Aristotle 1943, Wittgenstein 1953], over the years this linear form has been maintained. In recent years dictionaries, encyclopedias, and other reference works have been developed which partially introduce non-linear structures through internal references [e.g. Edwards ed. 1967, Krippendorf 1986, Flew 1979, Raymond and Topa 1988, Kluwer 1988, Wiley 1979]. Some have made halting efforts in the direction of non-linear documents [e.g. Minsky 1986], and others have used pictures to aid in understanding [von Foerster 1971, Abraham and Shaw 1985]. And certainly formal systems (mathematics and logical notations) have given the ability to construct large, complex linguistic systems. But to date, individual scholars working in natural language have been essentially limitted to traditional development forms. The development of large, complex systems of philosophical thought in non-formal domains have been difficult, and collaborative work on such systems next to impossible. Scholars who wish to develop such systems of thought require a new tool. In particular, the Principia Cybernetica project [Joslyn and Turchin 1990, Turchin and Joslyn 1989] demmands the use, and possibly the development, of a computer-based tool to provide a new form of linguistic expression. While this project will use such a tool for specifically philosophical development, a tool is envisioned that could be used to support scholarly and scientific writing, research, and development in general. CRITERIA OF THE PRINCIPIA CYBERNETIC PROJECT The following is a partial list of desiderata for the "Principia Cybernetica" project: 1. For a group of researchers, perhaps not all geographically close, to collaboratively develop a system of philosophy, where philosophy is taken in the general sense of clear and consistent language about ideas and concepts; 2. To allow these researchers different levels of access to the system according to their role in the project development; 3. To produce a system of philosophy that can develop dynamically over time, with continuing refinement and expansion; 4. For the system of philosophy to fully reflect and incorporate the semantic relations inherent among the terms being explicated; 5. To allow the explication of terms and senses of terms, and to unify and synthesize notations and terminology among researchers in different disciplines; 6. To support the process of argument and dialog among experts toward the end of consensus at the level of the meanings of words and the relations among those meanings; 7. To support the publication of intermediate and final stages of parts or the whole of the philosophical system; 8. To support bibliographical and historical reference; 9. To support mathematical notation and the easy movement among natural language, formal language, and mathematics; 10. To allow researchers to develop or read the philosophical system in various orders and in various degrees of depth or specificity; 11. To allow access to the system for both participants who wish to author and users who wish to read, browse, or study; 12. To support the publication of various special-purpose documents, including dictionaries, encyclopedias, texts on a subject, reference pages, essays, dialogs on a subject, or "streams of consciuosness"; 13. To allow the representation and utilization of knowledge in both its breadth and its depth. ON FORM AND CONTENT There is a mutual affect between the form of an expression (the medium, or syntactic language) and its content (the message, or semantic meaning): the existence of linguistic forms constrains and allows the possible expressions; and simultaneously the desire to express additional meanings demands the changing of the language. The form and content of linguistic systems evolve together, and such a tool will evolve with the system of philosophy itself. A complete implementation will not be possible before beginning development of the philosophy. However, an ideal final state for such a tool can be envisioned at this time, and a minimal implementation with existing technology is possible with relatively little effort. THE NATURE OF SUCH A TOOL _Logical Structure_ Define a semantic network S = ( N, R ) [Rich 1983 pp. 213, Shastri 1988, Evens ed. 1988, Brachman 1977]. Define a set of nodes N = { ni }, each of a specific node type. Node types would certainly include traditional document units like words, sentences, and paragraphs. But also included would be special document units like bibliographical references and equations, and other types like pictures, moving pictures, sounds. Nodes can be complex, that is any ni can be composed of further nij, for example a paragraph composed of a number of sentences and a picture. Further define a set of relations R = { Rj }, where each Rj is in the full cross product of all the ni. When ni1 Rj ni2 we say that there is a link of type j from node i1 to node i2. The Rj represent semantic relations among the nodes. For example, one relation R1 could be that of definitions, where n1 R1 n2 indicates that the term represented by node n1 is defined in node n2. Then the inverse relation R1-1 would indicate those nodes referencing that definition. An example would be where the term _entropy_ is defined in using the terms _probability_ and _summation_; or, where _Shannon entropy_ is defined using _entropy_. But at the same time, a link to _thermodynamic entropy_ and _irreversible processes_ would be established, thus maintaining full semantic linkage throughout the system. Other examples of semantic relations could include R2 being simple reference, where n1 R2 n2 indicates that n1 is referenced in the node n2. Other examples include implication and/or generalization (e.g. probability to Dempster-Shafer theory, Newtonian to relativistic equations of motion), historical precedence, bibliographical reference, alphabetical order, author groupings, etc. _Complexity of the Semantic System_ One point that deserves consideration is the overall complexity of the network and the relative number of nodes and relations, |N| and |R|. One criticism that is made of semantic networks is the necessity to maintain a large number of types of links in order to reflect the semantic richness of natural language and philosophical concepts. It is claimed that this makes semantic networks confusing to implement and manage, because of the great number of links involved, allowing authors and browsers to "get lost in hyperspace" [e.g. Brockman 1989]. However, the maximum number of links in our system S is: |R| ( |N|2 - |N| ), which grows faster with the number of nodes N than with the number of link types R. Further, it has recently been recognized [e.g. Kanerva 1988] that complex systems are characterized by a relatively high dimensionality |N| and relatively low cardinality |R|. This makes such semantic nets systems in that class, and makes the repertoire of tools developed for the management of complex systems available for this project. But in any event, it will be certainly be necessary to carefully maintain an appropriate balance between these quantities |R| and |N|, and a relatively sparsely populated network to preserve tractibility. _Uses of the Relations_ Since each of the relations Rj represents a distinct semantic category, they can be very useful for accomplishing specific ends while using the network. For example, when the Rj are partial orderings, then it is possible to recognize hierarchical relations among the nodes. This would be the case, for example, for the relations of definition or entailment. This would allow users and authors to ascend manage. Linear orderings may also be available, for example alphabetical or chronological order. When the whole network is projected through one or more of the various relations, it is possible to view the network or produce specific documents of a specific form. In this way, semantic networks can be used to produce the following kinds of special systems: 1) Encyclopedias (alphabetical order by subject, deep content) 2) Dictionaries (alphabetical order by subject, shallow content) 3) Bibliographies (alphabetical order by author) 4) Histories (chronological order) 5) Reference notebooks (subject order, shallow content) 6) Textbooks (subject order, deep content) 7) Subject-oriented expositions (ad-hoc order) 8) Development of arguments on specific themes (ad-hoc order) _Existing Similar Work_ Reference works in Systems and Cybernetics currently exist [Singh 1987, Krippendorff 1986, Ashby 1965] and others [Don McNeil, Len Troncale, Carl Slawski] have approached this issue in the past. The whole or portions of these existing works can be extended into semantic network structures, as can or have other existing reference works [e.g. Flew 1979, Raymond and Topa 1988, Kluwer 1988, Wiley 1979]. We will actively pursue the cooperation of these authors and editors. IMPLEMENTING THE TOOL Beyond considering the logical design of this tool, there is the task of implementation. _Necessary Features_ It will be necessary for any implementation of the tool to include the following: 1) _Electronic Mail_ Since participants in the project live in different places, it will be necessary for them to use telecommunications to carry out development. This includes the use of one or more of the international computer-based telecommunications networks (e.g. BITNET, INTERNET, USENET). Portions of the semantic network can be developed by individuals at one site and them distributed to the others, or to a central site, using email. The actual operation of the network should be available on a remote-control basis by users at other sites. 2) _Portable Implementation and Hybrid Environments_ The participants in the project will also be based on different computer systems using different operating systems (e.g. UNIX, VMS, CMS, DOS) and display and printer devices. Thus it will be necessary for the tool to work in these different hardware and portability and adherence to industry standards will be critical. Some participants may have trouble accessing computer systems at all, and so at least a rudimentary mixed electronic/print capability is needed. 3) _TeX Output_ Since it is intended to be able to submit either a projection of the network or the whole of the network at any time for publication, it will be necessary to produce documents that publishers will accept. It will further be necessary to be able to format the resulting documents in a variety of ways, for local, high quality printing at participant sites or for electronic submission to publishers. The current industry standard for these capabilities is the TeX electronic typesetting language [Knuth 1979]. TeX files use a plain ASCII format, and TeX is implemented for a great many operating environments. This allows maximum data portability across environments. TeX also provides full support for the formatting of mathematical equations and, using the BibTeX extension, of bibliographies. 4) _Version Control_ As nodes in the network are developed by multiple authors, it will be necessary to track their change [see also Irish and Trigg 1989]. This will allow contributors to see the changes made by others, and the recovery of previous work. Version control can either be done on the basis of maintaining a history of the nodes in the form of differences between various versions (as is common today for maintaining software source code), or in terms of a "markup language" [see ISO 1986] which maintains editing and revision marks in the body of a text file. _Currently Available Software_ Of course, the desirability of such tools to support the construction of knowledge systems and scholarly activity has been seen for many decades. Beginning in the 1960's articles and books have been written and software systems have been developed to fill these needs. Today there are a great many tools and systems available [e.g. Kimura 1986, Seiler 1989]. Also, other people are pursuing projects which are quite similar to the system envisioned. For example, the Matrix of Biological Knowledge is an effort to create a distributed, coherent knowledge system for the biological sciences [see _The Matrix of _ _Biological Knowledge Newsletter_, edited by Dan Davison, Theoretical Biology, T-10 MS K-719, Los Alamos National Laboratory, Los Alamos, NM 87545]. But by far the technology with the greatest applicability to the current problem is that of Hypertext and Hypermedia [e.g. Smith and Weiss 1988, Kahn 1988, Kahn and Meyrowitz 1988, Shneiderman 1989, Conklin 1987, Louie 1989, and see the journals _Hyperage_ and _Hypermedia_] originated by Ted Nelson [1965]. A true Hypermedia system is very close to the ideal logical form described above. Thus the first step in implementing this tool is the investigation of existing Hypermedia implementations. That task has just begun, but the following is a list of commercially available systems: Company Name System Name ----------- ----------- OWL Guide Xanadu Xanadu Brainpower ArchiText Software Aristry Hypertext Programming Library Texas Instruments HyperTRANS Neil Larson Houdini Symantec Grandview MetaGeneric Hypertext Viewer Scribe KMS MCC Planetext Apple Computer Hypercard Xerox NoteCards Brown University Intermedia Cognetics HyperTIES SNeps Each of these products has different strengths and weaknesses, satisfying to varying degrees the necessary conditions above. To the extent that I have investigated them, none seems ideal or even sufficient. And all of them cost money. _Minimal Immediate Implementation_ It would, however, be possible to implement a preliminary (and rudimentary) form of the tool at the present time. The basis for such an implementation would be the UNIX operating system available at the SUNY Binghamton computer center. Nodes would exist as individual text files written in TeX format. Such files would be fully available to electronic mail, and version control is available through the RCS Revision Control System. Custom software would be written to maintain the relations among the nodes. The software would be written in ANSI C for maximum portability, mixed with UNIX system software for text manipulation (e.g. the 'grep' and 'awk' programs). Ideally, links would be indicated by special macros written in TeX and then managed by the custom software. Implementation of a full-fledged hypertext development environment would be a very large task, so such an implementation is seen as a temporary stage, until a sufficiently powerful tool was available commercially and there was a budget available to afford it. The extent of the customer software would depend mostly on the time available for design and implementation. It will probably require a fair amount of time to operate the software, including formatting of documents and manual management of version control and operation of the software system. However, the implementation must be sufficiently general to allow whatever structure exists to be importable by the commercial product once it is available. REFERENCES Abraham, Ralph, and Shaw, Chris: (1985) /Dynamics: the Geometry of Behavior/, v. I-III, Ariel Press Aristotle, : (1943) /On Man in the Universe/, Walter J. Black Inc., Roslyn NY Ashby, Ross: (1956) /Introduction to Cybernetics/, Methuen, London Barrett, Edward: ed. (1989) /Soc. of Text: Hypertext Hypermedia + Soc. Const.of Info/, MIT Press, Cambridge Brachman, Ron J: (1977) "What's in a Concept: Struc. Found. for Semantic Nets", /Int. J. Man-Machine Studies/, v. 9, pp. 127-152 Brockman, R John, and Horton, W et. al.: (1989) "From Database to Hypertext via Electronic Publishing", in: /Society of Text/, ed. Edward Barret, pp. 162-206, MIT Press, Cambridge Conklin, Jeff: (1987) "Hypertext: An Introduction and Survey", /IEEE Computer/, v. 20:9, pp. 465-472 Edwards, Paul: ed. (1967) /Encyclopedia of Philosophy/, v. 1-8, MacMillan, New York Evens, Martha W: ed. (1988) /Relational Models of the Lexicon/, Cambridge U, Cambridge Flew, Antony: ed. (1979) /Dictionary of Philosophy/, Pan Books, London Irish, Peggy M, and Trigg, Randall H: (1989) "Supporting Collaboration in Hypermedia", in: /Society of Text/, ed. Edward Barret, pp. 90-106, MIT Press, Cambridge ISO: (1986) /Text + Office Sys.: Standard Gen. Markup Language(SGML)/ Joslyn, Cliff: (1990) "Necessity of a New Tool for Philosophical Development", NOTE: To be published Joslyn, Cliff, and Turchin, Valentin: (1990) "Introduction to the Principia Cybernetic Project" Kahn, Paul: (1988) /Linking Together Books/, in: /IRIS TR # 88-8/ Kahn, Paul, and Meyrowitz, Norman: (1988) /Guide, HyperCard, and Intermedia/, in: /IRIS TR # 88-7/, NOTE: Inst. for Res. in Inf. + Schol Kanerva, Penti: (1988) /Sparse Distributed Memory/, MIT Press, Cambridge Kimura, GD: (1986) "Structure Editor for Abstract Document Objects", /IEEE Trans. Software Engineering/, v. 12:3, pp. 417-435 Kluwer Publications: (1988) /Encyclopedia of Mathematics/, Hingham MA Knuth, Donald: (1979) /TeX and Metafont: New Directions in Typesetting/, Digital Press, Bedford MA Krippendorf, Klaus: (1986) /Dictionary of Cybernetics/, NOTE: Available through ASC Louie, S, and Rubeck, F: (May 89) "Hypertext Publishing and theRevitalization of Knowledge", /Academic Computing/, pp. 20-23, NOTE: SS lib Minsky, Marvin: (1986) /Society of Mind/, Simon and Schuster, New York Nelson, Theodor H: (1965) "The Hypertext", in: /Proc. Int. Documentation Fed. Conf./ Raymond, Darrell R, and Tompa, Frank W: (1988) "Hypertext and the New Oxford Dictionary", /Communications of the ACM/, v. 31:7, pp. 871-879, Rich, Elaine: (1983) /Artificial Intelligence/, McGraw-Hill, New York, Seiler, Lauren: (1989) "Future of the Scholarly Journal", /Academic Computing/, v. September, pp. 14 Shastri, Lokendra: (1988) /Semantic Networks/, Morgan Kaufman, Los Angeles Shneiderman, Ben: (1989) "Reflections on Authoring, Editing + Managing Hypertext", in: /Society of Text/, ed. Edward Barret, pp. 115-131, MIT Press, Cambridge Singh, MG: ed. (1987) /Systems and Control Encyclopedia/, Pergamon, Oxford Smith, John B, and Weiss, Stephen F: (1988) "Overview of Hypertext", /Communications of the ACM/, v. 31:7, pp. 816-819 Turchin, Valentin, and Joslyn, Cliff: (1989) "Cybernetic Manifesto", to be published von Foerster, Heinz: (1979) /Cybernetics of Cybernetics/, ed. K. Krippendorf, Gordon+Breach, New York Wiley Publications: (1979) /Citation Indexes/, New York Wittgenstein, Ludwig: (1958) /Philosophical Investigations/, MacMillan, New York -- O-------------------------------------------------------------------------> | Cliff Joslyn, Cybernetician at Large, cjoslyn@bingvaxu.cc.binghamton.edu | Systems Science, SUNY Binghamton, Binghamton NY 13901, USA V All the world is biscuit shaped. . .
UH2@PSUVM.BITNET (Lee Sailer) (01/06/90)
I'd like to thank Cliff for his long exposition, and make a few comments. The idea of HyperText is very appealing, but as Cliff says, its implementation on the grand scale will be long and difficult. it seems unlikely to me that any attempt to analyze, design, and construct such a system will fail. At the same time, such systems will evolve, borrowing ideas here and there till such capabilities are widely available. By analogy, I offer our experience with electronic mail, where despite numerous attempts to offer its capabilities to the general public, it is finally something like FAX that really takes off. My second point is that a general purpose distributed semantic network architecture is not the hard part. The hard part is deciding what to put in the general purpose distributed semantic network. To make these decisions people will have to debate endlessly on the merits of this or that approach. That's the point of the previous groupware thread---how can we create an efficient discussion and decision making environment? It seems to me that there are two tracks evolving. On is to take a system like email or UseNet and try to add more structure to it, and the other is to take a system like HyperText and try to make it more dynamic so that is can manage discussions and debates as well as archival knowledge. Fun, eh? lee