viking@iuvax.UUCP (Jon W. Backstrom) (01/29/88)
Since there has been a request for more information, I offer an article that I originally posted three months ago on 'that other board'. This paper details the history and development of hypertext as originally envisioned by Vannevar Bush and defined in the context of a computer implementation by Ted Nelson. Reactions are welcome. Warning! LONG MESSAGE...but hopefully informative. --------- Harnessing the Power of Information Technology: The Genesis of Hypertext Copyright (C) 1987 by Jon W. Backstrom All rights reserved. INTRODUCTION The term 'hypertext' was first coined by Ted Nelson, a computer visionary. In the 1960's, Nelson was responsible for designing computer systems that would allow users to input and manipulate text information (early word processors). Along with systems designed to manipulate text, Nelson began the design of a system that would allow documents to be linked together with pointers forming meaningful associations between various topics. Nelson's early work was inspired by the work of Vannevar Bush, science advisor to Franklin D. Roosevelt. Bush is generally regarded as the Father of Information Science and served as Director of the Office of Scientific Research and Development. In 1945, he authored an article calling for a new relationship between man and the sum of his knowledge: Bush, Vannevar "As We May Think", The Atlantic Monthly, July 1945, pgs. 101-108. DESCRIPTION Nearing the end of World War II, Vannevar Bush wondered which direction science would lead mankind. The scientists who were mobilized for the war effort were finally free to pursue their own interests, rather than work on destructive weapons. A great deal of research was conducted during the war and many documents were created. Bush argued that sifting through and utilizing the record of writings produced by scientific research was a difficult process. The problem was not that too much data was being produced, but the ability to make use of it was hampered by a lack of tools and references which would lead you to any desired information. Information Storage Technology The development of new technologies and fact that complex devices could then be produced cheaply, led Bush to speculate that there were many mechanical aids which could effect a transformation in scientific records. In his words, "The world has arrived at an age of cheap complex devices of great reliability; and something is bound to come of it." A record must be extended, stored, and consulted, if it is to be useful to science. The technology of the day allowed the record to be stored through photographic means. Microfilm was being introduced and developed as a means of storing and compressing the scientific record. The compression and distribution of information was very important to Bush. With the technology of the day, Bush speculated that it would be possible to record the entire Encyclopedia Britannica on a single sheet of film, as big as a normal sheet of paper. Copies of this material could then be reproduced, providing inexpensive access to information. Creating the record is a process of collecting observations and data, extracting parallel material from the existing record, and inserting the new material into the general body of the common record. While there is no substitute for mature thought, Bush predicted great advances in the ability of people to store their ideas. He described mechanical typewriters that would respond to speech, scanners that would read and encode typed text, and the advent of machines that would perform repetitive calculations and other instructed operations at great speed (computers). Bush stated that the process of repetitive thought, as opposed to creative thought, would be greatly aided through the use of these powerful mechanical aids. These aides would help the scientist add his ideas to the record of knowledge and relieve him from having to do mundane tasks. He would then be free to think about those things that require creative thought. Referencing Stored Information After ideas have been manipulated and inserted into the record, there is still the problem of consulting the information. The prime action of use is selection, but there are problems in locating information that relates to one's interest. The time involved in searching through millions of ideas, looking for appropriate information, may cause the scientist to not keep up with the current scene. There are two types of searches: simple selection and selection by class and subclass. Simple selection involves searching all elements of the record and matching specific characteristics. Class and subclass selection involves searching for sequential characteristics that continually narrow the possibilities. A good example of this is the way a telephone exchange operates. With each successive digit that is dialed, the number of possible connections is narrowed and the specific phone line is selected. The problem in finding information is caused by the artificiality of systems of indexing, according to Bush. "When data of any sort are placed in storage, they are filed alphabetically or numerically, and information is found (when it is) by tracing it down to from subclass to subclass." Bush argues that the human mind doesn't work this way. The mind operates by association. When one idea is in mind, the next idea is suggested by an association of thoughts that was forged through previous experience. In the mind, associations that are not used often tend to fade over time, ideas are not fully permanent, and memory is transitory. Despite this, the speed and intricacy of associations is awe-inspiring. Bush proposes that selection by association can be mechanized. One can't hope to match the mind for speed or intricacy, but the associations made would be permanent and clear. In exploring this idea, he discusses a hypothetical mechanized private file and library: the memex. A memex would be a device in which an individual stores all his books, records, and communications. It is mechanized so that it may be consulted with exceeding speed and flexibility. The memex is described as a desk containing stored material on microfilm, projection screens for convenient reading, a keyboard and scanner for the input of information, and a working surface. Bush envisioned that material of all sorts would be available on microfilm: books, pictures, current periodicals, newspapers, etc. In addition, the keyboard and scanner would convert the owner's ideas and materials onto microfilm for inclusion with the stored library of information. If a person wanted to consult a certain book, he would enter a code and the book would be presented on the display screen. The book could then be read or skimmed by the user. At any point, marginal notes or comments may be added to the book. Also, the index is easily accessible and other books may be consulted at the same time on adjacent projection screens. Bush states that the user can then build an association between the two books being displayed on adjacent screens. By pressing a button, an association would be made and indexed. Anytime either reference was accessed in the future, the other could be accessed at the press of a button. Trails between multiple documents could be built and the net effect would be as if you edited a new book, using parts of existing documents. It would then be possible to produce a new microfilm document, composed of information and the trails created by previous associations. This microfilm could then be given to another memex owner and used in his machine, giving him the benefit of your notes, associations, and references. Bush predicted that new forms of encyclopedias would appear, ready-made with a mesh of associative trails running through them, ready to be dropped into a memex and amplified. The user could then access the information, adding notes and new trails associating it with his own present library of information. DISCUSSION The power of Bush's ideas is amazing. Microfilm technology has not advanced to the point where it can be used as described in the memex, but the idea of associatively indexing information may yet be realized through the use of computer technology. Bush's ideas inspired the work of Ted Nelson, who began the design of a system he called 'hypertext'. Hypertext refers to the idea of creating and linking documents with associative trails. The concept works much the same way as Bush's memex, except electronic storage of information is used instead of microfilm technology. The implementation of hypertext has been held up by the high cost of mass storage, however. We are finally at an age where Bush's ideas may be realized. The cost of mass storage systems is falling quickly, computers are being developed with sophisticated graphic displays, and the use of telecommunications allows us to easily copy information from place to place. All of these factors will play a role in the success or failure of hypertext. Optical storage systems can provide a great deal of information for an individual computer user. With a storage capability of 540 million characters, each optical disc in a CD-ROM system can hold the equivalent of three sets of encyclopedias. The ability to read text and graphics data from these devices and display it to the user is currently being demonstrated. Libraries are the major users of this technology, allowing quick searches through large volumes of abstracts and citations. True hypertext cannot be implemented with this technology alone, however, because data cannot be written back onto the optical disk surface. In order to establish true associative links, the user must be able to build his own trails and index other types of documents within the system. Current advances in magneto/optical disc technology promise to provide a solution. These devices would allow about 500 million characters to be stored on a single disc, with the ability to read, write, and delete information. The marriage of CD-ROM systems with magneto/optical devices would allow the user to make an initial copy of the original text and then build his own network of trails and associations, adding additional notes, pictures, and other information to the system over time. The importance of the computer display becomes evident here. Bush envisioned a system with multiple projection screens that would allow the user to view and then link documents together. The ability to display graphics and pictures was inherent to the microfilm media of the memex. Computer displays, however, are not necessarily suited to the task of displaying all types of data. In recent years, advancements in display technology are allowing large screen displays to be built with the ability to display both text and graphics. Graphics can be either simple line drawings or pictures with the resolution of a television signal; this is an important development. The ability to display multiple documents on a screen and have naive users understand what is happening is also important. The development of sophisticated user environments that are intuitive and easy to use will help make hypertext technology accessible to a wider audience. These environments will allow multiple windows to be opened and documents to be viewed without confusing users or demanding a high degree of technical sophistication. This is important if these systems are ever to become pervasive. Bush envisioned that users of the memex would correspond with each other and trade information by mailing microfilm that would then be inserted into another memex. It is becoming possible to transfer large volumes of electronic data quickly and easily. Recent developments in high speed telecommunications equipment are allowing computers to transfer data at increasingly higher speeds and the impending marriage of telephones and computers will have a big impact on the utility of hypertext systems. It may soon be possible to correspond with another person who owns a hypertext system through electronic messages and other data sent directly to his computer over telephone lines that are simultaneously used for voice communication. This would make it possible to talk with someone while your respective computers are busy sending data to each other. The ability of people to quickly share information may be the ultimate legacy of hypertext. Even if two users share the same reference materials, the associations made by one individual can be shared with another who may not have had the same insights. The ability to send pictures and other graphical data will also enhance the communication between people in society. It is important to stress the fact that the associations built by an individual are highly personal. Nelson envisioned a hypertext system where mankind's entire body of knowledge would be stored and exploited. Links and associations would be made by all users of the system and new information would be contributed, extending man's body of knowledge. One problem I see with this idea is that your associations may not necessarily be mine. If all of this data has several threads of association running through it, a person may reach a point where he can no longer navigate the existing trail of thoughts. Too much generalization in the creation of associations may hinder the ability to research topics and remain clear about one's goals. The best hypertext system may be your own: a personal computer with access to large volumes of data, collected over time. You may want to contact centralized systems in order to collect new information and add it to your personal hypertext. New data and associations could be added, but only when you desire them. CONCLUSION Vannevar Bush was an incredible visionary. His ideas concerning associative indexing of information may find realization through the use of computer technology. Computer media can provide many of the useful characteristics envisioned in the microfilm memex: storing and retrieving data, viewing multiple documents at the same time, and transferring that information to other users. Ted Nelson's vision of hypertext is based on Bush's work. Nelson envisions a centralized system where all of man's knowledge is electronically stored and retrieved. Individual systems, enhanced with communications ability, may also emerge. Personal hypertext, providing both local storage of information and access to large centralized volumes of data, may change the way we think and live. REFERENCES Bush, Vannevar "As We May Think", The Atlantic Monthly, July 1945, pgs. 101-108. Byers, T. J. "Built by Association", PC World, April 1987, pgs. 244-251. Nelson, Ted Computer Lib/Dream Machines, Personal Publication, 1974. Nelson, Ted Literary Machines, Personal Publication, 1981. Onosko Timothy "Let There Be Light", Creative Computing, September 1985, pgs. 43-49. -- Jon Backstrom "Yah sure...we gonna have fun, you bet!" Computer Science Department Indiana University ARPA: viking@iuvax.cs.indiana.edu Bloomington, IN 47405 UUCP: {pyramid,ihnp4,pur-ee,rutgers}!iuvax!viking