hwb@beach.cis.ufl.edu (Howard Beck) (12/07/90)
I am compiling a report on the relationship between SGML and databases. Any information anyone can provide on the following topics would be greatly appreciated:

1. Justification for using either a customized or general-purpose semantic data model or an object-oriented database to store fragments of documents tagged using SGML or any other markup language. Also, query languages for such databases.

2. Software which extracts data from tagged documents, for example, software which converts tagged documents into a hypertext database. The same for untagged documents, for that matter.

3. Software which discovers interrelationships between tagged documents, for example, automatically creating hypertext links between two related documents by different authors by identifying common terminology.

4. Linguistic analysis of tagged documents, for example, parsing of section headings to extract domain-dependent terminology. This could be used to enhance full-text searching or to build a knowledge base of concepts associated with a document.

I am aware of various related work on the Oxford English Dictionary, Windowbook, and EBT's DynaText.

I will summarize responses and post here. Thanks in advance...