clarke@csri.toronto.edu (Jim Clarke) (01/21/89)
INFORMATION SYSTEMS SEMINAR SPONSORED JOINTLY BY DCS/FLISS Friday, January 27, 2 p.m. in Room GB 420 (GB = Galbraith Building, 35 St. George Street) Carolyn Watters Dalhousie University "Accessing Information in Textual Data Streams" Most collections of textual data are excluded from the power of DBMS be- cause they do not display the required structure. Some common examples of textual data streams which one might wish to query are electronic mail, personal files (articles, memos, programs), bibliographic data, dic- tionaries, and the daily paperwork at an office. This talk will present a conceptual framework, using relational algebra and relational databases, within which data streams may be queried. The data are extracted using special operators, defined by relational algebra, and put into a relational database which can be queried in the usual manner. The schema of the resulting database is allowed to evolve as the user's knowledge of the content and/or structure of the data stream increases. This approach to querying textual data streams permits the integration of unstructured textual data with structured data for manipulation and access. -- Jim Clarke -- Dept. of Computer Science, Univ. of Toronto, Canada M5S 1A4 (416) 978-4058 BITNET,CSNET: clarke@csri.toronto.edu CDNNET: clarke@csri.toronto.cdn UUCP: {allegra,cornell,decvax,linus,utzoo}!utcsri!clarke