[comp.databases] Information Retrieval Algorithms Wanted

mpledger@cti1.UUCP (Mark Pledger) (02/18/91)
Sorry for the wide cross-posting but the following request is applicable
to a number of the news groups.

I am in the midst of doing Information Retreival (IR) research in a parallel
environment.  I would appreciate any information leading to algorithms
for word stemming and/or thesaurus searching.  Possible information includes
technical journals, research papers, source code, etc.  I would prefer either
pseudo or source code, however I will extract the algorithms from written 
papers if needed.

Word stemming involves paring off the prefix or suffix of a word reducing it
to the most basic form.  For example, taking the word "running", and passing
it through the word stemmer results in the output word of "run". 

Thesaurus algorithms have the ability to search for a set of similar words
based upon the input word given (i.e., thesaurus capabilities in many word
processors).

If you can help me I'd appreciate any and all responses.  I will summarize
if enough interested is generated.  Thank you.



-- 
Sincerely,


Mark Pledger

--------------------------------------------------------------------------
CTI                              |              (703) 685-5434 [voice]
2121 Crystal Drive               |              (703) 685-7022 [fax]
Suite 103                        |              
Arlington, VA  22202             |              mpledger@cti.com
--------------------------------------------------------------------------