mpledger@cti1.UUCP (Mark Pledger) (02/18/91)
Sorry for the wide cross-posting, but the following request is applicable to a number of the newsgroups.

I am in the midst of doing Information Retrieval (IR) research in a parallel environment. I would appreciate any information leading to algorithms for word stemming and/or thesaurus searching. Possible sources include technical journals, research papers, source code, etc. I would prefer either pseudocode or source code; however, I will extract the algorithms from written papers if needed.

Word stemming involves paring off the prefix or suffix of a word, reducing it to its most basic form. For example, passing the word "running" through the word stemmer yields the output word "run". Thesaurus algorithms search for a set of similar words based upon the input word given (e.g., the thesaurus capabilities in many word processors).

If you can help me, I'd appreciate any and all responses. I will summarize if enough interest is generated. Thank you.

--
Sincerely,

Mark Pledger

--------------------------------------------------------------------------
CTI                        |   (703) 685-5434 [voice]
2121 Crystal Drive         |   (703) 685-7022 [fax]
Suite 103                  |
Arlington, VA 22202        |   mpledger@cti.com
--------------------------------------------------------------------------
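
To illustrate the two operations described in the post, here is a minimal Python sketch of naive suffix stripping followed by a thesaurus lookup keyed on the stem. It is not a published stemming algorithm: the suffix list, the minimum stem length, the names naive_stem and related_words, and the toy THESAURUS table are all assumptions made for illustration.

```python
# Hypothetical suffix list, longest first; not taken from any published stemmer.
SUFFIXES = ("ing", "ed", "es", "s")
VOWELS = set("aeiou")

# Hypothetical toy thesaurus keyed by stem; a real one would be loaded from data.
THESAURUS = {"run": {"jog", "sprint", "dash"}}


def naive_stem(word):
    """Strip at most one common suffix, then undo a doubled final consonant."""
    w = word.lower()
    for suffix in SUFFIXES:
        stem = w[: len(w) - len(suffix)]
        # Require a stem of at least 3 letters containing a vowel,
        # so short words such as "sing" or "bed" are left alone.
        if w.endswith(suffix) and len(stem) >= 3 and VOWELS & set(stem):
            w = stem
            break
    # "runn" -> "run": collapse a doubled final consonant, but keep
    # endings like "ll", "ss" and "zz" ("fall", "pass", "buzz").
    if len(w) >= 3 and w[-1] == w[-2] and w[-1] not in VOWELS and w[-1] not in "lsz":
        w = w[:-1]
    return w


def related_words(word):
    """Return the thesaurus entry for the stem of `word` (empty set if none)."""
    return THESAURUS.get(naive_stem(word), set())
```

With these assumptions, naive_stem("running") gives "run", and related_words("running") returns the toy entry stored under "run". A rule set this small mis-stems many words (for example, it turns "class" into "clas"), which is why practical stemmers use larger, ordered rule tables with conditions on the remaining stem.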