[net.general] name parsing

mo@wgivax.UUCP (Michael O'Shea) (03/17/86)

Has anyone out there written a parser for human name recognition?

I need a program or an effective algorithm that will recognize and
identify individual names and the parts of each name (first name, middle
name(s), last name, prefix, suffix, etc.), both where an individual name
appears alone AND where multiple names are grouped together.

Some examples of the latter:

	Dr. and Mrs. John L Davis
	John and Tammi Davis
	John Davis, Executor for Tammi Davis
	John Davis for minor Julie
	John, Tammie, and Julie Davis

As you can see, the groupings can get rather complex from the point
of view of a parser.  I need to know as much about each individual
name as possible.  For instance, in the last example above, I need
to know there are 3 individuals: John Davis, Tammie Davis, and Julie
Davis.
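
To pin down what I mean by that last example, here is a minimal sketch
(in Python, purely for illustration) of the simplest case only.  The
function name expand_shared_surname is made up, and the sketch assumes
that the last word of the group is a surname shared by everyone listed.
It does nothing about prefixes, suffixes, or roles such as "Executor
for", which are the hard part I'm asking about.

```python
import re


def expand_shared_surname(group):
    """Split a group like 'John, Tammie, and Julie Davis' into full names.

    Assumption: the last word of the group is a surname shared by every
    person listed.  Prefixes, suffixes, and roles are not handled.
    """
    # Break the group on commas and on the standalone word "and".
    parts = [p.strip() for p in re.split(r",|\band\b", group)]
    parts = [p for p in parts if p]
    if not parts:
        return []
    # The final part holds the last person's given name(s) plus the surname.
    *given, surname = parts[-1].split()
    if not given:
        return []  # the final part is a single word, so there is no given name
    people = parts[:-1] + [" ".join(given)]
    return [f"{person} {surname}" for person in people]
```

Called on "John, Tammie, and Julie Davis", this returns the three names
John Davis, Tammie Davis, and Julie Davis.  On "Dr. and Mrs. John L
Davis" it gives a wrong answer, which is exactly why a real grammar is
needed.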

I make this request out of total selfishness, in that I don't want to
tackle it myself.  I've already written a program to break down an
address and match it with addresses in postal files to generate ZIP codes
(-: thank GOD for LEX and YACC :-), but I'm getting lazy now.

I welcome partial, as well as complete, algorithms and programs.  I
prefer C, LEX, and/or YACC, but I'll take anything I can get my hands
on.  (I'm even desperate enough to consider reading COBOL code if
that's all you have :-( )  Naturally I prefer public domain or other
free software, but my company is willing to pay for a working program
(running on a VAX 11/780 w/ 4.2BSD or a SUN 3) that meets its needs.

Please mail replies to me, and I will summarize if there is enough
interest.  Thanks in advance.

email: decvax!mcnc!unccvax!wgivax!mo

USMAIL: Washburn Direct Marketing
        1123 South Church Street
        Charlotte, NC 28234
		attn: Mike O'Shea
		704/334-5371

 ***** all of my opinions are my own, and you can't have them! *****