nl-kr-request@cs.rochester.EDU (NL-KR Moderator Brad Miller) (10/23/87)
NL-KR Digest (10/22/87 22:36:18) Volume 3 Number 40 Today's Topics: Re: The Definite Article Re: Native American Languages Re: natural kinds and Indians Re: Infinite alphabets - (Turing via Berke) Re: Infinite alphabets (and CFness of NLs) ---------------------------------------------------------------------- Date: Wed, 14 Oct 87 08:04 EDT From: Chris Torek <chris@mimsy.UUCP> Subject: Re: definite article In article <185@Aragorn.dde.uucp> ct@dde.uucp (Claus Tondering) writes: [edited for effect] >New Testament gives good example of need for definite article. In >John's Gospel (I don't remember exact chapter and verse) Jesus says: >"I am the way, the truth, and the life." By this he meant only way, >only truth, so that there is no other way to God except through Jesus. >Now, in Russian translation of Bible, verse is presented without >definite article, because Russian has none. In Russian verse >becomes: "I am way, truth, and life", which may either mean same thing >as English translation, or may mean "I am a way, a truth, and a life", >which does not exclude existence of other ways to God. Curious. Other Russian translation of Bible says `I am only way---no other ways---only truth, and only life'. Seemed to exclude existance of other ways to God. In language without definite article, other words fill in. Anyone who has read, e.g., Heinlein's _Moon_Is_Harsh_Mistress_ (in English) can see. Native speakers of English seem to find style jarring at first, but soon stop complaining. After short while seems natural. Can even do it themselves. (Apply liberal quantities of :-), and no, I do not speak Russian.) -- In-Real-Life: Chris Torek, Univ of MD Comp Sci Dept (+1 301 454 7690) Domain: chris@mimsy.umd.edu Path: uunet!mimsy!chris ------------------------------ Date: Tue, 20 Oct 87 14:27 EDT From: pjt@BRL.ARPA Subject: Re: The Definite Article As was illustrated by Mitchell Spector's examples from Beckman, semantic distinctions that one language community considers essential may be considered minor by others. The sentences 1) Leibniz found a solution. 2) Leibniz found the solution. clearly have different meanings, and if that difference were crucial <and> unavailable from context, a Russian speaker could say 3) Leibniz found one solution. 4) Leibniz found that solution. But I don't believe that the notion of definiteness of NP's is vital to the survival of human intellect. The other side of the coin is that Russian speakers have means of being precise in ways for which English provides only more clumsy mechanisms. The semantic difference in: 5) Ivan routinely ate borscht. 6) Ivan ate the borscht all up. can be handled in Russian by simply using two different aspects of the general verb "to eat," a la 7) Ivan ate_1 borscht. 8) Ivan ate_2 borscht. This has something of the same flavor as the difference in French between the imparfait and the passe compose. And then there's always the notoriously elaborate system that Japanese provides for handling nuances of respect and class-differentiation. A Japanese designing a language might consider these distinctions necessary. A Russian might insist on the inclusion of a perfective aspect. But neither would feel an aching need to handle definiteness of NP's. Is this Sapir-Whorf again? While I'm going, let me toss out two questions I've been wondering about for a while: - The English verb "to know" can mean both "to be aware of (a fact)" and "to be acquainted with (a person)." French has savoir/connaitre, German has wissen/kennen, Russian has znats/(I forget the other). Is there any other language besides English that uses one word for both senses? - In English, the first-person plural (e.g., "we") may or may not include the listener. That is "we" can be either 1st- or 1st-and-2nd-person. Likewise, the second-person plural (e.g., "you") may or may not include stray third persons. Russian addresses the former ambiguity with its "mi s'vami" construction, literally, "we including you." Are there any languages that go further towards encoding these nuances? Paul Tanenbaum <pjt@brl.arpa> ------------------------------ Date: Thu, 15 Oct 87 15:10 EDT From: Rich Alderson <ALDERSON@Score.Stanford.EDU> Subject: Re: Native American Languages In V5 #228, Bruce Nevin complains about "dragging the Indians into it." While I agree with his general point about relations across the 55 (or 57) linguistic families in North America north of Mexico, I do have to say that I have seen at least one paper on the topic of native-language taxonomy. I don't have all the details (it's been about ten years), but as I recall, it concerned a Mixtecan dialect (southwestern Mexico), and was done by a Summer Institute of Linguistics type. The match-up was done by first collecting the classifications of various plants IN THE LANGUAGE IN QUESTION, and only then checking with a botanist. The matchup was close, but not exact: One non- useful pair of plant species in the same genus were lumped under a single term, while a couple of useful plants had different names based on the uses to which they could be put. I'm sure that someone who has looked at this stuff more recently than I can provide a reference based on these notes. Rich Alderson A.Alderson@Macbeth.Stanford.EDU ------- ------------------------------ Date: Fri, 16 Oct 87 22:18 EDT From: James J. Lippard <Lippard@BCO-MULTICS.ARPA> Subject: Re: natural kinds and Indians > Date: Sat, 3 Oct 87 10:30:14 EDT > From: Bruce Nevin <bnevin@cch.bbn.com> >> From: cugini@icst-ecf.arpa >> I believe there have been anthropological studies, for instance, >> showing that Indian classifications of animals and plants line >> up reasonably well with the conventional Western taxonomy. > I saw this go by in AIList, and here it comes again in NL-KR, and I just > can't let you get away with it, John. Perhaps some of the cases to which John Cugini is referring are Ernst Mayr's account of the Papuans of New Guinea, who have 136 names for the 137 species of birds there (confusing only two species of warblers), Ralph Bulmer's work on vertebrate taxonomies of the Kalam people of New Guinea, and Brent Berlin's study of classification by the Tzeltal Indians of Mexico. All of these are briefly described in the essay "A Quahog is a Quahog" by Stephen Jay Gould (reprinted from Natural History magazine in his anthology _The Panda's Thumb_, 1980, W.W. Norton & Co.). The actual journal articles and book are: Berlin, B., Breedlove, D.E., and Raven, P.H. 1966. "Folk taxonomies and biological classification," Science 154:273-75. Bulmer, R., and Tyler, M. 1968. "Karam classification of frogs," Journal of the Polynesian Socity 77:333-85. Mayr, E. 1963. _Animal species and evolution_, Cambridge, Mass.: Belknap Press of Harvard University Press. > Bruce Nevin > bn@cch.bbn.com Jim Lippard Lippard at BCO-MULTICS.ARPA ------------------------------ Date: Fri, 16 Oct 87 04:23 EDT From: Jeffrey Goldberg <goldberg@russell.STANFORD.EDU> Subject: Re: Infinte alphabets - (Turing via Berke) In article <959@uhccux.UUCP> lee@uhccux.UUCP (Greg Lee) writes: >The number of phonetic characters must be finite if language is context-free. That is true, but it is uninformative. Any recursively enumerable language will have a finite terminal vocabulary. >There are apparent context sensitivities in pronunciation, such as for >some English speakers, pronouncing "band" as "bam" before a word beginning >with "p", as in "The bam played on". "Context Sensitive" and "Context Free" are technical terms. You can get into trouble by using both the technical and non technical meanings in the same argument. >Yet such context sensitivities may >be represented in a context-free grammar, provided that the sensitive item >and its context are not separated by an unbounded number of constituent >boundaries. Dependencies over arbitary distance can be maintained in a CF grammar: S -> a B a S -> c B c B -> b B B -> b (lowercase letters are terminal symbols.) >Perhaps, though, language is only almost context free, failing to be >so through having an infinite number of terminal symbols. The linguistics literature from 1960 to 1985 is filled with arguments about why NLs are not CF. All of those arguments had holes in them. In 1985, however, two mathematically well founded arguments were published in the "Linguistics and Philosophy". One by Christopher Culy and the other by Stuart Shieber. The whole question raised had to do with writing systems, and I don't quite see what this has to do with the (mathematical) class languages fall into. >Greg Lee, Lee@uhccux.uhcc.hawaii.edu -- Jeff Goldberg ARPA goldberg@russell.stanford.edu UUCP ...!ucbvax!russell.stanford.edu!goldberg ------------------------------ Date: Sat, 17 Oct 87 15:49 EDT From: Greg Lee <lee@uhccux.UUCP> Subject: Re: Infinte alphabets - (Turing via Berke) In article <434@russell.STANFORD.EDU> goldberg@russell.STANFORD.EDU (Jeffrey Goldberg) writes: >In article <959@uhccux.UUCP> lee@uhccux.UUCP (Greg Lee) writes: >[stuff ...] The commentary on my posting seems to me to be rather obtuse. Rather than proceeding point by point, I think I should restate the matter. Perhaps this time I can make myself clearer. Before doing that, I'll explain that by "separated by ... constituent boundaries" I meant to refer to the circumstance that one item is within a constituent and another is outside that constituent. Now, to begin again. Gerald Gazdar made the point that a local context sensitivity, such as a transitive verb occurring always with a sister noun phrase, does not prevent a language from being context free. And since there is evidence that such subcategorization context sensitivity is in fact local in natural languages, this provides us some evidence (not probative) that language is context free. Suppose now that we apply the same reasoning in the case of phonology. If language is phonologically context free, we would predict that context sensitivities are local. It is a commonplace that phonological rules tend not to apply across major constituent breaks. The prediction appears to be correct, and so we have some evidence (not probative) that language is phonologically context free. Because of the qualification in my original posting, I concede in advance that this does not provide a very compelling argument for finiteness of alphabets. But the logic, at least, seems clear enough: language is context free (hypothesis supported by empirical argument) if language is context free, the alphabet is finite (obvious) therefore, the alphabet is finite (well known rule of logic) (We could talk about the logic in the Schieber article sometime, if you like.) Greg Lee, lee@uhccux.uhcc.hawaii.edu ------------------------------ Date: Sat, 17 Oct 87 22:51 EDT From: Jeffrey Goldberg <goldberg@russell.STANFORD.EDU> Subject: Re: Infinte alphabets (and CFness of NLs) In article <969@uhccux.UUCP> lee@uhccux.UUCP (Greg Lee) writes: >In article <434@russell.STANFORD.EDU> goldberg@russell.STANFORD.EDU > (Jeffrey Goldberg) writes: >>In article <959@uhccux.UUCP> lee@uhccux.UUCP (Greg Lee) writes: >>[stuff ...] > [ ... ] >Now, to begin again. Gerald Gazdar made the point that a local >context sensitivity, such as a transitive verb occurring always >with a sister noun phrase, does not prevent a language from being >context free. And since there is evidence that such subcategorization >context sensitivity is in fact local in natural languages, this provides >us some evidence (not probative) that language is context free. Gazdar also deals with so-called "Unbounded" Dependencies. See "Unbounded Dependcies and Coordinate Structures" in _Linguistic Inquiry_ 12, 1981. Also, "Generalized Phrase Structure Grammar" (Gazdar et al, 1985). >Suppose now that we apply the same reasoning in the case of phonology. >If language is phonologically context free, we would predict that >context sensitivities are local. It is a commonplace that phonological >rules tend not to apply across major constituent breaks. The prediction >appears to be correct, and so we have some evidence (not probative) >that language is phonologically context free. I'm not sure that I follow this argument. Are you saying: "All these things involve local dependencies and therefore ought to be dealt with by a CF grammar."? That same argument could be applied equally well (or badly) to conclude that NLs are finite state. >Because of the qualification in my original posting, I concede in >advance that this does not provide a very compelling argument for >finiteness of alphabets. But the logic, at least, seems clear >enough: > language is context free (hypothesis supported by empirical > argument) > if language is context free, the alphabet is finite (obvious) > therefore, the alphabet is finite (well known rule of logic) I'm afraid the logic isn't clear to me. If a language is is CF (or CS for that matter) its terminal vocabulary is finite. Agreed. But it is not obvious to me how you get from there to concluding that the WRITING system of a language must employ a finite alphabet. >(We could talk about the logic in the Schieber article sometime, >if you like.) I think that that would be a refreshing change of topic. Anyway, I too believe that there are no writing systems based on infinite alphabets. But I have nothing more to add to what I have already said on that topic. >Greg Lee, lee@uhccux.uhcc.hawaii.edu -jeff goldberg -- Jeff Goldberg ARPA goldberg@russell.stanford.edu UUCP ...!ucbvax!russell.stanford.edu!goldberg ------------------------------ Date: Sun, 18 Oct 87 14:13 EDT From: Greg Lee <lee@uhccux.UUCP> Subject: Re: Infinte alphabets (and CFness of NLs) In article <448@russell.STANFORD.EDU> goldberg@russell.STANFORD.EDU (Jeffrey Goldberg) writes: *In article <969@uhccux.UUCP> lee@uhccux.UUCP (Greg Lee) writes: *>In article <434@russell.STANFORD.EDU> goldberg@russell.STANFORD.EDU *> (Jeffrey Goldberg) writes: *>>In article <959@uhccux.UUCP> lee@uhccux.UUCP (Greg Lee) writes: *>>[stuff ...] *>... *>Now, to begin again. Gerald Gazdar made the point that a local *>... * *Gazdar also deals with so-called "Unbounded" Dependencies. See *"Unbounded Dependcies and Coordinate Structures" in _Linguistic *Inquiry_ 12, 1981. Also, "Generalized Phrase Structure Grammar" *(Gazdar et al, 1985). He does not deal with all unbounded dependencies, because they cannot in general be expressed in a CFPSG. If there is a problem with the argument I gave it would hinge on whether unbounded phonological dependencies are of the sort which can be expressed in a CFPSG. I don't know whether they are. For the special case of a transparently applying phonological rule, i.e. one whose conditions are met in the pronunciation, it seems to me that unbounded phonological dependencies could indeed be expressed, by the device of "foot initial features" which propagate up the tree from left-most daughters, "foot final features" which propagate up from right-most daughters, and appropriate agreement rules. So, upon more careful consideration, I would say my argument was incomplete, at best. * *>Suppose now that we apply the same reasoning in the case of phonology. *>... * *I'm not sure that I follow this argument. Are you saying: "All *these things involve local dependencies and therefore ought to be *dealt with by a CF grammar."? That same argument could be applied *equally well (or badly) to conclude that NLs are finite state. * Yes, that is what I'm saying. And of course regular languages are context free, so that possibility is included. So far as I'm aware, all the evidence that has been cited in favor of language being context free also supports the hypothesis that language is regular. (Remember Ingve's "A Model and an Hypothesis for Language Behavior"?) *>Because of the qualification in my original posting, I concede in *>... *I'm afraid the logic isn't clear to me. If a language is is CF (or *CS for that matter) its terminal vocabulary is finite. Agreed. *But it is not obvious to me how you get from there to concluding *that the WRITING system of a language must employ a finite *alphabet. * alphabet = segments of adequate transcription system *>(We could talk about the logic in the Schieber article sometime, *>if you like.) * *I think that that would be a refreshing change of topic. * Look at his footnote 4. Is he right that the optionality of objects makes no difference to his argument? One can't simply take an intersection with a language `... some number of verbs ... same number of nouns' because this is not regular. *Anyway, I too believe that there are no writing systems based on *infinite alphabets. But I have nothing more to add to what I have *already said on that topic. *... Greg Lee, lee@uhccux.uhcc.hawaii.edu PS: Fixing a reference in a recent posting: Yngve, Victor. 1960. "A model and a hypothesis for language structure", Proc. of the Amer. Philos. Society 104, pp. 444-466. (Criticised in a footnote in Aspects.) Greg Lee, lee@uhccux.uhcc.hawaii.edu ------------------------------ End of NL-KR Digest *******************