nl-kr-request@CS.ROCHESTER.EDU (NL-KR Moderator Brad Miller) (12/01/88)
NL-KR Digest (11/30/88 21:06:43) Volume 5 Number 33 Today's Topics: Subjectivity, Mud-slinging, Behaviorism was predictive knowledge... best grammatical theories Generation and Recognition of Affixational Morphology (Unisys Seminar) Submissions: NL-KR@CS.ROCHESTER.EDU Requests, policy: NL-KR-REQUEST@CS.ROCHESTER.EDU ---------------------------------------------------------------------- Date: Wed, 9 Nov 88 20:06 EST From: Clay M Bond <bondc@iuvax.cs.indiana.edu> Subject: Subjectivity, Mud-slinging, Behaviorism Walter Rolandi: >Sure Rick. I think that the body of knowledge that is linguistics is >somewhat obscure and scarcely known to anyone outside of the field. >When linguists speak, mostly just linguists listen. This is not without some element of truth, but only if you leave out cognitive/computational/psycholinguists. Linguistics, at least that done by computational linguists is more than scarcely known in AI, for example, as is psycholinguistics in psychology, and cognitive linguistics in psychology/neuroscience/compsci. As far as who listens when "linguists speak" as much responsibility falls on the (potential) listener as does the speaker. That's not blame that can be laid solely at the door of linguistics. >I am suggesting that >this would not be the case if the discipline were to address questions >which, if answered, would be of interest to the general scientific community. >More specifically, I am suggesting that linguistics will take a giant >step towards science when the field embraces causal analysis and >controlled experimentation. This is the stuff of all real science. And just who is to determine which questions are of interest? It is true that analysis and experimentation has played next to no role in linguistics *until recently*, but again, who determines what is "real science"? Besides you, that is. And you can produce bullshit just as easily with "real science" as you can with "irreal (?) science". >Silly me, I supposed that to mean that pragmaticists were busy producing >a body of data that defines the causal relationships between the >people, places, and objects around us that effect [sic] the contents of what >we say. What I found instead was a lot of pompous intellectualizing on >the nature of "communication". _hat I have seen is little more than >mentalistic, philosophical, even literary analysis of things like >"information", "messages", and "intentions". Pompous intellectualizing! Not a wee bit resentful here, are we? I can sympathize with your objection to mentalism/philosophy/literature masquerading as science, but I don't think you can remove information/messages/intentions from language without throwing the baby out with the bath water. If language is none of these things, then what, pray tell, do you think it is? When you use language does it contain no information/messages/intentions? >What's wrong with pragmatics and linguistics in general is that neither >field addresses any issues of scientific import. According to whom? What exactly has given you the authority to make this bogus and unsubstantiated claim? >Who cares about >anti-homophones, u-umlaut and y (or rather, u-double-dot and y), and >embedded, recursive prepositional phrases? A lot of people, obviously. Your statement had no purpose but to be offensive. >When you know all about >these things, what do you know? I want to know why people say the >things that they say and why their utterances take the forms that they do. When you know all about particle physics, what do you know? The answer is the same. I also want to know why people say the things that they say. And the things that they say are intrinsically bound up with information, messages, and intentions, those things you thumbed your nose at earlier. >I want a scientific answer. Is it unfair or unkind of me to ask this >of linguistics? It is if you don't ask or require it of yourself. Double standards are never fair or kind. >>(BTW, use the term 'linguist' to refer >>to linguists in general. If you are just complaining about generative >>linguists, then say so.) > >I stand accused. You do indeed, twice. And at least once by one who is not a generative linguist. >I want it [linguistics] to aspire to >substance: give me a causal explanation of verbal behavior. You cannot explain behavior without explaining cognition. Without cognition, behaviorism *is* push-pull mechanism, which gives no more causal explanation than does mentalism. Behaviorism without cognitive theory has no explanatory power. I suggest that you find out a bit more about what is going on in linguistics before you sling mud again. Chomsky is not the only linguist out here. -- << Clay Bond -- IU Department of Leath-er, er, uh, Linguistics >> << bondc@iuvax.cs.indiana.edu AKA: Le Nouveau Marquis de Sade >> << {pur-ee,rutgers,pyramid,ames}!iuvax!bondc ------------------------------ Date: Fri, 11 Nov 88 14:14 EST From: w.rolandi <rolandi@gollum.UUCP> Subject: was predictive knowledge... In response to Clay's: >Please object. But if you have no alternative to offer which >addresses your complaints by correcting what you perceive to >be deficiencies, you are contributing nothing to anything. Or >to be more colloquial, put your money where your mouth is. What on earth do you think I've been talking about? From the onset, I have been suggesting that the science of human behavior, behavior analysis, offers specific advantages, both in its causal model and in its experimental methodology for addressing the deficiencies of linguistic knowledge. Ironically, your emotional responses have unwittingly served to reinforce my impression that linguists possess an impoverished understanding of the purpose, spirit, and methods of science. I have stated methodological and epistemological criticisms of linguistics. Curiously, you do not address my specific complaints. Forfeiting an opportunity to cite data that would prove me wrong--on my own terms, no less--you act as if you are stepping forward to avenge the honor of an unjustly maligned loved one. Your first response questioned my authority to make such statements, as if facts are untrue unless cited by authorities. Your second response essentially entreats me to find something nice to say about linguistics or to refrain from saying anything at all. What can I say? I am talking about methodological issues associated with obtaining a predictive knowledge of natural phenomena. The problem is that I am talking specifics in an audience that is oblivious to the general plan. Who else but, Walter Rolandi rolandi@ncrcae.Columbia.NCR.COM NCR Advanced Systems Development, Columbia, SC ------------------------------ Date: Fri, 11 Nov 88 16:11 EST From: Avery Andrews <munnari!fac.anu.oz.au!andaling@uunet.UU.NET> Subject: best grammatical theories > How can you tell if one theory of syntax is better than another? >Walter Rolandi >rolandi@ncrcae.Columbia.NCR.COM >NCR Advanced Systems Developmbou, Columbia, SC Well, to begin with, you have to define what the NL syntax is actually doing. (which I will henceforth call `grammar', to distinguish it from various other things that people sometimes call `syntax'). I'd suggest that a grammar is a scheme whereby a classification of the the vocabulary of a language into a finite collection of (in general overlapping) categories, plus some additional principles, is used to indicate how the meanings of the words (or formatives, or morphemes) are to be put together. In a PSG-based account of English, for example, we would say that there were lexical categories, or `parts of speech', such as Det, Adj, N and V, and phrase-structure rules such as S -> NP V (NP) (NP) NP -> (Det) Adj* N (grossly oversimplified, of course) We'd also say something to the effect that NP's refer to entities, that their components indicate the properties of the entity referred to, and that the V in an S indicates the relation between the entities referred to. How this is said depends on what approach to semantics we're taking. So the PS rules and the semantic principles tell us how to sort out the attributes between the entities in a sentence like The naive herpetologist handed the beautiful princess a worried-looking crocodile Other languages employ different sorts of principles, often involving grammatical feature-based methods such as case-marking and agreement, and other theories describe the facts in rather different ways--categorial grammar, for example, builds virtually all of the language-particular aspects of grammar directly into the system of categories, with the rules being universal principles (function application, composition, and type-raising, for example). But these variations are always based on classifying the vocabulary into a finite set of classes (including those defined by features). Generative grammars frame these principles in such a way that they interact so as to assign each string of words a possibly empty set of grammatically legitimate ways of composing the meanings of its words. Ungrammatical strongs have none, ambiguous ones more than one. (By speaking of ways of composing word-meanings, rather than of meanings, we allow for a degree of fluidity in the lexicon, and for the use of contextual knowledge to discover the meaning of unknown words by reverse engineering.) In a successful utterance, knowledge of context + common sense is sufficient to allow the hearer to choose the intended meaning from the grammatically + lexically legitimate ones. It turns out that there are lots of alternative ways of presenting such principles, which are *not* created equal. We could replace the PSG above with the following finite-state grammar (FSG): S = (Det) Adj* N V ((Det) Adj* N) ((Det) Adj* N) (Technically, this isn't really an FSG, since it describes the arrangement of word-classes rather than individual words. Maybe it should be called a Word-Class Grammar (WCG). Linguistically, FSG's have all the faults of WCG, and more besides). Although this generates the same strings as the PSG, and could be equipped with an equivalent semantics, it can be read as making very different claims about the causation of linguistic behavior. The PSG says that there are two distinct causal factors, S-structure, and NP-structure, which interact in a certain way, essentially, the NP-structure principles govern the internal form of some sequental blocks of words, whose external arrangement is determined by the S-structure principles. The FSG says no such thing--there is just a big formulae saying what order words appear in. The PSG says that there is one causal factor behind the relative order of Det, Adj and N in the three positions, while the FSG says that there are three different ones, which is definitely wrong. (If they were different, why would they have the same effect, in position after position? And why would similar phenomena appear, with different details of order, in language after language?) PSGs are better than FSGs, but they get into trouble quickly, for example, with subject-verb agreement, which, for `classical' (e.g. feature-free) PSGs, requires that many rules be split into `singular' and `plural' verions. eg.: S -> {NPsg Vsg (NP) (NP), NPpl Vpl (NP) (NP)} NP -> {NPsg, NPpl} NPsg -> (Detsg) Adj* Nsg NPpl -> (Detpl) Adj* Npl What's wrong with this is that it says that there are different causal factors behind the order of words in singular S and NP than in plural ones, and this is clearly wrong: phrases differing properties such as number, gender or case show essentially the same gross internal structure. There are many known cures for this ailment: agreement transformations were the first formal proposal widely accepted by linguists. Various kinds of feature-sharing principles are today most popular, especially among those with computational inclinations. All of these solutions have in common that they treat the arrangement of the agreeing elements as one factor and the details of what is shared as another. Wh-movmement is a third example: here we have superposed on the basic machinery for assigning semantic roles (on the basis of agreement, word order, case-marking, etc., depending on the language) some additional machinery that allows constituents to appear elsewhere than their basic expected position, subject to interesting constraints. Many ways of accomodating the most obvious facts are now known: transformational movements or deletions, coindexing schemes, `slash-feature' manipulations, `functional uncertainty', functional composition+type raising are some current contenders. Thus one major desideratum a grammatical theory is to allow one to formulate principles that correspond accurately to the underlying causal factors behind grammatical phenomena, and correctly predict their interactions. The corresponds pretty accurately to what Chomsky has sometimes called `descriptive adequacy' (he has unfortunately also applied this term to less interesting properties, such as simply generating the right set of sound-meaning correspondences, without regard to whether it has been done right.) It also corresponds pretty closely to being a pleasant language to specify grammars in. Current grammatical theories (GTs) use a wide range of mechanisms to say to a considerably extent the same kinds of things about how grammatical phenomena are to be carved up, They thus have a wide range of overlap in the phenomena they handle well. Nonetheless, they nonetheless differ significantly in how well they approximate descriptive adequacy in various areas. LFG seems to me to be unequalled for basic predicate-argument structure and the complement system: proponents HPSG, GPSG, and the presently available varieties of Categorial Grammar have not yet told in public any convincing story about things like Quirky Case in Icelandic, or preverbal pronouns in French and Spanish. The other theories seem to do better with Wh-movement and related phenomena: proponents of LFG have not yet told in public a sensible story about parasitic gaps or accross-the-board extraction. People tend to prefer the theory that does best with the stuff they actually are working on at the moment. Beyond descriptive adequacy are somewhat murkier and more controversial concerns, which tend to go under the label of `Explanatory Adequacy'. The most generally accepted of these is the desideratum that a GT help explain why language-learning is possible. The basis of this is the idea that the less one has to specify to determine the language, the easier it is to imagine how children can do it. The ultimate position here is that taken by Chomsky and his followers in GB, who believe that an interesting portion of the grammar of all languages (`core grammar') can be specified by setting the values of a relatively small number of binary-valued (or maybe finite list-valued) parameters. I find this idea pretty bizarre, but many linguists seem to accept it. A more modest claim is the difference between a transformational passive rule and a `classic' LFG one: TG Passive: NP - X - V - NP - X => 1 2 3 4 5 4 2 be+en+3 0 5+by#1 LFG Passive: SUBJ -> OBLby/0 OBJ -> SUBJ 0 -> Pass.Part The TG rule is a structure changing operation which has a complicated statement because it has to recapitulate most of the basic facts of English clause structure (the subject comes before the verb, the object after, auxiliaries come between the subject and the verb, and prepositional phrases come after, and before their NPs). And it doesn't *really* work anyway. The LFG rule is a lexical rule operating on lexical items whose argument positions are associated with grammatical relations: it creates a new lexical entry, altering the grammatical relation/semantic role assignments and the feature composition of the item as indicated. It is much simpler because it doesn't recapitulate basic facts of clause structure, but depends on other components of the grammar that account for these facts (the rules associating grammatical relations with NPs, and connecting the feature-composition of verb forms with their distribution). Furthermore, just about everything in it corresponds to an actual parameter of variation in passive-like constructions: sometimes the old subject is always suppresssed (the 0 option for SUBJ), sometimes never; sometimes OBJ must be present and become SUBJ, sometimes not; sometimes the grammatical category of the verb form become participial, sometimes not, It is most unlikely than language learners would learn the TG rule, but it isn't crazy to suppose that they could pick up the LFG one. This concept of explanatory adequacy is actually closely connected with descriptive adequacy, because the greater the extent to which the notation is free from redundancy and formal fuss, the easier it is to imagine how a learning procedure might successfully find its way through the space of possibilities it provides. But there is still nothing in descriptive adequacy that says that the GT shouldn't provide more facilities than one actually needs to describe languages right, which this kind of explanatory adequacy does. One can also ask that a GT illuminate actual facts about acquisition, rather than just make it easier to imagine how it might be possible. Recent work by Hyams and Pinker attempts to do this. But this requires an actual theory of learning as well as a GT. Another brand of explanatory adequacy pertains to processing. The way in which a GT presents grammatical information can make a big difference to the possibilities for using it in comprehension and production. TG's are pretty terrible in this regard, many more recent theories are much better. (You can get an LFG parser to run on your PC from the Institute for Machine Translation at the University of Stuttgart; nothing like this is even envisionsable for classical TG). A complicating factor is the possibility of compilation: maybe language learning, and the halting performances of beginners, are based on direct processing of some kind of augmented PSG, while the highly overlearned performance of fluent speakers is based at least in part on the use of FSGs + strategies generated by various kinds of compilation processes. Only psycholinguistics will be able to sort that out. But the generalizations (e.g. basic causal factors) detected by linguists still have to be taken as part of the causal order, for otherwise there is no account of why they are there). The import of Explanatory Adequacy is that the generalizations discovered by linguists represent real causal factors. Thus, they ought to interact sensibly with other kinds of causal factors, such as learning procedures and and processing mechanisms. There is at least one complicating factor. The generalizations (e.g. causal factors) identified by grammatical analysis are supposed to reside in the minds of language users. Because sometimes linguists get fooled. For example, languages tend to undergo sequences of regular sound changes, which can turn formerly regular and uninteresting systems of forms into things that look like the results of applying long sequences of phonological rules to rather abstract underlying representations. There is good reason to believe that there exist things that look and act pretty much like standard generative phonological rules (the original sound changes, at a minimum), but looking at say, the residue of the great vowel change in English, it is hard to say what is historical residue and what is genuine Modern English grammatical structure. Chomsky + Halle's rules in SPE do roughly represent genuine causal factors in linguistic behavior, but it's not so clear what there locus is: history, or the brains of modern speakers. Happily, most of syntactic and contemporary phonological theory seems reasonably secure against this line of attack, but one has to rely on common sense to ascertain what is the actual locus of causal factors revealed by grammatical analysis. (If a group of people started speaking a language taught to them by aliens, its grammatical generalizations would not be of direct relevance to human GT, since their utlimate locus would be the minds of the aliens, not the human speakers. On the other hand, the changes that human speakers introduced into the language might be highly revealing for human GT.) Avery Andrews The Australian National University andaling%fac.anu.oz@seismo.css.gov ------------------------------ Date: Sun, 13 Nov 88 07:28 EST From: Greg Lee <lee@uhccux.uhcc.hawaii.edu> Subject: best grammatical theories In the NL-KR Digest, Avery Andrews <munnari!fac.anu.oz.au!andaling@uunet.UU.NET> replies to: > > > How can you tell if one theory of syntax is better than another? > > >Walter Rolandi > >rolandi@ncrcae.Columbia.NCR.COM by giving a very useful and readable summary of some syntactic theories and their problems. Among other things, he compares the "causal factors" they ascribe to some example expressions. But there is a little problem here. If we were to suppose that any theory causes the facts that it describes, then it would be fair to refer to the parts of the theory as "causal factors", I suppose, as he does. However I doubt that Walter, or any reasonable person, would agree to this supposition. Andrews says: >... generalizations (e.g. basic causal factors) detected by linguists ... This is even worse. Having discovered a general fact, a linguist may assume he has uncovered its cause, apparently, because it causes itself. This completely trivializes the idea that human behavior has causes and that we should search for them. So I suggest replacing 'causal' with 'descriptive' in Andrews' article everywhere it occurs. It reads better. Greg, lee@uhccux.uhcc.hawaii.edu ------------------------------ Date: Thu, 17 Nov 88 16:15 EST From: Chrystopher Nehaniv <nehaniv@oreo.berkeley.edu> Subject: Re: best grammatical theories There is a very pratical way of comparing syntactic theories: Compare how they explain a huge set of sentences. Do an analysis of each sentence in each of theories being compared. No known theory will be always right. Perhaps no possible non-trivial theory will ever be always right. If one theory explains more kinds of sentences, it's good. If they explain about the same number, look for simplicity and elegance as a comparision. As a related criterion, consider how easy is it to do an analysis using each theory. In some cases, one may choose a theory that gets less sentences right if its much simpler than the competing theory. All the factors should be considered to decide which theory is better. ---------------------------------------------------------- No theory should be dismissed for making wrong predications - all do. A theory should only be superseded by a better theory. It's like this for chemistry and physics. To reiterate, if Chomsky's GB or GPSG or what have you fails on some sentence that's no reason to stop doing that theory. In fact, many grammarians do not aspire to explain all sentences anymore, but only a "core" of grammatical utterances. They expect fuzzy applications of any theory on the periphery. (We all make ungrammatical utterances sometimes even when trying to be formal. Also idiosyncratic properties of certain lexical items may make them fail to fit in a reasonable grammar). C.L. Nehaniv | " Things fall apart. Dept. of Mathematics | It's scientific." UC Berkeley, CA 94720 | -D. Byrne C.L. Nehaniv | " Things fall apart. Dept. of Mathematics | It's scientific." UC Berkeley, CA 94720 | -D. Byrne ------------------------------ Date: Fri, 4 Nov 88 22:44 EST From: finin@PRC.Unisys.COM Subject: Generation and Recognition of Affixational Morphology (Unisys Seminar) AI SEMINAR UNISYS PAOLI RESEARCH CENTER John Bear SRI International Generation and Recognition of Affixational Morphology Koskenniemi's two-level morphological analysis system can be improved upon by using a PATR-like unification grammar for handling the morphosyntax instead of continuation classes, and by incorporating the notion of negative rule feature into the phonological rule interpreter. The resulting system can be made to do generation and recognition using the same grammars. 1:00 am - November 7, 1988 R&D Conference Room Unisys Paoli Research Center Route 252 and Central Ave. Paoli PA 19311 -- non-Unisys visitors who are interested in attending should -- -- send email to finin@prc.unisys.com or call 215-648-7446 -- ------------------------------ End of NL-KR Digest *******************