[net.sources.bugs] ispell dictionary fixes 01/03 - noncontroversial changes

geoff@desint.UUCP (03/14/87)

This is the first of three postings correcting errors in ispell's
dictionary file.  The postings are divided not so much because of size
as to give people some options.

Description of postings:

    (1)	Non-controversial changes that everyone will want to apply (unless
	you think "distritbute" is a word.
    (2) Possibly-controversial changes;  mostly removing one-word
	combinations of two-word phrases such as "sealevel" for "sea level".
	You may want to examine this file to remove any deletions you
	disagree with;  that's why it's separate.  (The intention was to
	reduce your work by separating the stuff you're sure to want from
	the stuff you might not like.)
    (3) More non-controversial changes that I missed in (1).

This is posting #1, the non-controversial changes.  To apply these changes,
run this entire article through "patch -d dir", where dir is the
directory containing dict.191.  Then re-sort dict.191 with
"sort -u -o dict.191 dict.191".

Oh yes, don't forget to pick up my new, enhanced, bugfixed ispell
(dict.191 not included) from net.sources!

	Geoff Kuenning
	{hplabs,ihnp4}!trwrb!desint!geoff

About these postings:

I created these postings by running dict.191 through UNIX 'spell'.  The
initial output, in which I only checked root words, was about 350 lines.
Many were medical terms not found in /usr/dict/words;  many others
were cases of legitimate alternate spellings that spell didn't accept.
Here's what I did:

(1) Unless I recognized that a word was obviously legal ("brooch") or
    a typo ("distritbute" is my favorite), I looked it up in a dictionary.
    I didn't declare a word not-found until I looked in both my 1968
    Funk & Wagnall's and my less-abridged 1974 "Living Webster
    Encyclopedic Dictionary".  I did this even if I thought I knew
    how to spell the word.
(2) If the dictionary just plain disagreed (both say "campuses";  dict.191
    had "campusses") I corrected dict.191.
(3) If a word wasn't in my dictionaries, I usually took it out.  The
    notable exception was obvious medical terms, which I trusted
    (except "arthogram," replaced with "arthrogram").
(4) I also took out one or two very archaic words that I feared might
    hide a typo in a common word.
(5) If there was an alternate spelling for a word ("formatted" or
    "formated" are both acceptable), I put in the alternate spelling.
    These usually involved doubled consonants on unaccented syllables,
    as in the example.
(6) There were a few words I left in because they are obviously in
    common use in our community or by analogy with my dictionaries
    (Webster's lists "keypunch", so I left in "keypad").
(7) Finally, I took out a whole bunch of cases where a two-word phrase
    had been written as one;  my favorite is "sealevel" for "sea level".
    Some were a harder judgement to make, such as "hardwired" (which I
    eventually took out, with many misgivings, but I think "hard wired"
    is a more acceptable usage).  Most of these are in part (2), the
    "controversial" changes.
(8) After making the corrections, I separated out the ones I was less
    sure, such as most of (7), plus some things I didn't know like
    "silkine" (a misspelling of "silken"?) and stuff I don't consider
    words like "artifactually" (give me a break!) and "termwise".
    The intention was to reduce your work by separating the stuff you're
    sure to want from the stuff you might not like.

The third posting arises because I wrote the "expand" scripts (see
net.sources) later, and they allowed me to detect some further problems;
I also collapsed a few suffixes I happened acrosse.  All of these changes
were noncontroversial.

So, having wasted a lot of net.breath on explanations, without further ado,
here is part 1:

Index: dict.191

26a27
> ABETTOR
1772,1773c1773
< BARBECUE/D/S
< BARBECUEING
---
> BARBECUE/D/S/G
1800c1800
< BARREL/M/S
---
> BARREL/M/S/D/G
2006c2006
< BELLYFULL
---
> BELLYFUL
2037,2039c2037
< BEQUEATH/D/G
< BEQUEATHAL
< BEQUEATHS
---
> BEQUEATH/D/G/S
2219,2220c2217
< BLABBERMOUTH
< BLABBERMOUTHS
---
> BLABBERMOUTH/S
2691c2688
< BUFFERRER/M/S
---
> BUFFERER/M/S
2862d2858
< CAMPUSSES
3121,3122c3117
< CHATTER/D/R/G/S
< CHATTERERZ
---
> CHATTER/D/R/G/S/Z
3391c3386,3387
< COHABITATE/N/X
---
> COHABIT/S
> COHABITATION/S
3436c3432
< COLUMNATE/D/G/N/S
---
> COLUMNAR
4741c4737
< DISHWASHERS
---
> DISHWASHER/S
4768c4764,4765
< DISPELL/D/G/S
---
> DISPELLED
> DISPELLING
4827d4823
< DISTRITBUTE/D/G/S
5472d5467
< EVILLER
5543d5537
< EXHAUSTABLE
5544a5539
> EXHAUSTIBLE
5628c5623
< EXTENDABLE
---
> EXTENDIBLE
5742c5737,5739
< FARTH/T/R/G
---
> FARTHER
> FARTHEST
> FARTHING
6303d6299
< GAYETY
7918c7914,7915
< KIDNAP/S
---
> KIDNAP/S/R/D/G/M
> KIDNAPPED
7970d7966
< KNOWLEDGABLE
7976c7972
< LABEL/S
---
> LABEL/S/D/R/G/M
8146c8142
< LEVEL/P/D/R/G/Y/S
---
> LEVEL/P/D/R/G/Y/S/T
8227a8224
> LIQUEFY/D/R/Z/G/S
8231d8227
< LIQUIFY/D/R/Z/G/S
8514c8510
< MARVEL/D/S
---
> MARVEL/D/S/G
8944c8940
< MOSQUITO
---
> MOSQUITO/S
8964c8960
< MOTTO
---
> MOTTO/S
8975,8976c8971
< MOUTH
< MOUTHE/D/G/S
---
> MOUTH/D/G
10544c10539
< PROFIT/D/G/S
---
> PROFIT/D/G/S/R/M/Z
10549,10550d10543
< PROFITTED
< PROFITTER/M/S
10647c10640
< PROTEST/D/G/S
---
> PROTEST/D/G/S/R/Z/M
10648a10642
> PROTESTER'S
10650d10643
< PROTESTOR/M/S
10724a10718
> PURCHASABLE
10726d10719
< PURCHASEABLE
10798d10790
< QUEUEING
10885,10886c10877
< RARETY/M/S
< RARITY
---
> RARITY/M/S
11186c11177
< RELABEL/S
---
> RELABEL/S/D/G/R/Z
11356,11357d11346
< RESISTABLE
< RESISTABLY
11359a11349,11350
> RESISTIBLE
> RESISTIBLY
11543c11534
< RIVAL/D/S
---
> RIVAL/D/S/G
12051d12041
< SHAREABLE
12181c12171
< SIGNAL/D/G/Y/S
---
> SIGNAL/D/G/Y/S/R
12182a12173
> SIGNALLER
12991c12982,12983
< SUBTRACT/D/G/S
---
> SUBTRACT/D/G/S/R/Z
> SUBTRACTER'S
12993d12984
< SUBTRACTOR/M/S
13118,13119c13109
< SUPREME/Y
< SUPREMITY/S
---
> SUPREME/Y/P
13660c13650
< TORNADO
---
> TORNADO/S
13662c13652
< TORPEDO
---
> TORPEDO/S
13671c13661,13662
< TOTAL/D/G/Y/S
---
> TOTAL/D/G/Y/S/R/Z/M
> TOTALER'S
13674c13665
< TOTALLER/S
---
> TOTALLER/S/M
13688c13679
< TOWEL/G/S
---
> TOWEL/G/S/D/M
13724a13716
> TRANQUILLITY
13734c13726
< TRANSFER/M/S
---
> TRANSFER/M/S/D/G
13737a13730
> TRANSFERRAL/M/S
14010a14004
> UNCANCELED
14198a14193
> UNLABELED
-- 

	Geoff Kuenning
	{hplabs,ihnp4}!trwrb!desint!geoff