[comp.ai.digest] Large corpora of English text

Bagley.PA@XEROX.COM (08/01/88)

Date: Thu, 28 Jul 88 15:30 EDT
From: Bagley.PA@Xerox.COM
Subject: Large corpora of English text
To: nl-kr@cs.rochester.edu, ailist@ai.ai.mit.edu
Line-fold: no

I am looking for public domain or commercially available corpora of
either written English or transcriptions of spoken English, preferably
significantly longer than a million characters.  If they are tagged with
part-of-speech information that would be great, but it isn't necessary.
Thanks for any assistance.

Steve Bagley
System Sciences Laboratory
Xerox PARC
3333 Coyote Hill Road
Palo Alto CA 94301
Bagley.pa@xerox.com
415-494-4331