[comp.text.sgml] where to get machine readable text?

emv@math.lsa.umich.edu (Edward Vielmetti) (09/18/90)

In article <2945@network.ucsd.edu> pbiron@weber.ucsd.edu (Paul Biron) writes:

   I realize that most discussion in this group is about formating
   text, however, I cannot find a more suitable group to post this
   question to.  Sorry if this is a gross missuse of this group's
   bandwidth!

I've x-posted into comp.text.sgml, which is relevant of sorts

   I am looking for places (hopefully free, via ftp) to get machine
   readable text (list deleted)

For anonymous FTP from the directory

	sgml.math.lsa.umich.edu:/pub/sgml/oxford-text-archive 

you will find a list of texts which are available (for a fee) from the
Oxford Text Archive and a description of how to get them.  Use of the
texts is generally limited to private scholarly research or when
authorized for teaching purposes, and there may be other and more
strict restrictions.  The list of available texts is pretty long.

Beyond that there's the Open Book Initiative led by Barry Shein,
I don't know what sort of a corpus he has available & how extensive
it is, and also something called Project Gutenberg.  I'll pass on
extensive descriptions of each cause I don't have them.

--Ed

Edward Vielmetti, U of Michigan math dept <emv@math.lsa.umich.edu>
moderator, comp.archives