[rec.hunting] Administrivia - The Rec.Hunting Archives

archive@unmvax.cs.unm.edu (Rec.Hunting Archives) (03/28/91)

[Note: this is NOT the text for a monthly posting - that will follow later]


                 Introduction to the Rec.Hunting Archives
                           REQUEST@THEREV.UUCP

          3/25/91 in the wee hours when I should be doing taxes
            or sleeping, instead of tinkering at the computer


    The official Rec.Hunting archives are being maintained long ago, in a
galaxy far, far away.  TheRev.UUCP is a small, privately owned UseNet node
in Los Alamos, New Mexico.  In Los Alamos, one may find a) the largest
single collection of Cray Research mainframes housed under a single roof,
b) this dinky generic 80286 based machine that I use, and c) a 1.43 gigawatt
generator.  The advantage to the first is the incredible speed and
computing power that a researcher can bring to bear on a scientific
problem.  The advantage to the second is that there is no boss or company
that tells me that I've exceeded my disk or resource quota.  There is, as
yet, no advantage to the third - but I had a great time watching them set
it on it's foundation.

                              ARCHIVE NAMING

    The archives are indexed and grouped by posting date, and accessed via
a mail-server, REQUEST@THEREV.UUCP.  The archive name format is ddMMMyy -
for example, any articles with a posting date of March 13, 1991 would be
archived as 13Mar91.  At the end of each month, all articles posted for
that month will be bundled into a monthly archive.  I maintain an index for
each archive, as well as a master index that lists all the articles and
their archive names.

                           MAIL-SERVER COMMANDS

    The archives are retrieved by commands IN THE BODY of an e-mail message
sent to REQUEST@THEREV.UUCP.  You may put anything or nothing in the
Subject line of the mail; the server will pay no attention to it.  The
server currently recognizes two commands, the send command and the special
command.  Here's a list of examples for the send command:

    Command                  What is [supposed to be] sent
    -------                  -----------------------------
    send help                This illuminating and informative document
    send charter             The Rec.Hunting NewsGroup Charter
    send index               The master index of ALL rec.hunting articles,
                             and the archive name where they can be found.
    send list                The list of what's available here.
    send <archive>           A specific archive file
    send <archive>.idx       The index for a specific archive file
    send <article>           A specific article in the archives
    send <message_id>        The article referenced by the message ID
    send money               Has no effect - same as with parents

    Why all the variations?  Because I'm a hacker!  The complexity in the
archives is in the indices, not in the archives themselves.  The format of
the indices allows me to search for and extract just about anything without
much difficulty - the mail server is written in Borland's Turbo Pascal 5.5,
so it's simply a question of indexing the information efficiently.  See
_HOW IT WORKS FAIRLY SIMPLY_ for a more detailed and probably boring
explanation.

    The 'special' command tells the server to forward your request to my
personal account for special handling - it means you want me to break my
back extracting strange information for you without charging you anything
for all the time it's going to take.  So use it advisedly.  Here's an

    special - Send me all articles on steel shot.

    All that really matters to the server is that the first word on the
line is a recognized command.  If you're going to make special requests, do
it in a seperate request.  As soon as the server sees the 'special'
command, it forwards the mail to me.  If any other commands were in the
mail, they will NOT be processed.

    Now if that's not complicated enough, I'm sure I can come up with
something more obtuse.  It actually works fairly simply.


                        HOW IT WORKS FAIRLY SIMPLY

    As articles come in, they are scanned and indexed by date.  The index
lists the article number, article size, and article subject.  The article
itself is packed into an archive file, using Phil Katz's PKZIP.  The index
information is maintained in several files.  There is the master index,
which contains information on all the articles posted.  There is a daily
archive index.  There is a monthly archive index.  These three indices are
available from the server.  There is also a server index that is not
available, which includes the Message-ID from the original posting, and
tracks the References line in the header as well.

    This allows the archive client (That's you!) a great deal of control in
selecting articles from the archive.  You can get an entire month's worth
of articles, say for March of 1991 by sending the command:

                             send Mar91

    You can request articles from specific dates:

                             send 13Mar91
                             send 16Mar91
                             send 12Jun91

    You can request single articles:

                             send 1025
                             send <429@erb1.engr.wisc.edu>

    Note that the article number is assigned by MY machine, not by yours.
You can also get just the indices for the first two examples:

                             send Mar91.idx
                             send 13Mar91.idx

All archive requests should be sent to REQUEST@THEREV.UUCP.  Yes, I know
I've said that before.  I HATE having to search through an entire
mail-server document to find the server's address.


                          COMMENTS AND QUESTIONS

    Comments, questions, and suggestions for improving the archives or mail
server should be sent to archive@therev.uucp.  Note that is NOT
request@therev.uucp.

                            FUTURE DIRECTIONS

    The next "improvement" to the server will be (I hope) a thread
extractor, which will permit a client to request all or part of an ongoing
discussion.

    ---------------------------------------------------------------------
              Mark A. Foster - Keeper of the Rec.Hunting archives
                   Archive requests to: request@therev.UUCP
                 Administrative notes to: archive@therev.UUCP
                           For not-so-smart mailers:
               bbx!therev!{request | archive}@unmvax.cs.unm.edu
    ---------------------------------------------------------------------