[comp.archives.admin] Description programs

gnat@kauri.vuw.ac.nz (Nathan Torkington) (06/04/91)

What programs are people using to index their archives?  I am currently
using "Describe" by Tim Cook.  I will include the README after my .sig,
and I believe it is ftp-able from admin.viccol.edu.au - it is for Unix.

It is a good program, and gives excellent results - even if it required
a little hacking to get to work on our platform :-)  It allows you to
store a text string in a hidden dbm file and then use 'dls' to give the
descriptions in ls form.  Heck, read the README and find out what it
does ... 

What are other people using?

[Please note, you happy FTP-ers, that the archive site I maintain does
*NOT* want overseas FTPing.  We have nothing which cannot be gotten
faster and easier from America or other sites - besides which, we pay
for overseas traffic.  It is for internal New Zealand use only ...]

Nat.
--
Nathan Torkington                           Contracting to CSC,
Internet: gnat@kauri.vuw.ac.nz              Victoria University Of Wellington,
Vocal Abuse: +64 4 853 881 (post-chronos)   P.O. Box 600, New Zealand.
This is not an official communication of any part of Victoria University.

--README FOLLOWS--
Description
-----------
Have you ever wondered what on earth "xfrg2.2.tar.Z" was?  Are you
wondering whether "foo" is really a scratch file?  Does it ashame you
especially, seeing that these files were created by you?

I have the solution for you...  File Descriptions.

I'm sure some operating system or another once had the ability to set
a descriptive comment on a file, but Unix wasn't one of them.  It is
possible, however, to retro-fit onto Unix an implementation that works
quite well.

In this package, I have provided a utility to set/delete/list file
descriptions (describe), and a directory listing utility that shows
any file descriptions that are set next to the respective files (dls).

Implementation
--------------
The storage/retrieval of file-descriptions is implemented in the
following way:

	All file descriptions are stored in a DBM file called .desc,
	in  the same directory as the files described.  This in fact
	means that you have two files, .desc.pag and .desc.dir.

	Each description is indexed by the name of the file (excluding
	any path information).  To provide a better chance of finding
	the description after the file has been renamed (within the
	same directory), the name is also indexed by the inode-number.
	This means that the file's description will still be found
	after it has been renamed (within the same directory) or
	re-written by an editor.  To prevent the two indexes from
	becoming mixed up, whenever a name/description pair are stored
	with a name that is the same size as an inode-number, the name
	is stored with a trailing null byte.
	

Purpose
-------
My reason for coming up with this idea is simple.  I got tired of
logging on to some distant site's anonymous FTP area, and coming
across something like:

-rw-r--r--  1 root       214807 May 22  1989 3c503.tar.Z
-rw-r--r--  1 bin         27751 Aug 22 16:35 bonnie
-rw-rw-r--  1 uucp        65383 Aug 23 02:54 dynafeed.tar.Z
-rw-r--r--  1 bin         10444 Nov  3 21:58 fpipe
-rw-r--r--  1 bin         47138 Aug 22 16:41 opaque	(good name!)
-rw-r--r--  1 don        683759 May 29 23:21 psiber.tar.Z

You know?  You sit there and think ... "some of those sound
interesting, but it wouldn't be nice for me to download them just to
find out what they are...".

If there is no other demand for describe/dls, there should be as a
replacement for ls in anonymous FTP areas.  And that's how simple it
is, once you have dls installed, simply copy it to ~ftp/bin/ls, and
the FTP daemon will use it instead of standard ls for anonymous
logins.  The output of dls was actually designed with anonymous FTP
use in mind (who cares who owns the file or which group it belongs
to?).

Of course, the site administrator has to provide meaningful
descriptions for what has been put in the anonymous FTP area.  I don't
think it would be too hard to generate descriptions files from an
index of something like comp.sources.unix, though.  A dozen lines of
perl should do it.

An Example
----------
As an example of what an anonymous FTP site that uses dls looks like,
try my back yard, admin.viccol.edu.au.

Copyright
---------
The whole system is Copyright (c) Tim Cook 1991, apart from a courtesy
copy of strpbrk.c which is Copyright (c) 1985 Regents of the
University of California.  All together, the package can be
distributed provided no profit is made from any distribution, and all
copyright notices remain intact.

The Author
----------
Tim Cook, Systems Administrator, Victoria College Computer Services.

I am the administrator of a Sequent S27 (Symmetry),
admin.viccol.edu.au, which runs administrative systems under Oracle.

Could I ask all those who install dls in their Anonymous FTP areas to
drop me a line?  Any criticisms, suggestions gladly accepted.  I am
quite interested in what people think of this idea.

bremner@cs.sfu.ca (David Bremner) (06/05/91)

In a related vein, I have started keeping bits I grab from
comp.archives in a full text database ( Liam Quin's lq-text ).  With
the compressed file support I hacked in, this takes about the same
amount of space as storing an uncompressed copy of the text.  I think
that something like this could provide a very useful service; like
archie's new whatis facility, but more powerful.

In case your wondering, I'm not ready to make my database available;
primarily I don't have the resources, also, my complete database is
only a couple of megabytes; I'm pretty selective.  However, with this
caveat, I might be interested in helping get something like this set
up somewhere.

What do people think, is it worth the trouble to be able to type "news
reader" and pull up the readmes for a bunch of news readers?

David
-- 
bremner@cs.sfu.ca 				          ubc-cs!fornax!bremner

cmf851@anu.oz.au (Albert Langer) (06/05/91)

In article <1991Jun4.173107.4964@cs.sfu.ca> bremner@cs.sfu.ca (David Bremner) writes:

>What do people think, is it worth the trouble to be able to type "news
>reader" and pull up the readmes for a bunch of news readers?

YES YES YES PLEASE!

(Could you make available your mods to lq-text and/or scripts for
inputting data from comp.archives etc?)

--
Opinions disclaimed (Authoritative answer from opinion server)
Header reply address wrong. Use cmf851@csc2.anu.edu.au