[bionet.molbio.genbank] statistics on the number of sequences reported ?

engeje@UTS.UNI-C.DK (Jacob Engelbrecht) (05/02/91)

Are there statistics available on the number of sequences and bp
in every release of the GenBank releases? And in other databases ?
I would prefer them in electronic form.
-- 
Jacob Engelbrecht

Department of Structural Properties of Materials
The Technical University of Denmark
e-mail: engeje@uts.uni-c.dk

kristoff@genbank.bio.net (David Kristofferson) (05/03/91)

> Are there statistics available on the number of sequences and bp
> in every release of the GenBank releases? And in other databases ?
> I would prefer them in electronic form.

Yes, this information is available in the release notes accompanying
each GenBank release along with further statistical breakdowns.  You
can get it by anonymous FTP from genbank.bio.net [134.172.1.160] in
the directory pub/db/gb-rel67/gbrel.txt.Z and use the UNIX uncompress
utility.  I have put an uncompressed version in pub/doc/gbrel.txt.
The file is rather large to use e-mail for its retrieval
unfortunately.  

If all you need are the two numbers that you mentioned, here's the
first few lines of the file with that info:

GBREL.TXT          Genetic Sequence Data Bank
                         15 March 1991

                     GenBank(R) Release 67.0

                 Distribution Tape Release Notes

  43903 loci, 55169276 bases, from 53763 reported sequences