engeje@UTS.UNI-C.DK (Jacob Engelbrecht) (05/02/91)
Are there statistics available on the number of sequences and bp in every GenBank release? And in other databases? I would prefer them in electronic form.

--
Jacob Engelbrecht
Department of Structural Properties of Materials
The Technical University of Denmark
e-mail: engeje@uts.uni-c.dk
kristoff@genbank.bio.net (David Kristofferson) (05/03/91)
> Are there statistics available on the number of sequences and bp
> in every GenBank release? And in other databases?
> I would prefer them in electronic form.

Yes, this information is available in the release notes accompanying each GenBank release, along with further statistical breakdowns. You can get the notes by anonymous FTP from genbank.bio.net [134.172.1.160] as the compressed file pub/db/gb-rel67/gbrel.txt.Z, which you can unpack with the UNIX uncompress utility. I have also put an uncompressed version in pub/doc/gbrel.txt. Unfortunately, the file is rather large to retrieve by e-mail.

If all you need are the two numbers that you mentioned, here are the first few lines of the file with that info:

GBREL.TXT
Genetic Sequence Data Bank
15 March 1991
GenBank(R) Release 67.0
Distribution Tape Release Notes
43903 loci, 55169276 bases, from 53763 reported sequences
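
For anyone who wants these figures in electronic form for several releases, the summary line can be pulled out of a local copy of the release notes with a few lines of Python. This is only a minimal sketch: the function name parse_release_summary and the pattern are illustrative, and it assumes that other releases word the summary line the same way as the Release 67.0 excerpt above, which has not been checked here.

```python
import re

# Pattern for the summary line as it appears in the Release 67.0 excerpt.
# Assumption: other releases use the same wording for this line.
SUMMARY_RE = re.compile(
    r"(?P<loci>\d+)\s+loci,\s+"
    r"(?P<bases>\d+)\s+bases,\s+"
    r"from\s+(?P<sequences>\d+)\s+reported\s+sequences"
)


def parse_release_summary(text):
    """Return (loci, bases, reported_sequences) from release-notes text.

    Returns None if no summary line in the expected wording is found.
    """
    match = SUMMARY_RE.search(text)
    if match is None:
        return None
    return (
        int(match["loci"]),
        int(match["bases"]),
        int(match["sequences"]),
    )


# The header lines quoted in the post above.
header = """GBREL.TXT
Genetic Sequence Data Bank
15 March 1991
GenBank(R) Release 67.0
Distribution Tape Release Notes
43903 loci, 55169276 bases, from 53763 reported sequences
"""

print(parse_release_summary(header))  # (43903, 55169276, 53763)
```

To apply it to a downloaded, uncompressed gbrel.txt, read the first few kilobytes of the file and pass that text to the function; the summary line sits near the top, so the whole file need not be loaded.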