[bionet.general] Data Submission

pgil%histone@LANL.GOV (Paul Gilna) (11/16/89)

Some questions on data submission have traversed the net recently, so
I thought I would put in my penny's worth.

By way of introduction, I am Paul Gilna, Biology Domain Leader at GenBank,
Los Alamos.

GenBank, EMBL and DDBJ have collectively been involved in 
collaborations with the Journals over the last
two or three years to establish support for direct submission 
of sequence data to the databanks.

Initially, this collaboration took the form of a request for submission
on the part of the Journals.  Today, about 20 journals have adopted
a procedure whereby submission of data to the databanks is a requisite
component of the publication process.

For GenBank, this has resulted in a steady increase in the amount
of data submitted directly from the community.  The proportion now
stands at at least 65% and is climbing.  In theory, we have agreements
in place that cover 90% of the data we currently scan, and we expect
more journals to follow suit.

About 75% of direct submissions are coming to us in electronic form
either via e-mail or on floppy.  Up until now the primary tool for
electronic submission has been the Data Submisson Form held in common
amongst the three nucleotide and three protein sequence databases;
this has been available as an ASCII file via e-mail, on floppy or
on the tape distributions.

Recently, our colleagues at IntelliGenetics have introduced AUTHORIN, a 
PC-based (Mac version available next year) software package designed 
to assist the sequencing community in the generation of a database submission.
The program has the added advantage of generating an automated
database transaction which can be handled via e-mail.

(I am assuming that more on this subject will follow from the folks
at Intelligenetics).  

This program is freely available, just call
Yuki Abe at (415) 962 7364, or e-mail at abe@genbank.bio.net, or write
to IntelliGenetics, 700 East El Camino Real, Mountain View, CA 94940.
On-line distribution (anonymous ftp, servers etc.) policies are currently
under consideration.

All electronic submissions for GenBank can be sent to 

		gb-sub%life@lanl.gov

Submissions on floppy disc should be sent to

		GenBank Submissions
		T-10 Mail Stop K710
		Los Alamos National Laboratory
		Los Alamos, New Mexico, 87545

		We can read all formats of IBM or Mac Diskette.
		We can read most Word Processor formats, but would
		strongly reccomend that files be saved as ASCII
		(text) files.  (the AUTHORIN output is an ASCII file)

		Call us at (505) 665 2177 if need help or more information
		on submissions



Paul Gilna, Ph.D.
GenBank Biology Domain Leader.

pgil%histone@lanl.gov
(505) 665 2177