pgil%histone@LANL.GOV (Paul Gilna) (11/16/89)
Some questions on data submission have traversed the net recently, so I thought I would put in my penny's worth. By way of introduction, I am Paul Gilna, Biology Domain Leader at GenBank, Los Alamos. GenBank, EMBL and DDBJ have collectively been involved in collaborations with the Journals over the last two or three years to establish support for direct submission of sequence data to the databanks. Initially, this collaboration took the form of a request for submission on the part of the Journals. Today, about 20 journals have adopted a procedure whereby submission of data to the databanks is a requisite component of the publication process. For GenBank, this has resulted in a steady increase in the amount of data submitted directly from the community. The proportion now stands at at least 65% and is climbing. In theory, we have agreements in place that cover 90% of the data we currently scan, and we expect more journals to follow suit. About 75% of direct submissions are coming to us in electronic form either via e-mail or on floppy. Up until now the primary tool for electronic submission has been the Data Submisson Form held in common amongst the three nucleotide and three protein sequence databases; this has been available as an ASCII file via e-mail, on floppy or on the tape distributions. Recently, our colleagues at IntelliGenetics have introduced AUTHORIN, a PC-based (Mac version available next year) software package designed to assist the sequencing community in the generation of a database submission. The program has the added advantage of generating an automated database transaction which can be handled via e-mail. (I am assuming that more on this subject will follow from the folks at Intelligenetics). This program is freely available, just call Yuki Abe at (415) 962 7364, or e-mail at abe@genbank.bio.net, or write to IntelliGenetics, 700 East El Camino Real, Mountain View, CA 94940. On-line distribution (anonymous ftp, servers etc.) policies are currently under consideration. All electronic submissions for GenBank can be sent to gb-sub%life@lanl.gov Submissions on floppy disc should be sent to GenBank Submissions T-10 Mail Stop K710 Los Alamos National Laboratory Los Alamos, New Mexico, 87545 We can read all formats of IBM or Mac Diskette. We can read most Word Processor formats, but would strongly reccomend that files be saved as ASCII (text) files. (the AUTHORIN output is an ASCII file) Call us at (505) 665 2177 if need help or more information on submissions Paul Gilna, Ph.D. GenBank Biology Domain Leader. pgil%histone@lanl.gov (505) 665 2177