[bionet.molbio.proteins] Answer to query about protein d.b.

BAIROCH%CGECMU51@CUNYVM.CUNY.EDU (Amos Bairoch) (07/28/89)

<Dear Collegues,
<     I am wondering whether there is any protein data bank available
<which contains information on molecular weight, properties ect. rather
<than mere sequence. E. g. one would like to look for a protein with Mr.
<75 kD which has ATP binding activity or one looks for information on
<all helicases etc. Please send information concerning either the data
<bank itself or people who would be able to provide further help.
<                                  yours: Dieter Blaas


The SWISS-PROT protein data bank contains all these informations.

The molecular weight data is on the "SQ" line-type, the info on properties
(function) of the sequence are on the "CC" and "KW" lines. To show you what
I mean I include an example entry that fit with your example query: an
helicase, ATP-binding of around 75 Kd.

ID   IF4A$MOUSE     STANDARD;      PRT;   390 AA.
AC   P04765;
DT   13-AUG-1987  (REL. 05, CREATED)
DT   13-AUG-1987  (REL. 05, LAST SEQUENCE UPDATE)
DT   01-JUL-1989  (REL. 11, LAST ANNOTATION UPDATE)
DE   INITIATION FACTOR EIF-4A.
OS   MOUSE (MUS MUSCULUS).
OC   EUKARYOTA; METAZOA; CHORDATA; VERTEBRATA; TETRAPODA; MAMMALIA;
OC   EUTHERIA; RODENTIA.
RN   [1] (C19 HYBRIDOMA, SEQUENCE FROM N.A.)
RA   NIELSEN P.J., MCMASTER G.K., TRACHSEL H.;
RL   NUCLEIC ACIDS RES. 13:6867-6880(1985).
CC   -!- FUNCTION: EIF-4A IS BOTH A SUBUNIT OF A HIGH MOLECULAR WEIGHT
CC       PROTEIN COMPLEX INVOLVED IN CAP RECOGNITION AND IS REQUIRED AS A
CC       SINGLE POLYPEPTIDE CHAIN FOR MRNA BINDING TO RIBOSOME.
CC   -!- FUNCTION: EIF-4A IS AN ATP-DEPENDENT SINGLE STRANDED DNA-BINDING
CC       PROTEIN.
CC   -!- IT IS POSSIBLE THAT THE PROTEIN START AT POSITION 20.
CC   -!- SIMILARITY: TO OTHER EIF4-A TYPE ATP-BINDING HELICASES SUCH AS
CC       HUMAN P68 AND DROSOPHILA VASA PROTEIN.
DR   EMBL; X03040; MMEIF4AL.
DR   EMBL; X03039; MMEIF4AS.
DR   PIR; A24267; FIMS4A.
KW   INITIATION FACTOR; PROTEIN BIOSYNTHESIS; ATP-BINDING; RNA-BINDING.
FT   NP_BIND      60     67       ATP (BY SIMILARITY).
SQ   SEQUENCE   390 AA;  44492 MW;  765480 CN;
     MEPEGVIESN WNEIVDSFDD MNLSESLLRG IYAYGFEKPS AIQQRAILPC IKGYDVIAQA
     QSGTGKTATF AISILQQIEL DLKATQALVL APTRELAQQI QKVVMALGDY MGASCHACIG
     GTNVRAEVQK LQMEAPHIIV GTPGRVFDML NRRYLSPKYI KMFVLDEADE MLSRGFKDQI
     YDIFQKLNSN TQVVLLSATM PSDVLEVTKK FMRDPIRILV KKEELTLEGI RQFYINVERE
     EWKLDTLCDL YETLTITQAV IFINTRRKVD WLTEKMHARD FTVSAMHGDM DQKERDVIMR
     EFRSGSSRVL ITTDLLARGI DVQQVSLVIN YDLPTNRENY IHRIGRGGRF GRKGVAINMV
     TEEDKRTLRD IETFYNTSIE EMPLNVADLI
//

Amos Bairoch
Dept. Biochimie Medicale
CMU
1, rue Michel Servet
1211 Geneva 4
Switzerland