GenBank-Updates@genbank.bio.net (09/22/90)
LOCUS       EPHCOLF1A    2895 bp ds-DNA             INV       22-SEP-1990
DEFINITION  Fresh-water sponge Emf1 alpha collagen (COLF1) gene.
ACCESSION   M34640
KEYWORDS    collagen; extracellular matrix; fibrillar collagen.
SOURCE      Fresh-water sponge 3-day-old culture DNA.
  ORGANISM  Ephydatia mulleri
            Eukaryota; Animalia; Parazoa; Porifera; Demospongiae;
            Ceractinomorpha; Haplosclerida; Spongillidae.
REFERENCE   1  (bases 1 to 2895)
  AUTHORS   Exposito,J.-Y. and Garrone,R.
  TITLE     Characterization of a fibrillar collagen gene in sponges reveals
            the early evolutionary appearance of two collagen gene families
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87, 6669-6673 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by R.Garrone, 25-MAY-1990, for release after publication.
FEATURES      from to/span    description
    pept        <1      81    alpha collagen
               153     260    alpha collagen
               534     590    alpha collagen
               680     733    alpha collagen
               789     842    alpha collagen
               899    1114    alpha collagen
              1180    1233    alpha collagen
              1337    1498    alpha collagen
              1595    1720    alpha collagen
              1815    1832    alpha collagen
              2022    2744    alpha collagen
    IVS         82     152    intron 1
    IVS        261     533    intron 2
    IVS        591     679    intron 3
    IVS        734     788    intron 4
    IVS        843     898    intron 5
    IVS       1115    1179    intron 6
    IVS       1234    1336    intron 7
    IVS       1499    1594    intron 8
    IVS       1721    1814    intron 9
    IVS       1833    2021    intron 10
BASE COUNT 698 a 761 c 736 g 700 t
ORIGIN
1 ggaattcaag gagtaccagg accgaatgga gatgttggcc cagctggacc cacaggccct
61 gctggattag atggagcccc agtacgttgg ctcatctccc ctgagcttaa gagtttttaa
121 gagcatgtgg ggtttatctc attgcctggc agggagccca aggtcctgat ggagagcctg
181 gactacctgg cttgcctggt cagtctggta agagtggagc ttctggacag cctggagtcc
241 ctggtccagt gggagcagct gtgagtgtgt ctgccagagt gcatacccac atgtgcacac
301 acacacacca tgtgcacaca cacacatgca cacatgcaca cgcacatgta tacacacaca
361 tacacacaca catacataca cacacacatg cacacacata cacacacaca tacacacaca
421 tacacacaca tacacgcaca catacacgca cattagcagt gcgttcacgt tcaaaaatgg
481 ttgcattcat tcgcagacac cagtacggtt catttatccc catctcccta cagggaaagc
541 ccggatcaat aagaggccag cctggaccac caggaccacc tggtgacctc gtaagtgtct
601 ttgcatcaca tgaccgtcat gtgaccacat ccacgtgact catcacgtga cctcgttttt
661 ctctctccct ccctcacagg gcagaccagg agagagggga gcaaagggtg tgagaggaac
721 gcctggagca cctgtgagtt tacacattct atttgaagca cactgagtta cggagcatat
781 ctgcccaggg ggtggacggt gttgctggca ttgctggagc tattggcttc ccaggaccaa
841 tggttagtta actggctgca agtgaaatgt tgatgcaaag ggtgtttcgt tgccataggg
901 accagatgga gctgctggac cttctggcta tccaggattt gatggtgtgg ccggaaagcc
961 aggaccccag ggggccatgg gaccaaaggg gcaggctggg gagaggggac cccaggggac
1021 accagggacc caaggatcaa agggagtggt tggaccaaag ggagtggttg gacctcaagg
1081 tgacagtgga gacacagggg atgctggaca gaaggtttgt atcttgcata ccactgcata
1141 ccaaaccaag tggtcatctt gttccattca tgttgctagg gagctagagg tacagctggt
1201 tctgttggag ccaagggaac agttggactt cctgtaagca gttatttaac catatcaatt
1261 aacccataaa ataattcagt aaattgagag ggtatggatt tgacatgtgc ctctcccttt
1321 tcctcttata tcccagggca accaagggcc ccaagggcct gctggtctga agggagtgaa
1381 gggagagaaa ggagaggttg gagacaaggg aatccttggt cctgatggag acaagggacc
1441 aacaggcatg tcaggtgatg caggaccagc tggacccatt ggtgatgctg gtatccaggt
1501 accgcacagc aagcatccca ttggttgttg tgtgtgttca tttggtgtgc atcacctctc
1561 ctcctccctc ccctccccac cttgatgccc atagggtcca ccaggacagg atggacccac
1621 gggggcccaa ggtccccgag gaggtcaagg tccaaagggc ccggcaggag cagttggtga
1681 tgttggtgat cgtgggtcaa ctggaccagc tggacctcct gtgagttgcc atggtgttgc
1741 catggtgttg ccatggtgct acttccgtgt ttactgacac tcttgattta ttcctcttct
1801 gtttcgaaca ccagggaccg cctggaccaa ctgtgagttt gttgcaacta gcccaatagg
1861 ttctctgtct tacgcaactc ccccttcctc acttgccctc ccccttcctc acttgccctc
1921 ccccttcctc acttgccctc cccttcctca tttgccctcc ccttcctcac ttgccctccc
1981 cctccccact gtctccctca ctcacatgct cttctgaata gggtggtggc attatcctgg
2041 ttcccgttaa tgatcaaaat cctaccagaa gtccagtttc aggttccgtg ttctatcgcg
2101 ggcaagctga ggagacagat gtcaatctgg gatctgttgc agatgtgatt gaactgcaca
2161 agaagctgca acacctcaag agccccacag gcaccaagga ctcgccagca aggagctgcc
2221 atgacctgtt cctagaggac aattccacct cggatgggta ctactggatt gatcccaatg
2281 gtggttgcat cggggatgct gtcaaggtgt tctgtaattt cactggaggt gtacagcaga
2341 cttgcatctc tgcaacaaag aacgctggtg atctgaagag ctggtccggc cattcaatct
2401 ggttcagtga catgctagga gggttcaagc tcacctatga catcagcagg tcccagctgc
2461 agttcattcg tgctgcctct cgccatgctg ttcaatcctt cacttacaag tgccgcaact
2521 cagctgcagc tgtcatattc cgcactcaag ataacaagga gattgctgcc aacaaggtga
2581 cctacgatgg ctgcaagtca agaccatctg ttccagatgc tgcttttgtt gccgtggaga
2641 ctaagagggt ggagcaattg cccatcaggg attttgcctc cagtgacatt gctggtcagc
2701 atcaagagtt tggctttgag atgggtccag cctgcttcta ctaagcatac tgaaactaat
2761 aacagtttga tgtgtattgt tgtaacttag ataccaacgt ttgaacgctt cgaaaattgt
2821 acatgtactt cataactaca tgtaagtgta tttatctcca gtacaaaaac attatttatt
2881 ttgtgtcttc aaaaa
//
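
The pept and IVS spans in the FEATURES table of this entry can be cross-checked against each other and against the BASE COUNT line. The following is a minimal Python sketch of such a check, not part of the GenBank entry itself. The coordinates are copied by hand from the table (1-based, inclusive), and the helper names spans_are_contiguous, total_length and splice are hypothetical names chosen for illustration.

```python
# Spans copied from the FEATURES table of entry M34640 (1-based, inclusive).
PEPT = [(1, 81), (153, 260), (534, 590), (680, 733), (789, 842),
        (899, 1114), (1180, 1233), (1337, 1498), (1595, 1720),
        (1815, 1832), (2022, 2744)]
IVS = [(82, 152), (261, 533), (591, 679), (734, 788), (843, 898),
       (1115, 1179), (1234, 1336), (1499, 1594), (1721, 1814),
       (1833, 2021)]
# Values copied from the BASE COUNT and LOCUS lines.
BASE_COUNT = {"a": 698, "c": 761, "g": 736, "t": 700}
LOCUS_LENGTH = 2895


def spans_are_contiguous(spans):
    """Return True if, once sorted, each span starts right after the previous one ends."""
    ordered = sorted(spans)
    return all(nxt[0] == cur[1] + 1 for cur, nxt in zip(ordered, ordered[1:]))


def total_length(spans):
    """Sum the lengths of 1-based inclusive spans."""
    return sum(end - start + 1 for start, end in spans)


def splice(sequence, spans):
    """Join the 1-based inclusive spans of a sequence string, in the order given."""
    return "".join(sequence[start - 1:end] for start, end in spans)


if __name__ == "__main__":
    # Coding segments and introns should alternate with no gaps or overlaps.
    print(spans_are_contiguous(PEPT + IVS))          # True
    # Total length of the annotated coding segments, in bases.
    print(total_length(PEPT))                        # 1653
    # The per-base counts should add up to the LOCUS length.
    print(sum(BASE_COUNT.values()) == LOCUS_LENGTH)  # True
```

Passing the sequence from the ORIGIN block (with position numbers and spaces removed) to splice together with PEPT would return the joined coding segments. Because the first segment is annotated as partial (<1), the result would be an incomplete coding sequence.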