GenBank-Updates@genbank.bio.net (05/29/91)
LOCUS MSVMSVFBR 3811 bp ds-DNA VRL 28-MAY-1991 DEFINITION FBR-murine osteosarcoma provirus genome ACCESSION X03347 KEYWORDS genome; inverted repeat; long terminal repeat; provirus. SOURCE Murine osteosarcoma virus DNA. ORGANISM Murine osteosarcoma virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Oncovirinae; Type C oncovirus group; Mammalian type C oncoviruses. REFERENCE 1 (bases 1 to 3811) AUTHORS Van Beveren,C., Enami,S., Curran,T. and Verma,I.M. TITLE FBR murine osteosarcoma virus: II. Nucleotide sequence of the provirus reveals that the genome contains sequences acquired from cellular genes JOURNAL Virology 135, 229-243 (1984) STANDARD full automatic COMMENT From EMBL entry REMSVFBR; dated 25-APR-1990. FEATURES Location/Qualifiers cellular <1..10 /note="flanking rat DNA" repeat_region 7..10 /note="4-base duplication" repeat_region 11..587 /note="LTR-long terminal repeat (U3-R-U5)" misc_feature 11..443 /note="U3 region" repeat_unit 11..21 /note="inverted repeat A" repeat_region 119..153 /note="imp. direct repeat 1" repeat_region 146..176 /note="imp. direct repeat 2" repeat_region 171..205 /note="imp. direct repeat 1" repeat_region 243..273 /note="imp. direct repeat 2" promoter 363..367 /note="CAT box" promoter 414..420 /note="TATA box" misc_feature 444..511 /note="R region" misc_RNA 444..444 /note="cap site" misc_feature 512..587 /note="U5 region" repeat_unit 574..587 /note="inverted repeat A'" misc_feature 588..604 /note="primer binding site" misc_feature 1080..2009 /note="gag-derived sequence" CDS 1080..2741 /note="P75 gag-fos fusion protein (aa 1-554)" /codon_start=1080 misc_feature 1152..1159 /note="pot. N-linked glycosylation site" misc_feature 2010..2718 /note="fos-derived sequence" misc_feature 2719..3168 /note="fox-derived sequence" misc_feature 3169..3178 /note="env-derived sequence" misc_feature 3226..3659 /note="U3 region" repeat_unit 3226..3238 /note="inverted repeat B" repeat_region 3226..3238 /note="LTR" misc_feature 3660..3727 /note="R region" misc_feature 3706..3711 /note="polyadenylation signal" misc_feature 3727..3727 /note="polyadenylation site" misc_feature 3728..3801 /note="U5 region" repeat_unit 3790..3801 /note="inverted repeat B'" repeat_region 3802..3805 /note="4-base duplication" cellular 3805..>3811 /note="flanking rat DNA" BASE COUNT 888 a 1094 c 968 g 861 t ORIGIN 1 cgggctgtat tgaaagaccc cttcataagg cttagccagc taactgcagt aacgccattt 61 tgcaaggcat gggaaaatac cagagctgat gttctcagaa aaacaagaac aaggaggtaa 121 agagaggctg gaaagtaccg ggactagggc caagaacaaa tggttcccag aaatagaggc 181 tggaaagtac cgggactagg gccaaacagg atatctgtgg tcaagcacta gggccccggc 241 ccagggccaa gaacagatgg ttcccagaaa tagctaaaac aacaacagtt tcaagagacc 301 cagaaactgt ctcaaggttc cccagatgac cggggatcaa ccccaagcct catttaaact 361 aaccaatcag ctcgcttctc gcttctgtac ccgcgcttat tgctgcccag ctctataaaa 421 agggtaagaa ccccacactc ggcgcgccag tcctccgata gactgagtcg cccgggtacc 481 cgtgtatcca ataaagcctt ttgctgttgc atccgaatcg tggtctcgct gatccttggg 541 agggtctcct cagagtgatt gactgcccag cctgggggtc tttcatttgg gggctcgtcc 601 gggatccgga gacccccgcc cagggaccac cgacccaccg tcgggaggta agctggccag 661 cggtcgtttt gtctccgtct ctgtctttgt gcgtgtgtgt gtgtgccggc atctaatctt 721 tgcgcctgcg tctgtatctg tactagttag ctaactagat ctgtatctgg cggttccgtg 781 aaagaactga cgagttcgta ttcccggccg cagccctggg agacgtctca gaggcatcgg 841 gggccatttt tgtggcccaa tctgtatctg agaacccgac ccgtttcgga ctctttggag 901 cttctccatt gactgaagga tacgtggttc tattgggcgg cgaggggccg aaacgctcct 961 ctcctccatc tgaatttttg ctttcggttt tccgccgaaa ccgcgccgcg cgtcttatct 1021 gtctcagtgt tattttgtca tttgtctgtt cgttattgtt ttggaccgtt tctaaaaata 1081 tgggacagac cgtaaccact cctttgagtc tgaccctaga acactgggga gacgtccagc 1141 gcattgcgtc caaccagtcc gtggacgtca agaagagacg ctgggtcacc ttctgttctg 1201 ccgagtggcc aactttcgat gtggggtggc cgcaagatgg tacttttaat ttggacatta 1261 ttttacaggt taaatctaag gtgttctctc ccggtcccca cggacacccg gatcaggtcc 1321 catacattgt cacctgggag gctattgcct atgaaccccc tccgtgggtc aaaccttttg 1381 tctctcccaa actctccccc tctccaaccg ctcccatcct cccatccggt ccttcgaccc 1441 aacctccgcc ccgatctgcc ctttaccctg cccttacccc ctctataaaa cccagacctt 1501 ctaaacctca ggttctctcc gatgacggcg gacctctcat tgaccttctc acagaagacc 1561 ctccgccgta cggagaacag ggaccgtcct cctctgacgg ggatggcgac agagaagagg 1621 ccacctccac ttctgagatt cctgccccct ctcccatggt gtctcgcctg cggggcaaaa 1681 gagacccccc cgcggcagat tccactacct ctcgggcttt cccactccgt ttggggggta 1741 atggtcagaa aaataataac ccttcctttt ctgaagatcc aggtaaattg actgccttaa 1801 tcgagtctgt cctcaccacc caccagccta cctgggacga ctgtcagcag ttgctgggga 1861 ctctgctgac aggagaagaa aagcagcggg tgctcctgga ggccagaaag gcagtccggg 1921 gcaacgatgg gcgccccacc cagatgccta atgaagtcaa tgccgccttc cccctcgaac 1981 gtcccgattg ggattataca actcctgaag acagcctttc ctactaccat tccccagccg 2041 actccttctc cagcatgggc tctcctgtca acacacagga cttttgcgca gatctgtccg 2101 tctctagtgc caactttatc cccacggaga cagccatctc caccagccct gacctgcagt 2161 ggctggtgca gcccactctg gtctcctccg tggccccatc gcagaccaga gcgccccatc 2221 cttacggact ccccacccag tctgctgggg cttacgccag agcgggaatg gtgaagaccg 2281 tgtcaggagg cagagcgcag agcatcggca gaaggggcaa agtagagcag ctatctcctg 2341 aagaggaagt gaaacggaga atccgaagag aacggaataa gatggctgca gccaagtgcc 2401 ggaatcggag gagggagctg acagatacac tccaagcgga gacagatcaa cttgaagatg 2461 agaagtctgc gttgcagact gagattgcca atctgctgaa agagaaggaa aaactggagt 2521 ttattttggc agcccaccga cctgcctgca agatccccga tgaccttggc ttcccagagg 2581 agatgtctgt ggcctcccta gatttgactg gaggtctgct gccccttctc aacgaccctg 2641 agcccaagcc atccttggag ccagtcaaga gcagctttga tgacttcttg tttccggcat 2701 catctggaca cagtggcttt attagcatgg cagggtggca ataggactta gaaattggca 2761 ttggggccct tcttcttccc taaggtgggc acaacattga caaagcgccg gttgtactgc 2821 attcgcctct tggcccggcc tgtcttcttc ttcttctttt cctgtttggc caccttggga 2881 gtctgacctc tcacttttcc agcccgagcc aggaaaccgt gaactttacc tcccagcatg 2941 cggcctgcta cttccagagt ggtcagggcc tctacgccac actggcctag ggtggcctca 3001 tcctcctgcg gcgagcctgc cagaagcacg acttgatcgt cgggggcaat gccttccagg 3061 gaggtcacat gatctttgat ctgggcgacc gtcccctggc cggtcacctc gagggtgtgt 3121 agttcctggg cgcggacaaa gagctgcatg ttggctactt aagacagtaa aagattaaaa 3181 atcacgtgaa taaaagattt tattcagttt acagaaagag gggggaatga aagacccctt 3241 cataaggctt agccagctaa ctgcagtaac gccattttgc aaggcatggg aaaataccag 3301 agctgatgtt ctcagaaaaa caagaacaag gaggtaaaga gaggctggaa agtaccggga 3361 ctagggccaa gaacaaatgg ttcccagaaa tagaggctgg aaagtaccgg gactagggcc 3421 aaacaggata tctgtggtca agcactaggg ccccggccag ggccaagaac agatggttcc 3481 cagaaatagc taaaacaaca acagtttcaa gagacccaga aactgtctca aggttcccca 3541 gatgaccggg gatcaacccc aagcctcatt taaactaacc aatcagctcg cttctcgctt 3601 ctgtacccgc gcttattgct gcccagctct ataaaaaggg taagaacccc acactcggcg 3661 cgccagtcct ccgatagact gagtcgcccg ggtacccgtg tatccaataa agccttttgc 3721 tgttgcatcc gaatcgtggt ctcgctgatc cttgggaggg tctcctcaga gtgattgact 3781 gcccagcctg ggggtctttc agtatgtaat a //