GenBank-Updates@genbank.bio.net (05/29/91)
LOCUS MSVMSVFBR 3811 bp ds-DNA VRL 28-MAY-1991
DEFINITION FBR-murine osteosarcoma provirus genome
ACCESSION X03347
KEYWORDS genome; inverted repeat; long terminal repeat; provirus.
SOURCE Murine osteosarcoma virus DNA.
ORGANISM Murine osteosarcoma virus
Viridae; ss-RNA enveloped viruses; Positive strand RNA virus;
Retroviridae; Oncovirinae; Type C oncovirus group;
Mammalian type C oncoviruses.
REFERENCE 1 (bases 1 to 3811)
AUTHORS Van Beveren,C., Enami,S., Curran,T. and Verma,I.M.
TITLE FBR murine osteosarcoma virus: II. Nucleotide sequence of the
provirus reveals that the genome contains sequences acquired from
cellular genes
JOURNAL Virology 135, 229-243 (1984)
STANDARD full automatic
COMMENT From EMBL entry REMSVFBR; dated 25-APR-1990.
FEATURES Location/Qualifiers
cellular <1..10
/note="flanking rat DNA"
repeat_region 7..10
/note="4-base duplication"
repeat_region 11..587
/note="LTR-long terminal repeat (U3-R-U5)"
misc_feature 11..443
/note="U3 region"
repeat_unit 11..21
/note="inverted repeat A"
repeat_region 119..153
/note="imp. direct repeat 1"
repeat_region 146..176
/note="imp. direct repeat 2"
repeat_region 171..205
/note="imp. direct repeat 1"
repeat_region 243..273
/note="imp. direct repeat 2"
promoter 363..367
/note="CAT box"
promoter 414..420
/note="TATA box"
misc_feature 444..511
/note="R region"
misc_RNA 444..444
/note="cap site"
misc_feature 512..587
/note="U5 region"
repeat_unit 574..587
/note="inverted repeat A'"
misc_feature 588..604
/note="primer binding site"
misc_feature 1080..2009
/note="gag-derived sequence"
CDS 1080..2741
/note="P75 gag-fos fusion protein (aa 1-554)"
/codon_start=1080
misc_feature 1152..1159
/note="pot. N-linked glycosylation site"
misc_feature 2010..2718
/note="fos-derived sequence"
misc_feature 2719..3168
/note="fox-derived sequence"
misc_feature 3169..3178
/note="env-derived sequence"
misc_feature 3226..3659
/note="U3 region"
repeat_unit 3226..3238
/note="inverted repeat B"
repeat_region 3226..3238
/note="LTR"
misc_feature 3660..3727
/note="R region"
misc_feature 3706..3711
/note="polyadenylation signal"
misc_feature 3727..3727
/note="polyadenylation site"
misc_feature 3728..3801
/note="U5 region"
repeat_unit 3790..3801
/note="inverted repeat B'"
repeat_region 3802..3805
/note="4-base duplication"
cellular 3805..>3811
/note="flanking rat DNA"
BASE COUNT 888 a 1094 c 968 g 861 t
ORIGIN
1 cgggctgtat tgaaagaccc cttcataagg cttagccagc taactgcagt aacgccattt
61 tgcaaggcat gggaaaatac cagagctgat gttctcagaa aaacaagaac aaggaggtaa
121 agagaggctg gaaagtaccg ggactagggc caagaacaaa tggttcccag aaatagaggc
181 tggaaagtac cgggactagg gccaaacagg atatctgtgg tcaagcacta gggccccggc
241 ccagggccaa gaacagatgg ttcccagaaa tagctaaaac aacaacagtt tcaagagacc
301 cagaaactgt ctcaaggttc cccagatgac cggggatcaa ccccaagcct catttaaact
361 aaccaatcag ctcgcttctc gcttctgtac ccgcgcttat tgctgcccag ctctataaaa
421 agggtaagaa ccccacactc ggcgcgccag tcctccgata gactgagtcg cccgggtacc
481 cgtgtatcca ataaagcctt ttgctgttgc atccgaatcg tggtctcgct gatccttggg
541 agggtctcct cagagtgatt gactgcccag cctgggggtc tttcatttgg gggctcgtcc
601 gggatccgga gacccccgcc cagggaccac cgacccaccg tcgggaggta agctggccag
661 cggtcgtttt gtctccgtct ctgtctttgt gcgtgtgtgt gtgtgccggc atctaatctt
721 tgcgcctgcg tctgtatctg tactagttag ctaactagat ctgtatctgg cggttccgtg
781 aaagaactga cgagttcgta ttcccggccg cagccctggg agacgtctca gaggcatcgg
841 gggccatttt tgtggcccaa tctgtatctg agaacccgac ccgtttcgga ctctttggag
901 cttctccatt gactgaagga tacgtggttc tattgggcgg cgaggggccg aaacgctcct
961 ctcctccatc tgaatttttg ctttcggttt tccgccgaaa ccgcgccgcg cgtcttatct
1021 gtctcagtgt tattttgtca tttgtctgtt cgttattgtt ttggaccgtt tctaaaaata
1081 tgggacagac cgtaaccact cctttgagtc tgaccctaga acactgggga gacgtccagc
1141 gcattgcgtc caaccagtcc gtggacgtca agaagagacg ctgggtcacc ttctgttctg
1201 ccgagtggcc aactttcgat gtggggtggc cgcaagatgg tacttttaat ttggacatta
1261 ttttacaggt taaatctaag gtgttctctc ccggtcccca cggacacccg gatcaggtcc
1321 catacattgt cacctgggag gctattgcct atgaaccccc tccgtgggtc aaaccttttg
1381 tctctcccaa actctccccc tctccaaccg ctcccatcct cccatccggt ccttcgaccc
1441 aacctccgcc ccgatctgcc ctttaccctg cccttacccc ctctataaaa cccagacctt
1501 ctaaacctca ggttctctcc gatgacggcg gacctctcat tgaccttctc acagaagacc
1561 ctccgccgta cggagaacag ggaccgtcct cctctgacgg ggatggcgac agagaagagg
1621 ccacctccac ttctgagatt cctgccccct ctcccatggt gtctcgcctg cggggcaaaa
1681 gagacccccc cgcggcagat tccactacct ctcgggcttt cccactccgt ttggggggta
1741 atggtcagaa aaataataac ccttcctttt ctgaagatcc aggtaaattg actgccttaa
1801 tcgagtctgt cctcaccacc caccagccta cctgggacga ctgtcagcag ttgctgggga
1861 ctctgctgac aggagaagaa aagcagcggg tgctcctgga ggccagaaag gcagtccggg
1921 gcaacgatgg gcgccccacc cagatgccta atgaagtcaa tgccgccttc cccctcgaac
1981 gtcccgattg ggattataca actcctgaag acagcctttc ctactaccat tccccagccg
2041 actccttctc cagcatgggc tctcctgtca acacacagga cttttgcgca gatctgtccg
2101 tctctagtgc caactttatc cccacggaga cagccatctc caccagccct gacctgcagt
2161 ggctggtgca gcccactctg gtctcctccg tggccccatc gcagaccaga gcgccccatc
2221 cttacggact ccccacccag tctgctgggg cttacgccag agcgggaatg gtgaagaccg
2281 tgtcaggagg cagagcgcag agcatcggca gaaggggcaa agtagagcag ctatctcctg
2341 aagaggaagt gaaacggaga atccgaagag aacggaataa gatggctgca gccaagtgcc
2401 ggaatcggag gagggagctg acagatacac tccaagcgga gacagatcaa cttgaagatg
2461 agaagtctgc gttgcagact gagattgcca atctgctgaa agagaaggaa aaactggagt
2521 ttattttggc agcccaccga cctgcctgca agatccccga tgaccttggc ttcccagagg
2581 agatgtctgt ggcctcccta gatttgactg gaggtctgct gccccttctc aacgaccctg
2641 agcccaagcc atccttggag ccagtcaaga gcagctttga tgacttcttg tttccggcat
2701 catctggaca cagtggcttt attagcatgg cagggtggca ataggactta gaaattggca
2761 ttggggccct tcttcttccc taaggtgggc acaacattga caaagcgccg gttgtactgc
2821 attcgcctct tggcccggcc tgtcttcttc ttcttctttt cctgtttggc caccttggga
2881 gtctgacctc tcacttttcc agcccgagcc aggaaaccgt gaactttacc tcccagcatg
2941 cggcctgcta cttccagagt ggtcagggcc tctacgccac actggcctag ggtggcctca
3001 tcctcctgcg gcgagcctgc cagaagcacg acttgatcgt cgggggcaat gccttccagg
3061 gaggtcacat gatctttgat ctgggcgacc gtcccctggc cggtcacctc gagggtgtgt
3121 agttcctggg cgcggacaaa gagctgcatg ttggctactt aagacagtaa aagattaaaa
3181 atcacgtgaa taaaagattt tattcagttt acagaaagag gggggaatga aagacccctt
3241 cataaggctt agccagctaa ctgcagtaac gccattttgc aaggcatggg aaaataccag
3301 agctgatgtt ctcagaaaaa caagaacaag gaggtaaaga gaggctggaa agtaccggga
3361 ctagggccaa gaacaaatgg ttcccagaaa tagaggctgg aaagtaccgg gactagggcc
3421 aaacaggata tctgtggtca agcactaggg ccccggccag ggccaagaac agatggttcc
3481 cagaaatagc taaaacaaca acagtttcaa gagacccaga aactgtctca aggttcccca
3541 gatgaccggg gatcaacccc aagcctcatt taaactaacc aatcagctcg cttctcgctt
3601 ctgtacccgc gcttattgct gcccagctct ataaaaaggg taagaacccc acactcggcg
3661 cgccagtcct ccgatagact gagtcgcccg ggtacccgtg tatccaataa agccttttgc
3721 tgttgcatcc gaatcgtggt ctcgctgatc cttgggaggg tctcctcaga gtgattgact
3781 gcccagcctg ggggtctttc agtatgtaat a
//