EMBL: K02013

ID   HIVBRUCG   standard; RNA; VRL; 9229 BP.
XX
AC   K02013;
XX
DT   18-NOV-1986 (Rel. 10, Created)
DT   17-OCT-1992 (Rel. 33, Last updated, Version 5)
XX
DE   Human immunodeficiency virus type 1, isolate BRU, complete genome
DE   (LAV-1).
XX
KW   acquired immune deficiency syndrome; complete genome; env gene;
KW   gag gene; long terminal repeat; pol gene; polyprotein;
KW   proviral gene; reverse transcriptase; tar protein; tat protein;
KW   trans-activator.
XX
OS   Human immunodeficiency virus type 1
OC   Viridae; ss-RNA enveloped viruses; Positive strand RNA viruses;
OC   Retroviridae; Lentivirinae.
XX
RN   [1]
RP   1-9229
RA   Wain-Hobson S., Sonigo P., Danos O., Cole S., Alizon M.;
RT   "Nucleotide sequence of the AIDS Virus, LAV";
RL   Cell 40:9-17(1985).
XX
RN   [2]
RP   1-9193
RA   Van Beveren C., Coffin J., Hughes S.;
RT   "Appendix B: HTLV-3/LAV genome";
RL   (in) Weiss R., Teich N., Varmus H., Coffin J. (eds.);
RL   RNA TUMOR VIRUSES, MOLECULAR BIOLOGY OF TUMOR VIRUSES, SECOND
RL   EDITION, 2 (SUPPLEMENTS AND APPENDIXES):1106-1123;
RL   Cold Spring Harbor Laboratory, CSH, NY (1985)
XX
RN   [3]
RP   1712-1749
RA   Alizon M., Wain-Hobson S., Gluckman J.C., Sonigo P.;
RT   "Genetic variability of the AIDS virus: Nucleotide sequence
RT   analysis of two isolates from African patients";
RL   Cell 46:63-74(1986).
XX
DR   SWISS-PROT; P03348; GAG_HV1BR.
DR   SWISS-PROT; P03367; POL_HV1BR.
DR   SWISS-PROT; P03377; ENV_HV1BR.
DR   SWISS-PROT; P03401; VIF_HV1B1.
DR   SWISS-PROT; P03406; NEF_HV1BR.
DR   SWISS-PROT; P04610; TAT_HV1BR.
DR   SWISS-PROT; P04620; REV_HV1BR.
DR   SWISS-PROT; P05923; VPU_HV1BR.
DR   SWISS-PROT; P05928; VPR_HV1BR.
XX
CC   [(in) Weiss,R., Teich,N., Varmus,H. and Coffin,J. (Eds.);RNA Tumor
CC   Viruses,Molecu] review. [3] revision of [1]. The original LAV,
CC   sometimes called LAV-1 to distinguish it from HIV2 (LAV-2), is now
CC   referred to as HIV-1bru. An infectious clone of this virus has been
CC   constructed by Keith Peden, Molecular Bio- logy and Genetics, Johns
CC   Hopkins University School of Medicine, Baltimore, MD 21205 (301)
CC   955-3652. HIVNL43 is also an infectious clone having for its 3'
CC   half a clone of the BRU isolate. Acquired immune deficiency
CC   syndrome (AIDS) is caused by a retrovirus known by several
CC   different names, probably representing two separate strains: human
CC   T-cell lymphotropic virus-III (HTLV-III) and
CC   lymphadenopathy-associated virus (LAV) are thought to be one
CC   strain, and AIDS-associated retrovirus type 2 (ARV-2) the other.
CC   All three viruses, whose sequences do not differ by more than about
CC   6%, are believed to belong to the retroviral subfamily
CC   Lentiviridae, or 'slow' viruses. For the details of the annotation
CC   and for other pertinent references, see the HIV reference entry.
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..9229
FT                   /organism="Human immunodeficiency virus type 1"
FT   LTR             <1..180
FT                   /note="5' LTR"
FT   repeat_region   <1..97
FT                   /note="R repeat 5' copy"
FT   exon            <5412..5626
FT                   /note="tat protein, exon 2 (first expressed exon)"
FT   exon            <5551..5626
FT                   /note="rev protein, exon 2 (first expressed exon)"
FT   CDS             join(5412..5626,7972..8017)
FT                   /note="tat protein"
FT                   /codon_start=1
FT   CDS             join(5551..5626,7972..8246)
FT                   /note="rev protein"
FT                   /codon_start=1
FT   prim_transcript 1..9229
FT                   /note="genomic mRNA"
FT   prim_transcript 1..9229
FT                   /note="tat, rev, nef subgenomic mRNA"
FT   misc_binding    182..199
FT                   /bound_moiety="primer (Lys-tRNA)"
FT   intron          290..5358
FT                   /note="tat, rev, nef subgenomic mRNA intron 1"
FT   CDS             336..1874
FT                   /note="gag polyprotein"
FT                   /codon_start=1
FT   misc_feature    1631..4678
FT                   /note="ORF pol polypeptide bordered by stop codons"
FT                   /product="pol polyprotein"
FT                   /gene="pol"
FT   CDS             4623..5201
FT                   /note="vif protein"
FT                   /codon_start=1
FT   CDS             5141..5431
FT                   /note="vpr protein"
FT                   /codon_start=1
FT   intron          5627..7971
FT                   /note="tat cds intron 2"
FT   intron          5627..7971
FT                   /note="rev cds intron 2"
FT   intron          5627..7971
FT                   /note="tat, rev, nef subgenomic mRNA intron 2"
FT   CDS             5643..5888
FT                   /note="vpu protein"
FT                   /codon_start=1
FT   CDS             5803..8388
FT                   /note="envelope polyprotein"
FT                   /codon_start=1
FT   exon            7972..>8017
FT                   /note="tat protein, exon 3 (AA at 7973)"
FT   exon            7972..>8246
FT                   /note="rev protein, exon 3 (AA at 7974)"
FT   CDS             8390..9010
FT                   /note="nef protein"
FT                   /codon_start=1
FT   LTR             8679..>9229
FT                   /note="3' LTR"
FT   repeat_region   9133..9229
FT                   /note="R repeat 3' copy"
FT   polyA_signal    9205..9210
FT                   /note="mRNA polyadenylation signal"
XX
SQ   Sequence 9229 BP; 3289 A; 1656 C; 2232 G; 2052 T; 0 other;
     ggtctctctg gttagaccag atttgagcct gggagctctc tggctaacta gggaacccac        60
     tgcttaagcc tcaataaagc ttgccttgag tgcttcaagt agtgtgtgcc cgtctgttgt       120
     gtgactctgg taactagaga tccctcagac ccttttagtc agtgtggaaa atctctagca       180
     gtggcgcccg aacagggact tgaaagcgaa agggaaacca gaggagctct ctcgacgcag       240
     gactcggctt gctgaagcgc gcacggcaag aggcgagggg aggcgactgg tgagtacgcc       300
     aaaaattttg actagcggag gctagaagga gagagatggg tgcgagagcg tcagtattaa       360
     gcgggggaga attagatcga tgggaaaaaa ttcggttaag gccaggggga aagaaaaaat       420
     ataaattaaa acatatagta tgggcaagca gggagctaga acgattcgca gttaatcctg       480
     gcctgttaga aacatcagaa ggctgtagac aaatactggg acagctacaa ccatcccttc       540
     agacaggatc agaagaactt agatcattat ataatacagt agcaaccctc tattgtgtgc       600
     atcaaaggat agagataaaa gacaccaagg aagctttaga caagatagag gaagagcaaa       660
     acaaaagtaa gaaaaaagca cagcaagcag cagctgacac aggacacagc agccaggtca       720
     gccaaaatta ccctatagtg cagaacatcc aggggcaaat ggtacatcag gccatatcac       780
     ctagaacttt aaatgcatgg gtaaaagtag tagaagagaa ggctttcagc ccagaagtga       840
     tacccatgtt ttcagcatta tcagaaggag ccaccccaca agatttaaac accatgctaa       900
     acacagtggg gggacatcaa gcagccatgc aaatgttaaa agagaccatc aatgaggaag       960
     ctgcagaatg ggatagagtg catccagtgc atgcagggcc tattgcacca ggccagatga      1020
     gagaaccaag gggaagtgac atagcaggaa ctactagtac ccttcaggaa caaataggat      1080
     ggatgacaaa taatccacct atcccagtag gagaaattta taaaagatgg ataatcctgg      1140
     gattaaataa aatagtaaga atgtatagcc ctaccagcat tctggacata agacaaggac      1200
     caaaagaacc ctttagagac tatgtagacc ggttctataa aactctaaga gccgagcaag      1260
     cttcacagga ggtaaaaaat tggatgacag aaaccttgtt ggtccaaaat gcgaacccag      1320
     attgtaagac tattttaaaa gcattgggac cagcagctac actagaagaa atgatgacag      1380
     catgtcaggg agtgggagga cccggccata aggcaagagt tttggctgaa gcaatgagcc      1440
     aagtaacaaa ttcagctacc ataatgatgc aaagaggcaa ttttaggaac caaagaaaga      1500
     ttgttaagtg tttcaattgt ggcaaagaag ggcacatagc cagaaattgc agggccccta      1560
     ggaaaaaggg ctgttggaaa tgtggaaagg aaggacacca aatgaaagat tgtactgaga      1620
     gacaggctaa ttttttaggg aagatctggc cttcctacaa gggaaggcca gggaattttc      1680
     ttcagagcag accagagcca acagccccac catttcttca gagcagacca gagccaacag      1740
     ccccaccaga agagagcttc aggtctgggg tagagacaac aactccctct cagaagcagg      1800
     agccgataga caaggaactg tatcctttaa cttccctcag atcactcttt ggcaacgacc      1860
     cctcgtcaca ataaagatag gggggcaact aaaggaagct ctattagata caggagcaga      1920
     tgatacagta ttagaagaaa tgagtttgcc aggaagatgg aaaccaaaaa tgataggggg      1980
     aattggaggt tttatcaaag taagacagta tgatcagata ctcatagaaa tctgtggaca      2040
     taaagctata ggtacagtat tagtaggacc tacacctgtc aacataattg gaagaaatct      2100
     gttgactcag attggttgca ctttaaattt tcccattagt cctattgaaa ctgtaccagt      2160
     aaaattaaag ccaggaatgg atggcccaaa agttaaacaa tggccattga cagaagaaaa      2220
     aataaaagca ttagtagaaa tttgtacaga aatggaaaag gaagggaaaa tttcaaaaat      2280
     tgggcctgaa aatccataca atactccagt atttgccata aagaaaaaag acagtactaa      2340
     atggagaaaa ttagtagatt tcagagaact taataagaga actcaagact tctgggaagt      2400
     tcaattagga ataccacatc ccgcagggtt aaaaaagaaa aaatcagtaa cagtactgga      2460
     tgtgggtgat gcatattttt cagttccctt agatgaagac ttcaggaagt atactgcatt      2520
     taccatacct agtataaaca atgagacacc agggattaga tatcagtaca atgtgcttcc      2580
     acagggatgg aaaggatcac cagcaatatt ccaaagtagc atgacaaaaa tcttagagcc      2640
     ttttagaaaa caaaatccag acatagttat ctatcaatac atggatgatt tgtatgtagg      2700
     atctgactta gaaatagggc agcatagaac aaaaatagag gagctgagac aacatctgtt      2760
     gaggtgggga cttaccacac cagacaaaaa acatcagaaa gaacctccat tcctttggat      2820
     gggttatgaa ctccatcctg ataaatggac agtacagcct atagtgctgc cagaaaaaga      2880
     cagctggact gtcaatgaca tacagaagtt agtgggaaaa ttgaattggg caagtcagat      2940
     ttacccaggg attaaagtaa ggcaattatg taaactcctt agaggaacca aagcactaac      3000
     agaagtaata ccactaacag aagaagcaga gctagaactg gcagaaaaca gagagattct      3060
     aaaagaacca gtacatggag tgtattatga cccatcaaaa gacttaatag cagaaataca      3120
     gaagcagggg caaggccaat ggacatatca aatttatcaa gagccattta aaaatctgaa      3180
     aacaggaaaa tatgcaagaa cgaggggtgc ccacactaat gatgtaaaac aattaacaga      3240
     ggcagtgcaa aaaataacca cagaaagcat agtaatatgg ggaaagactc ctaaatttaa      3300
     actacccata caaaaggaaa catgggaaac atggtggaca gagtattggc aagccacctg      3360
     gattcctgag tgggagtttg tcaatacccc tcctttagtg aaattatggt accagttaga      3420
     gaaagaaccc atagtaggag cagaaacgtt ctatgtagat ggggcagcta gcagggagac      3480
     taaattagga aaagcaggat atgttactaa tagaggaaga caaaaagttg tcaccctaac      3540
     tgacacaaca aatcagaaga ctgagttaca agcaattcat ctagctttgc aggattcggg      3600
     attagaagta aatatagtaa cagactcaca atatgcatta ggaatcattc aagcacaacc      3660
     agataaaagt gaatcagagt tagtcaatca aataatagag cagttaataa aaaaggaaaa      3720
     ggtctatctg gcatgggtac cagcacacaa aggaattgga ggaaatgaac aagtagataa      3780
     attagtcagt gctggaatca ggaaagtact atttttagat ggaatagata aggcccaaga      3840
     tgaacatgag aaatatcaca gtaattggag agcaatggct agtgatttta acctgccacc      3900
     tgtagtagca aaagaaatag tagccagctg tgataaatgt cagctaaaag gagaagccat      3960
     gcatggacaa gtagactgta gtccaggaat atggcaacta gattgtacac atttagaagg      4020
     aaaagttatc ctggtagcag ttcatgtagc cagtggatat atagaagcag aagttattcc      4080
     agcagaaaca gggcaggaaa cagcatactt tcttttaaaa ttagcaggaa gatggccagt      4140
     aaaaacaata catacagaca atggcagcaa tttcaccagt actacggtta aggccgcctg      4200
     ttggtgggcg ggaatcaagc aggaatttgg aattccctac aatccccaaa gtcaaggagt      4260
     agtagaatct atgaataaag aattaaagaa aattataggc caggtaagag atcaggctga      4320
     acatcttaag acagcagtac aaatggcagt attcatccac aattttaaaa gaaaaggggg      4380
     gattgggggg tacagtgcag gggaaagaat agtagacata atagcaacag acatacaaac      4440
     taaagaatta caaaaacaaa ttacaaaaat tcaaaatttt cgggtttatt acagggacag      4500
     cagagatcca ctttggaaag gaccagcaaa gctcctctgg aaaggtgaag gggcagtagt      4560
     aatacaagat aatagtgaca taaaagtagt gccaagaaga aaagcaaaga tcattaggga      4620
     ttatggaaaa cagatggcag gtgatgattg tgtggcaagt agacaggatg aggattagaa      4680
     catggaaaag tttagtaaaa caccatatgt atgtttcagg gaaagctagg ggatggtttt      4740
     atagacatca ctatgaaagc cctcatccaa gaataagttc agaagtacac atcccactag      4800
     gggatgctag attggtaata acaacatatt ggggtctgca tacaggagaa agagactggc      4860
     atctgggtca gggagtctcc atagaatgga ggaaaaagag atatagcaca caagtagacc      4920
     ctgaactagc agaccaacta attcatctgt attactttga ctgtttttca gactctgcta      4980
     taagaaaggc cttattagga catatagtta gccctaggtg tgaatatcaa gcaggacata      5040
     acaaggtagg atctctacaa tacttggcac tagcagcatt aataacacca aaaaagataa      5100
     agccaccttt gcctagtgtt acgaaactga cagaggatag atggaacaag ccccagaaga      5160
     ccaagggcca cagagggagc cacacaatga atggacacta gagcttttag aggagcttaa      5220
     gaatgaagct gttagacatt ttcctaggat ttggctccat ggcttagggc aacatatcta      5280
     tgaaacttat ggggatactt gggcaggagt ggaagccata ataagaattc tgcaacaact      5340
     gctgtttatc catttcagaa ttgggtgtcg acatagcaga ataggcgtta ctcaacagag      5400
     gagagcaaga aatggagcca gtagatccta gactagagcc ctggaagcat ccaggaagtc      5460
     agcctaaaac tgcttgtacc acttgctatt gtaaaaagtg ttgctttcat tgccaagttt      5520
     gtttcacaac aaaagcctta ggcatctcct atggcaggaa gaagcggaga cagcgacgaa      5580
     gacctcctca aggcagtcag actcatcaag tttctctatc aaagcagtaa gtagtacatg      5640
     taatgcaacc tatacaaata gcaatagcag cattagtagt agcaataata atagcaatag      5700
     ttgtgtggtc catagtaatc atagaatata ggaaaatatt aagacaaaga aaaatagaca      5760
     ggttaattga tagactaata gaaagagcag aagacagtgg caatgagagt gaaggagaaa      5820
     tatcagcact tgtggagatg ggggtggaaa tggggcacca tgctccttgg gatattgatg      5880
     atctgtagtg ctacagaaaa attgtgggtc acagtctatt atggggtacc tgtgtggaag      5940
     gaagcaacca ccactctatt ttgtgcatca gatgctaaag catatgatac agaggtacat      6000
     aatgtttggg ccacacatgc ctgtgtaccc acagacccca acccacaaga agtagtattg      6060
     gtaaatgtga cagaaaattt taacatgtgg aaaaatgaca tggtagaaca gatgcatgag      6120
     gatataatca gtttatggga tcaaagccta aagccatgtg taaaattaac cccactctgt      6180
     gttagtttaa agtgcactga tttggggaat gctactaata ccaatagtag taataccaat      6240
     agtagtagcg gggaaatgat gatggagaaa ggagagataa aaaactgctc tttcaatatc      6300
     agcacaagca taagaggtaa ggtgcagaaa gaatatgcat ttttttataa acttgatata      6360
     ataccaatag ataatgatac taccagctat acgttgacaa gttgtaacac ctcagtcatt      6420
     acacaggcct gtccaaaggt atcctttgag ccaattccca tacattattg tgccccggct      6480
     ggttttgcga ttctaaaatg taataataag acgttcaatg gaacaggacc atgtacaaat      6540
     gtcagcacag tacaatgtac acatggaatt aggccagtag tatcaactca actgctgttg      6600
     aatggcagtc tagcagaaga agaggtagta attagatctg ccaatttcac agacaatgct      6660
     aaaaccataa tagtacagct gaaccaatct gtagaaatta attgtacaag acccaacaac      6720
     aatacaagaa aaagtatccg tatccagagg ggaccaggga gagcatttgt tacaatagga      6780
     aaaataggaa atatgagaca agcacattgt aacattagta gagcaaaatg gaatgccact      6840
     ttaaaacaga tagctagcaa attaagagaa caatttggaa ataataaaac aataatcttt      6900
     aagcaatcct caggagggga cccagaaatt gtaacgcaca gttttaattg tggaggggaa      6960
     tttttctact gtaattcaac acaactgttt aatagtactt ggtttaatag tacttggagt      7020
     actgaagggt caaataacac tgaaggaagt gacacaatca cactcccatg cagaataaaa      7080
     caatttataa acatgtggca ggaagtagga aaagcaatgt atgcccctcc catcagcgga      7140
     caaattagat gttcatcaaa tattacaggg ctgctattaa caagagatgg tggtaataac      7200
     aacaatgggt ccgagatctt cagacctgga ggaggagata tgagggacaa ttggagaagt      7260
     gaattatata aatataaagt agtaaaaatt gaaccattag gagtagcacc caccaaggca      7320
     aagagaagag tggtgcagag agaaaaaaga gcagtgggaa taggagcttt gttccttggg      7380
     ttcttgggag cagcaggaag cactatgggc gcacggtcaa tgacgctgac ggtacaggcc      7440
     agacaattat tgtctggtat agtgcagcag cagaacaatt tgctgagggc tattgaggcg      7500
     caacagcatc tgttgcaact cacagtctgg ggcatcaagc agctccaggc aagaatcctg      7560
     gctgtggaaa gatacctaaa ggatcaacag ctcctgggga tttggggttg ctctggaaaa      7620
     ctcatttgca ccactgctgt gccttggaat gctagttgga gtaataaatc tctggaacag      7680
     atttggaata acatgacctg gatggagtgg gacagagaaa ttaacaatta cacaagctta      7740
     atacattcct taattgaaga atcgcaaaac cagcaagaaa agaatgaaca agaattattg      7800
     gaattagata aatgggcaag tttgtggaat tggtttaaca taacaaattg gctgtggtat      7860
     ataaaaatat tcataatgat agtaggaggc ttggtaggtt taagaatagt ttttgctgta      7920
     ctttctatag tgaatagagt taggcaggga tattcaccat tatcgtttca gacccacctc      7980
     ccaaccccga ggggacccga caggcccgaa ggaatagaag aagaaggtgg agagagagac      8040
     agagacagat ccattcgatt agtgaacgga tccttagcac ttatctggga cgatctgcgg      8100
     agcctgtgcc tcttcagcta ccaccgcttg agagacttac tcttgattgt aacgaggatt      8160
     gtggaacttc tgggacgcag ggggtgggaa gccctcaaat attggtggaa tctcctacag      8220
     tattggagtc aggaactaaa gaatagtgct gttagcttgc tcaatgccac agccatagca      8280
     gtagctgagg ggacagatag ggttatagaa gtagtacaag gagcttgtag agctattcgc      8340
     cacataccta gaagaataag acagggcttg gaaaggattt tgctataaga tgggtggcaa      8400
     gtggtcaaaa agtagtgtgg ttggatggcc tactgtaagg gaaagaatga gacgagctga      8460
     gccagcagca gatggggtgg gagcagcatc tcgagacctg gaaaaacatg gagcaatcac      8520
     aagtagcaat acagcagcta ccaatgctgc ttgtgcctgg ctagaagcac aagaggagga      8580
     ggaggtgggt tttccagtca cacctcaggt acctttaaga ccaatgactt acaaggcagc      8640
     tgtagatctt agccactttt taaaagaaaa ggggggactg gaagggctaa ttcactccca      8700
     acgaagacaa gatatccttg atctgtggat ctaccacaca caaggctact tccctgattg      8760
     gcagaactac acaccagggc caggggtcag atatccactg acctttggat ggtgctacaa      8820
     gctagtacca gttgagccag ataaggtaga agaggccaat aaaggagaga acaccagctt      8880
     gttacaccct gtgagcctgc atggaatgga tgaccctgag agagaagtgt tagagtggag      8940
     gtttgacagc cgcctagcat ttcatcacgt ggcccgagag ctgcatccgg agtacttcaa      9000
     gaactgctga catcgagctt gctacaaggg actttccgct ggggactttc cagggaggcg      9060
     tggcctgggc gggactgggg agtggcgagc cctcagatgc tgcatataag cagctgcttt      9120
     ttgcctgtac tgggtctctc tggttagacc agatttgagc ctgggagctc tctggctaac      9180
     tagggaaccc actgcttaag cctcaataaa gcttgccttg agtgcttca                  9229
//

This experimental service designed by Keith Robison at Harvard University (robison@mito.harvard.edu) for providing SwissProt with enhancements. (Full-text search ID/DE search) Please do not blame the good people at ExPASy for any trouble -- instead try the native ExPASy SwissProt service