EMBL: K02013
ID HIVBRUCG standard; RNA; VRL; 9229 BP.
XX
AC K02013;
XX
DT 18-NOV-1986 (Rel. 10, Created)
DT 17-OCT-1992 (Rel. 33, Last updated, Version 5)
XX
DE Human immunodeficiency virus type 1, isolate BRU, complete genome
DE (LAV-1).
XX
KW acquired immune deficiency syndrome; complete genome; env gene;
KW gag gene; long terminal repeat; pol gene; polyprotein;
KW proviral gene; reverse transcriptase; tar protein; tat protein;
KW trans-activator.
XX
OS Human immunodeficiency virus type 1
OC Viridae; ss-RNA enveloped viruses; Positive strand RNA viruses;
OC Retroviridae; Lentivirinae.
XX
RN [1]
RP 1-9229
RA Wain-Hobson S., Sonigo P., Danos O., Cole S., Alizon M.;
RT "Nucleotide sequence of the AIDS Virus, LAV";
RL Cell 40:9-17(1985).
XX
RN [2]
RP 1-9193
RA Van Beveren C., Coffin J., Hughes S.;
RT "Appendix B: HTLV-3/LAV genome";
RL (in) Weiss R., Teich N., Varmus H., Coffin J. (eds.);
RL RNA TUMOR VIRUSES, MOLECULAR BIOLOGY OF TUMOR VIRUSES, SECOND
RL EDITION, 2 (SUPPLEMENTS AND APPENDIXES):1106-1123;
RL Cold Spring Harbor Laboratory, CSH, NY (1985)
XX
RN [3]
RP 1712-1749
RA Alizon M., Wain-Hobson S., Gluckman J.C., Sonigo P.;
RT "Genetic variability of the AIDS virus: Nucleotide sequence
RT analysis of two isolates from African patients";
RL Cell 46:63-74(1986).
XX
DR SWISS-PROT; P03348; GAG_HV1BR.
DR SWISS-PROT; P03367; POL_HV1BR.
DR SWISS-PROT; P03377; ENV_HV1BR.
DR SWISS-PROT; P03401; VIF_HV1B1.
DR SWISS-PROT; P03406; NEF_HV1BR.
DR SWISS-PROT; P04610; TAT_HV1BR.
DR SWISS-PROT; P04620; REV_HV1BR.
DR SWISS-PROT; P05923; VPU_HV1BR.
DR SWISS-PROT; P05928; VPR_HV1BR.
XX
CC [(in) Weiss,R., Teich,N., Varmus,H. and Coffin,J. (Eds.);RNA Tumor
CC Viruses,Molecu] review. [3] revision of [1]. The original LAV,
CC sometimes called LAV-1 to distinguish it from HIV2 (LAV-2), is now
CC referred to as HIV-1bru. An infectious clone of this virus has been
CC constructed by Keith Peden, Molecular Bio- logy and Genetics, Johns
CC Hopkins University School of Medicine, Baltimore, MD 21205 (301)
CC 955-3652. HIVNL43 is also an infectious clone having for its 3'
CC half a clone of the BRU isolate. Acquired immune deficiency
CC syndrome (AIDS) is caused by a retrovirus known by several
CC different names, probably representing two separate strains: human
CC T-cell lymphotropic virus-III (HTLV-III) and
CC lymphadenopathy-associated virus (LAV) are thought to be one
CC strain, and AIDS-associated retrovirus type 2 (ARV-2) the other.
CC All three viruses, whose sequences do not differ by more than about
CC 6%, are believed to belong to the retroviral subfamily
CC Lentiviridae, or 'slow' viruses. For the details of the annotation
CC and for other pertinent references, see the HIV reference entry.
XX
FH Key Location/Qualifiers
FH
FT source 1..9229
FT /organism="Human immunodeficiency virus type 1"
FT LTR <1..180
FT /note="5' LTR"
FT repeat_region <1..97
FT /note="R repeat 5' copy"
FT exon <5412..5626
FT /note="tat protein, exon 2 (first expressed exon)"
FT exon <5551..5626
FT /note="rev protein, exon 2 (first expressed exon)"
FT CDS join(5412..5626,7972..8017)
FT /note="tat protein"
FT /codon_start=1
FT CDS join(5551..5626,7972..8246)
FT /note="rev protein"
FT /codon_start=1
FT prim_transcript 1..9229
FT /note="genomic mRNA"
FT prim_transcript 1..9229
FT /note="tat, rev, nef subgenomic mRNA"
FT misc_binding 182..199
FT /bound_moiety="primer (Lys-tRNA)"
FT intron 290..5358
FT /note="tat, rev, nef subgenomic mRNA intron 1"
FT CDS 336..1874
FT /note="gag polyprotein"
FT /codon_start=1
FT misc_feature 1631..4678
FT /note="ORF pol polypeptide bordered by stop codons"
FT /product="pol polyprotein"
FT /gene="pol"
FT CDS 4623..5201
FT /note="vif protein"
FT /codon_start=1
FT CDS 5141..5431
FT /note="vpr protein"
FT /codon_start=1
FT intron 5627..7971
FT /note="tat cds intron 2"
FT intron 5627..7971
FT /note="rev cds intron 2"
FT intron 5627..7971
FT /note="tat, rev, nef subgenomic mRNA intron 2"
FT CDS 5643..5888
FT /note="vpu protein"
FT /codon_start=1
FT CDS 5803..8388
FT /note="envelope polyprotein"
FT /codon_start=1
FT exon 7972..>8017
FT /note="tat protein, exon 3 (AA at 7973)"
FT exon 7972..>8246
FT /note="rev protein, exon 3 (AA at 7974)"
FT CDS 8390..9010
FT /note="nef protein"
FT /codon_start=1
FT LTR 8679..>9229
FT /note="3' LTR"
FT repeat_region 9133..9229
FT /note="R repeat 3' copy"
FT polyA_signal 9205..9210
FT /note="mRNA polyadenylation signal"
XX
SQ Sequence 9229 BP; 3289 A; 1656 C; 2232 G; 2052 T; 0 other;
ggtctctctg gttagaccag atttgagcct gggagctctc tggctaacta gggaacccac 60
tgcttaagcc tcaataaagc ttgccttgag tgcttcaagt agtgtgtgcc cgtctgttgt 120
gtgactctgg taactagaga tccctcagac ccttttagtc agtgtggaaa atctctagca 180
gtggcgcccg aacagggact tgaaagcgaa agggaaacca gaggagctct ctcgacgcag 240
gactcggctt gctgaagcgc gcacggcaag aggcgagggg aggcgactgg tgagtacgcc 300
aaaaattttg actagcggag gctagaagga gagagatggg tgcgagagcg tcagtattaa 360
gcgggggaga attagatcga tgggaaaaaa ttcggttaag gccaggggga aagaaaaaat 420
ataaattaaa acatatagta tgggcaagca gggagctaga acgattcgca gttaatcctg 480
gcctgttaga aacatcagaa ggctgtagac aaatactggg acagctacaa ccatcccttc 540
agacaggatc agaagaactt agatcattat ataatacagt agcaaccctc tattgtgtgc 600
atcaaaggat agagataaaa gacaccaagg aagctttaga caagatagag gaagagcaaa 660
acaaaagtaa gaaaaaagca cagcaagcag cagctgacac aggacacagc agccaggtca 720
gccaaaatta ccctatagtg cagaacatcc aggggcaaat ggtacatcag gccatatcac 780
ctagaacttt aaatgcatgg gtaaaagtag tagaagagaa ggctttcagc ccagaagtga 840
tacccatgtt ttcagcatta tcagaaggag ccaccccaca agatttaaac accatgctaa 900
acacagtggg gggacatcaa gcagccatgc aaatgttaaa agagaccatc aatgaggaag 960
ctgcagaatg ggatagagtg catccagtgc atgcagggcc tattgcacca ggccagatga 1020
gagaaccaag gggaagtgac atagcaggaa ctactagtac ccttcaggaa caaataggat 1080
ggatgacaaa taatccacct atcccagtag gagaaattta taaaagatgg ataatcctgg 1140
gattaaataa aatagtaaga atgtatagcc ctaccagcat tctggacata agacaaggac 1200
caaaagaacc ctttagagac tatgtagacc ggttctataa aactctaaga gccgagcaag 1260
cttcacagga ggtaaaaaat tggatgacag aaaccttgtt ggtccaaaat gcgaacccag 1320
attgtaagac tattttaaaa gcattgggac cagcagctac actagaagaa atgatgacag 1380
catgtcaggg agtgggagga cccggccata aggcaagagt tttggctgaa gcaatgagcc 1440
aagtaacaaa ttcagctacc ataatgatgc aaagaggcaa ttttaggaac caaagaaaga 1500
ttgttaagtg tttcaattgt ggcaaagaag ggcacatagc cagaaattgc agggccccta 1560
ggaaaaaggg ctgttggaaa tgtggaaagg aaggacacca aatgaaagat tgtactgaga 1620
gacaggctaa ttttttaggg aagatctggc cttcctacaa gggaaggcca gggaattttc 1680
ttcagagcag accagagcca acagccccac catttcttca gagcagacca gagccaacag 1740
ccccaccaga agagagcttc aggtctgggg tagagacaac aactccctct cagaagcagg 1800
agccgataga caaggaactg tatcctttaa cttccctcag atcactcttt ggcaacgacc 1860
cctcgtcaca ataaagatag gggggcaact aaaggaagct ctattagata caggagcaga 1920
tgatacagta ttagaagaaa tgagtttgcc aggaagatgg aaaccaaaaa tgataggggg 1980
aattggaggt tttatcaaag taagacagta tgatcagata ctcatagaaa tctgtggaca 2040
taaagctata ggtacagtat tagtaggacc tacacctgtc aacataattg gaagaaatct 2100
gttgactcag attggttgca ctttaaattt tcccattagt cctattgaaa ctgtaccagt 2160
aaaattaaag ccaggaatgg atggcccaaa agttaaacaa tggccattga cagaagaaaa 2220
aataaaagca ttagtagaaa tttgtacaga aatggaaaag gaagggaaaa tttcaaaaat 2280
tgggcctgaa aatccataca atactccagt atttgccata aagaaaaaag acagtactaa 2340
atggagaaaa ttagtagatt tcagagaact taataagaga actcaagact tctgggaagt 2400
tcaattagga ataccacatc ccgcagggtt aaaaaagaaa aaatcagtaa cagtactgga 2460
tgtgggtgat gcatattttt cagttccctt agatgaagac ttcaggaagt atactgcatt 2520
taccatacct agtataaaca atgagacacc agggattaga tatcagtaca atgtgcttcc 2580
acagggatgg aaaggatcac cagcaatatt ccaaagtagc atgacaaaaa tcttagagcc 2640
ttttagaaaa caaaatccag acatagttat ctatcaatac atggatgatt tgtatgtagg 2700
atctgactta gaaatagggc agcatagaac aaaaatagag gagctgagac aacatctgtt 2760
gaggtgggga cttaccacac cagacaaaaa acatcagaaa gaacctccat tcctttggat 2820
gggttatgaa ctccatcctg ataaatggac agtacagcct atagtgctgc cagaaaaaga 2880
cagctggact gtcaatgaca tacagaagtt agtgggaaaa ttgaattggg caagtcagat 2940
ttacccaggg attaaagtaa ggcaattatg taaactcctt agaggaacca aagcactaac 3000
agaagtaata ccactaacag aagaagcaga gctagaactg gcagaaaaca gagagattct 3060
aaaagaacca gtacatggag tgtattatga cccatcaaaa gacttaatag cagaaataca 3120
gaagcagggg caaggccaat ggacatatca aatttatcaa gagccattta aaaatctgaa 3180
aacaggaaaa tatgcaagaa cgaggggtgc ccacactaat gatgtaaaac aattaacaga 3240
ggcagtgcaa aaaataacca cagaaagcat agtaatatgg ggaaagactc ctaaatttaa 3300
actacccata caaaaggaaa catgggaaac atggtggaca gagtattggc aagccacctg 3360
gattcctgag tgggagtttg tcaatacccc tcctttagtg aaattatggt accagttaga 3420
gaaagaaccc atagtaggag cagaaacgtt ctatgtagat ggggcagcta gcagggagac 3480
taaattagga aaagcaggat atgttactaa tagaggaaga caaaaagttg tcaccctaac 3540
tgacacaaca aatcagaaga ctgagttaca agcaattcat ctagctttgc aggattcggg 3600
attagaagta aatatagtaa cagactcaca atatgcatta ggaatcattc aagcacaacc 3660
agataaaagt gaatcagagt tagtcaatca aataatagag cagttaataa aaaaggaaaa 3720
ggtctatctg gcatgggtac cagcacacaa aggaattgga ggaaatgaac aagtagataa 3780
attagtcagt gctggaatca ggaaagtact atttttagat ggaatagata aggcccaaga 3840
tgaacatgag aaatatcaca gtaattggag agcaatggct agtgatttta acctgccacc 3900
tgtagtagca aaagaaatag tagccagctg tgataaatgt cagctaaaag gagaagccat 3960
gcatggacaa gtagactgta gtccaggaat atggcaacta gattgtacac atttagaagg 4020
aaaagttatc ctggtagcag ttcatgtagc cagtggatat atagaagcag aagttattcc 4080
agcagaaaca gggcaggaaa cagcatactt tcttttaaaa ttagcaggaa gatggccagt 4140
aaaaacaata catacagaca atggcagcaa tttcaccagt actacggtta aggccgcctg 4200
ttggtgggcg ggaatcaagc aggaatttgg aattccctac aatccccaaa gtcaaggagt 4260
agtagaatct atgaataaag aattaaagaa aattataggc caggtaagag atcaggctga 4320
acatcttaag acagcagtac aaatggcagt attcatccac aattttaaaa gaaaaggggg 4380
gattgggggg tacagtgcag gggaaagaat agtagacata atagcaacag acatacaaac 4440
taaagaatta caaaaacaaa ttacaaaaat tcaaaatttt cgggtttatt acagggacag 4500
cagagatcca ctttggaaag gaccagcaaa gctcctctgg aaaggtgaag gggcagtagt 4560
aatacaagat aatagtgaca taaaagtagt gccaagaaga aaagcaaaga tcattaggga 4620
ttatggaaaa cagatggcag gtgatgattg tgtggcaagt agacaggatg aggattagaa 4680
catggaaaag tttagtaaaa caccatatgt atgtttcagg gaaagctagg ggatggtttt 4740
atagacatca ctatgaaagc cctcatccaa gaataagttc agaagtacac atcccactag 4800
gggatgctag attggtaata acaacatatt ggggtctgca tacaggagaa agagactggc 4860
atctgggtca gggagtctcc atagaatgga ggaaaaagag atatagcaca caagtagacc 4920
ctgaactagc agaccaacta attcatctgt attactttga ctgtttttca gactctgcta 4980
taagaaaggc cttattagga catatagtta gccctaggtg tgaatatcaa gcaggacata 5040
acaaggtagg atctctacaa tacttggcac tagcagcatt aataacacca aaaaagataa 5100
agccaccttt gcctagtgtt acgaaactga cagaggatag atggaacaag ccccagaaga 5160
ccaagggcca cagagggagc cacacaatga atggacacta gagcttttag aggagcttaa 5220
gaatgaagct gttagacatt ttcctaggat ttggctccat ggcttagggc aacatatcta 5280
tgaaacttat ggggatactt gggcaggagt ggaagccata ataagaattc tgcaacaact 5340
gctgtttatc catttcagaa ttgggtgtcg acatagcaga ataggcgtta ctcaacagag 5400
gagagcaaga aatggagcca gtagatccta gactagagcc ctggaagcat ccaggaagtc 5460
agcctaaaac tgcttgtacc acttgctatt gtaaaaagtg ttgctttcat tgccaagttt 5520
gtttcacaac aaaagcctta ggcatctcct atggcaggaa gaagcggaga cagcgacgaa 5580
gacctcctca aggcagtcag actcatcaag tttctctatc aaagcagtaa gtagtacatg 5640
taatgcaacc tatacaaata gcaatagcag cattagtagt agcaataata atagcaatag 5700
ttgtgtggtc catagtaatc atagaatata ggaaaatatt aagacaaaga aaaatagaca 5760
ggttaattga tagactaata gaaagagcag aagacagtgg caatgagagt gaaggagaaa 5820
tatcagcact tgtggagatg ggggtggaaa tggggcacca tgctccttgg gatattgatg 5880
atctgtagtg ctacagaaaa attgtgggtc acagtctatt atggggtacc tgtgtggaag 5940
gaagcaacca ccactctatt ttgtgcatca gatgctaaag catatgatac agaggtacat 6000
aatgtttggg ccacacatgc ctgtgtaccc acagacccca acccacaaga agtagtattg 6060
gtaaatgtga cagaaaattt taacatgtgg aaaaatgaca tggtagaaca gatgcatgag 6120
gatataatca gtttatggga tcaaagccta aagccatgtg taaaattaac cccactctgt 6180
gttagtttaa agtgcactga tttggggaat gctactaata ccaatagtag taataccaat 6240
agtagtagcg gggaaatgat gatggagaaa ggagagataa aaaactgctc tttcaatatc 6300
agcacaagca taagaggtaa ggtgcagaaa gaatatgcat ttttttataa acttgatata 6360
ataccaatag ataatgatac taccagctat acgttgacaa gttgtaacac ctcagtcatt 6420
acacaggcct gtccaaaggt atcctttgag ccaattccca tacattattg tgccccggct 6480
ggttttgcga ttctaaaatg taataataag acgttcaatg gaacaggacc atgtacaaat 6540
gtcagcacag tacaatgtac acatggaatt aggccagtag tatcaactca actgctgttg 6600
aatggcagtc tagcagaaga agaggtagta attagatctg ccaatttcac agacaatgct 6660
aaaaccataa tagtacagct gaaccaatct gtagaaatta attgtacaag acccaacaac 6720
aatacaagaa aaagtatccg tatccagagg ggaccaggga gagcatttgt tacaatagga 6780
aaaataggaa atatgagaca agcacattgt aacattagta gagcaaaatg gaatgccact 6840
ttaaaacaga tagctagcaa attaagagaa caatttggaa ataataaaac aataatcttt 6900
aagcaatcct caggagggga cccagaaatt gtaacgcaca gttttaattg tggaggggaa 6960
tttttctact gtaattcaac acaactgttt aatagtactt ggtttaatag tacttggagt 7020
actgaagggt caaataacac tgaaggaagt gacacaatca cactcccatg cagaataaaa 7080
caatttataa acatgtggca ggaagtagga aaagcaatgt atgcccctcc catcagcgga 7140
caaattagat gttcatcaaa tattacaggg ctgctattaa caagagatgg tggtaataac 7200
aacaatgggt ccgagatctt cagacctgga ggaggagata tgagggacaa ttggagaagt 7260
gaattatata aatataaagt agtaaaaatt gaaccattag gagtagcacc caccaaggca 7320
aagagaagag tggtgcagag agaaaaaaga gcagtgggaa taggagcttt gttccttggg 7380
ttcttgggag cagcaggaag cactatgggc gcacggtcaa tgacgctgac ggtacaggcc 7440
agacaattat tgtctggtat agtgcagcag cagaacaatt tgctgagggc tattgaggcg 7500
caacagcatc tgttgcaact cacagtctgg ggcatcaagc agctccaggc aagaatcctg 7560
gctgtggaaa gatacctaaa ggatcaacag ctcctgggga tttggggttg ctctggaaaa 7620
ctcatttgca ccactgctgt gccttggaat gctagttgga gtaataaatc tctggaacag 7680
atttggaata acatgacctg gatggagtgg gacagagaaa ttaacaatta cacaagctta 7740
atacattcct taattgaaga atcgcaaaac cagcaagaaa agaatgaaca agaattattg 7800
gaattagata aatgggcaag tttgtggaat tggtttaaca taacaaattg gctgtggtat 7860
ataaaaatat tcataatgat agtaggaggc ttggtaggtt taagaatagt ttttgctgta 7920
ctttctatag tgaatagagt taggcaggga tattcaccat tatcgtttca gacccacctc 7980
ccaaccccga ggggacccga caggcccgaa ggaatagaag aagaaggtgg agagagagac 8040
agagacagat ccattcgatt agtgaacgga tccttagcac ttatctggga cgatctgcgg 8100
agcctgtgcc tcttcagcta ccaccgcttg agagacttac tcttgattgt aacgaggatt 8160
gtggaacttc tgggacgcag ggggtgggaa gccctcaaat attggtggaa tctcctacag 8220
tattggagtc aggaactaaa gaatagtgct gttagcttgc tcaatgccac agccatagca 8280
gtagctgagg ggacagatag ggttatagaa gtagtacaag gagcttgtag agctattcgc 8340
cacataccta gaagaataag acagggcttg gaaaggattt tgctataaga tgggtggcaa 8400
gtggtcaaaa agtagtgtgg ttggatggcc tactgtaagg gaaagaatga gacgagctga 8460
gccagcagca gatggggtgg gagcagcatc tcgagacctg gaaaaacatg gagcaatcac 8520
aagtagcaat acagcagcta ccaatgctgc ttgtgcctgg ctagaagcac aagaggagga 8580
ggaggtgggt tttccagtca cacctcaggt acctttaaga ccaatgactt acaaggcagc 8640
tgtagatctt agccactttt taaaagaaaa ggggggactg gaagggctaa ttcactccca 8700
acgaagacaa gatatccttg atctgtggat ctaccacaca caaggctact tccctgattg 8760
gcagaactac acaccagggc caggggtcag atatccactg acctttggat ggtgctacaa 8820
gctagtacca gttgagccag ataaggtaga agaggccaat aaaggagaga acaccagctt 8880
gttacaccct gtgagcctgc atggaatgga tgaccctgag agagaagtgt tagagtggag 8940
gtttgacagc cgcctagcat ttcatcacgt ggcccgagag ctgcatccgg agtacttcaa 9000
gaactgctga catcgagctt gctacaaggg actttccgct ggggactttc cagggaggcg 9060
tggcctgggc gggactgggg agtggcgagc cctcagatgc tgcatataag cagctgcttt 9120
ttgcctgtac tgggtctctc tggttagacc agatttgagc ctgggagctc tctggctaac 9180
tagggaaccc actgcttaag cctcaataaa gcttgccttg agtgcttca 9229
//
This experimental service designed by Keith Robison at Harvard University (robison@mito.harvard.edu) for providing SwissProt with enhancements. (Full-text search ID/DE search) Please do not blame the good people at ExPASy for any trouble -- instead try the native ExPASy SwissProt service