We have analyzed purine (R) and pyrimidine (Y) codon patterns in variable and constant regions of HIV-1 gp120 in seven patients infected with different HIV-1 subtypes and naive to antiretroviral therapy. We have calculated the relative frequency of each in-frame codon RNY, YNR, RNR, and YNY (N=any nucleotide) in variable and constant regions of gp120, in the sequence within indels and at indels' flanking sites. Our data show that hypervariable regions V1, V2, V4, and V5 are characterized by the presence of long stretches of RNY codons constituting the majority of the sequence portion within insertions/deletions. In full-length gp120 and within inserted/deleted fragments the number of AVT (V=A, C, G) codons did not exceed 50% of the total RNY codons. RNY strings in variable regions spanned up to 21 codons and were always in frame. In contrast, RNY strings in constant regions were mostly out of frame and their length was limited to five codons. The frequency of the codon RNY was found to be significantly higher in variable regions (p2 test). Analysis of the distribution of RNY strings equal to or longer than five codons in the full genome of HXB2 also shows that these sequences are mostly out of frame, unless they contain a potential N-glycosylation site or an asparagine. These data suggest that cryptic repeats of RNY may play a role in the genesis of multiple base insertions and deletions in hypervariable regions of gp120.
ASJC Scopus subject areas
- Infectious Diseases