Evaluation of sequence ambiguities of the HIV-1 pol gene as a method to identify recent HIV-1 infection in transmitted drug resistance surveys

•We analyze 691 recent and chronic HIV pol sequences for proportion of mixed bases.•Significant difference in proportion is seen between recent and chronic infection.•A cutoff of <0.47% mixed bases to identify recent (<1year) infection is established.•We apply it to 36 TDR surveys and 71% of i...

Full description

Saved in:
Bibliographic Details
Published inInfection, genetics and evolution Vol. 18; pp. 125 - 131
Main Authors Andersson, Emmi, Shao, Wei, Bontell, Irene, Cham, Fatim, Cuong, Do Duy, Wondwossen, Amogne, Morris, Lynn, Hunt, Gillian, Sönnerborg, Anders, Bertagnolio, Silvia, Maldarelli, Frank, Jordan, Michael R.
Format Journal Article
LanguageEnglish
Published Netherlands Elsevier B.V 01.08.2013
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:•We analyze 691 recent and chronic HIV pol sequences for proportion of mixed bases.•Significant difference in proportion is seen between recent and chronic infection.•A cutoff of <0.47% mixed bases to identify recent (<1year) infection is established.•We apply it to 36 TDR surveys and 71% of infections are classified as recent.•The approach may provide additional information on a population level. Identification of recent HIV infection within populations is a public health priority for accurate estimation of HIV incidence rates and transmitted drug resistance at population level. Determining HIV incidence rates by prospective follow-up of HIV-uninfected individuals is challenging and serological assays have important limitations. HIV diversity within an infected host increases with duration of infection. We explore a simple bioinformatics approach to assess viral diversity by determining the percentage of ambiguous base calls in sequences derived from standard genotyping of HIV-1 protease and reverse transcriptase. Sequences from 691 recently infected (⩽1year) and chronically infected (>1year) individuals from Sweden, Vietnam and Ethiopia were analyzed for ambiguity. A significant difference (p<0.0001) in the proportion of ambiguous bases was observed between sequences from individuals with recent and chronic infection in both HIV-1 subtype B and non-B infection, consistent with previous studies. In our analysis, a cutoff of <0.47% ambiguous base calls identified recent infection with a sensitivity and specificity of 88.8% and 74.6% respectively. 1,728 protease and reverse transcriptase sequences from 36 surveys of transmitted HIV drug resistance performed following World Health Organization guidance were analyzed for ambiguity. The 0.47% ambiguity cutoff was applied and survey sequences were classified as likely derived from recently or chronically infected individuals. 71% of patients were classified as likely to have been infected within one year of genotyping but results varied considerably amongst surveys. This bioinformatics approach may provide supporting population-level information to identify recent infection but its application is limited by infection with more than one viral variant, decreasing viral diversity in advanced disease and technical aspects of population based sequencing. Standardization of sequencing techniques and base calling and the addition of other parameters such as CD4 cell count may address some of the technical limitations and increase the usefulness of the approach.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
ISSN:1567-1348
1567-7257
1567-7257
DOI:10.1016/j.meegid.2013.03.050