#PAGE_PARAMS# #ADS_HEAD_SCRIPTS# #MICRODATA#

Detection of Tuberculosis in HIV-Infected and -Uninfected African Adults Using Whole Blood RNA Expression Signatures: A Case-Control Study


Background:
A major impediment to tuberculosis control in Africa is the difficulty in diagnosing active tuberculosis (TB), particularly in the context of HIV infection. We hypothesized that a unique host blood RNA transcriptional signature would distinguish TB from other diseases (OD) in HIV-infected and -uninfected patients, and that this could be the basis of a simple diagnostic test.

Methods and Findings:
Adult case-control cohorts were established in South Africa and Malawi of HIV-infected or -uninfected individuals consisting of 584 patients with either TB (confirmed by culture of Mycobacterium tuberculosis [M.TB] from sputum or tissue sample in a patient under investigation for TB), OD (i.e., TB was considered in the differential diagnosis but then excluded), or healthy individuals with latent TB infection (LTBI). Individuals were randomized into training (80%) and test (20%) cohorts. Blood transcriptional profiles were assessed and minimal sets of significantly differentially expressed transcripts distinguishing TB from LTBI and OD were identified in the training cohort. A 27 transcript signature distinguished TB from LTBI and a 44 transcript signature distinguished TB from OD. To evaluate our signatures, we used a novel computational method to calculate a disease risk score (DRS) for each patient. The classification based on this score was first evaluated in the test cohort, and then validated in an independent publically available dataset (GSE19491).

In our test cohort, the DRS classified TB from LTBI (sensitivity 95%, 95% CI [87–100]; specificity 90%, 95% CI [80–97]) and TB from OD (sensitivity 93%, 95% CI [83–100]; specificity 88%, 95% CI [74–97]). In the independent validation cohort, TB patients were distinguished both from LTBI individuals (sensitivity 95%, 95% CI [85–100]; specificity 94%, 95% CI [84–100]) and OD patients (sensitivity 100%, 95% CI [100–100]; specificity 96%, 95% CI [93–100]).

Limitations of our study include the use of only culture confirmed TB patients, and the potential that TB may have been misdiagnosed in a small proportion of OD patients despite the extensive clinical investigation used to assign each patient to their diagnostic group.

Conclusions:
In our study, blood transcriptional signatures distinguished TB from other conditions prevalent in HIV-infected and -uninfected African adults. Our DRS, based on these signatures, could be developed as a test for TB suitable for use in HIV endemic countries. Further evaluation of the performance of the signatures and DRS in prospective populations of patients with symptoms consistent with TB will be needed to define their clinical value under operational conditions.

Please see later in the article for the Editors' Summary


Published in the journal: . PLoS Med 10(10): e32767. doi:10.1371/journal.pmed.1001538
Category: Research Article
doi: https://doi.org/10.1371/journal.pmed.1001538

Summary

Background:
A major impediment to tuberculosis control in Africa is the difficulty in diagnosing active tuberculosis (TB), particularly in the context of HIV infection. We hypothesized that a unique host blood RNA transcriptional signature would distinguish TB from other diseases (OD) in HIV-infected and -uninfected patients, and that this could be the basis of a simple diagnostic test.

Methods and Findings:
Adult case-control cohorts were established in South Africa and Malawi of HIV-infected or -uninfected individuals consisting of 584 patients with either TB (confirmed by culture of Mycobacterium tuberculosis [M.TB] from sputum or tissue sample in a patient under investigation for TB), OD (i.e., TB was considered in the differential diagnosis but then excluded), or healthy individuals with latent TB infection (LTBI). Individuals were randomized into training (80%) and test (20%) cohorts. Blood transcriptional profiles were assessed and minimal sets of significantly differentially expressed transcripts distinguishing TB from LTBI and OD were identified in the training cohort. A 27 transcript signature distinguished TB from LTBI and a 44 transcript signature distinguished TB from OD. To evaluate our signatures, we used a novel computational method to calculate a disease risk score (DRS) for each patient. The classification based on this score was first evaluated in the test cohort, and then validated in an independent publically available dataset (GSE19491).

In our test cohort, the DRS classified TB from LTBI (sensitivity 95%, 95% CI [87–100]; specificity 90%, 95% CI [80–97]) and TB from OD (sensitivity 93%, 95% CI [83–100]; specificity 88%, 95% CI [74–97]). In the independent validation cohort, TB patients were distinguished both from LTBI individuals (sensitivity 95%, 95% CI [85–100]; specificity 94%, 95% CI [84–100]) and OD patients (sensitivity 100%, 95% CI [100–100]; specificity 96%, 95% CI [93–100]).

Limitations of our study include the use of only culture confirmed TB patients, and the potential that TB may have been misdiagnosed in a small proportion of OD patients despite the extensive clinical investigation used to assign each patient to their diagnostic group.

Conclusions:
In our study, blood transcriptional signatures distinguished TB from other conditions prevalent in HIV-infected and -uninfected African adults. Our DRS, based on these signatures, could be developed as a test for TB suitable for use in HIV endemic countries. Further evaluation of the performance of the signatures and DRS in prospective populations of patients with symptoms consistent with TB will be needed to define their clinical value under operational conditions.

Please see later in the article for the Editors' Summary

Introduction

There is an urgent need for improved tests to diagnose active tuberculosis (TB), particularly in countries of sub-Saharan Africa most affected by the TB/HIV pandemic. The diagnosis of TB was problematic even before the emergence of HIV, as symptoms and radiological features of TB overlap those of many other infectious and non-infectious conditions. However in countries of sub-Saharan Africa, where HIV prevalence amongst individuals presenting with symptoms consistent with TB is over 50% [1], the diagnostic difficulty is increased, as TB must be distinguished from a wide range of opportunistic infections and HIV-associated malignancies that present clinically with similar symptoms and signs.

For over a century, diagnosis of TB has relied on clinical and radiological features, sputum microscopy (with or without culture), and tuberculin skin testing (TST). All of these have major drawbacks, particularly in HIV co-infected individuals [2],[3], in whom radiological features are often atypical [4], cavitary lung disease is less common [5],[6], and results of sputum microscopy are often negative [2],[7]. Furthermore, culture facilities are largely unavailable in many African hospitals [8]. As TST and interferon gamma release assays (IGRAs) cannot discriminate TB from latent TB infection (LTBI) [9], they are of limited diagnostic utility amongst African adults where LTBI is highly prevalent in the general population [10], and amongst inpatients with other diagnoses. Molecular methods have improved detection of Mycobacterium tuberculosis (M.TB) DNA in sputum [11], but the sensitivity of this approach is lower in smear negative sputum samples even if culture positive [12]. Consequently, high proportions of patients with TB in sub-Saharan Africa remain undiagnosed or are treated empirically without laboratory confirmation. The need for improved diagnostic methods is highlighted by post mortem studies showing TB to be a frequent undiagnosed cause of death in Africa [13][15].

RNA expression analysis by microarray has emerged as a powerful tool for understanding disease biology [16]. Many diseases including cancer [17] and infectious diseases [18], as well as TB [19][26], are associated with specific transcriptional profiles in blood or tissue. Although previous studies in TB have suggested that RNA expression might be used diagnostically to distinguish TB from other conditions, these studies have excluded HIV-infected participants, and have compared TB with other diseases (OD) that are not representative of the spectrum seen in HIV-infected and -uninfected patients presenting to African hospitals with symptoms for which TB is included in the differential diagnosis [19][26]. There is thus a need to identify biomarkers that discriminate TB from OD prevalent in African populations, where the burden of the HIV/TB pandemic is greatest.

In this two country prospective case-control study, we investigated the hypothesis that host peripheral blood RNA expression would distinguish TB from other conditions prevalent in African populations in the context of endemic HIV infection, and explored the use of a transcriptional signature as the basis for a diagnostic test.

Methods

Ethics Statement

The study was approved by the Human Research Ethics Committee of the University of Cape Town, South Africa (HREC012/2007), the National Health Sciences Research Committee, Malawi (NHSRC/447), and the Ethics Committee of the London School of Hygiene and Tropical Medicine (5212). Written information was provided by trained local health workers in local languages and all patients provided written consent.

Study Sites

In order to enable generalization of our findings to African countries with differing prevalence of malaria and other parasitic infections, as well as other environmental exposures that might affect transcriptional profiles, we chose highly contrasting study sites (one urban, one rural) in two African countries with differing co-endemic diseases:

Cape Town, South Africa

South Africa has one of the highest TB incidence rates in Africa (981 per 100,000) [27], as well as high rates of HIV infection (up to 41.8% prevalence in females aged 25–35) [28]. Patients undergoing investigation for suspected TB were recruited at GF Jooste Hospital Manenberg, Groote Schuur Hospital, and at Khayelitsha site B clinics serving the largely Xhosa population residing in the low income townships of Cape Town. Malaria is not endemic in these urban populations.

Karonga district, Northern Malawi

The incidence of new TB cases in Karonga district (180 per 100,000, Karonga Prevention Study unpublished data, 2012) and the stable HIV prevalence (10%–15% of females aged 25–29, Karonga Prevention Study unpublished data, 2012) are lower in Karonga than in Cape Town. Malaria and helminth infection are hyperendemic. Patients were recruited at Karonga District hospital, which serves a rural population living by the shores of Lake Malawi.

Diagnostic Process

To ensure accurate assignment of patients to definite TB and OD groups, a rigorous diagnostic process was followed. All patients underwent chest radiographs and serological testing for HIV, along with cultures of blood, CSF, and urine, and biopsies for histological examination including TB culture where clinically indicated. Two sputum samples obtained after induction or coughing were examined by standard microscopy for acid fast bacilli (AFB) and cultured for TB using standard methods (i.e., solid media [South Africa and Malawi] and on liquid media [South Africa only]) [29]. Patients were followed up 26 wk post diagnosis to confirm that those with OD remained TB-free. Individuals were either assigned to one of the diagnostic groups or excluded once the results of investigations and follow-up were available. Healthy LTBI controls were recruited by random community selection (Malawi) and from HIV screening clinics (South Africa) from the same catchment areas as patients with TB (Figure 1). In vitro IGRA to substantiate LTBI was undertaken using an in-house whole blood assay [30],[31]. OD patients were recruited if they presented with symptoms that would mandate investigation for TB as a differential diagnosis. After intensive investigation, any case with an established alternative diagnosis to TB, no microbiological evidence of TB, and an absence of TB symptoms at the time of follow-up or with an observed improvement of clinical symptoms on follow-up without TB treatment, was recruited as an OD case. If TB could not be reliably ruled out of the differential, the patient was excluded.

Diagnostic process to identify TB cases, LTBI cases, and other diseases cases.
Fig. 1. Diagnostic process to identify TB cases, LTBI cases, and other diseases cases.

Patient Cohorts

Patient recruitment strategies, which differed at each site, were embedded within health services administered by statutory providers in order to best investigate on an “intention to test” basis.

Cape Town, South Africa

Recruitment in Cape Town commenced 12th October 2007 and concluded 5th January 2010. Subject to research staff availability, 96 sequential patients presenting with at least one positive TB culture result were recruited from an outpatient TB clinic in Khayelitsha site B until 49 HIV-infected and 47 HIV-uninfected persons were recruited (Figure 2). In Cape Town, 36.7% (18/49) HIV-infected patients with TB were smear-negative and 8.5% (4/47) HIV-uninfected patients were smear-negative. Patients in the OD category were recruited at GF Jooste and Groote Schuur hospitals in Cape Town. Patients were assessed by a hospital clinician and enrolled in the study if TB was considered in the differential diagnosis. After intensive investigation as described above, patients were assigned to the OD group if (1) an alternative diagnosis was established; (2) no microbiological evidence of TB was found after culture of sputum or other samples; and (3) an improvement of clinical symptoms was observed on follow-up without TB treatment (Figure 1). If a patient recruited to the OD group was later found to be culture positive for M.TB, they were reclassified appropriately. In total 138 HIV-infected and 80 HIV-uninfected patients were recruited in the OD group, of which 70 HIV-infected and 31 HIV-uninfected were excluded as TB diagnosis could not be excluded (i.e., TB uncertain) (Figure 2).

Study overview showing patient numbers and analysis pipeline.
Fig. 2. Study overview showing patient numbers and analysis pipeline.
HIV-, HIV-uninfected; HIV+, HIV-infected; TB, active tuberculosis (see Table 2).

Karonga district, Northern Malawi

Recruitment at Karonga District Hospital commenced on 1st June 2007 and ceased on the 30th November 2009. Patients attending the hospital were assessed by a local clinician. If this clinician considered TB to be within the differential diagnosis, patients were recruited by a study staff member and investigated according to clinical and study protocols as described above. Following the completion of in-patient care, patients were followed up for at least 26 wk post discharge to assess their progress including a verbal autopsy if the patient had died. Individuals were categorized following the completion of follow-up. Patients were assigned to the OD group if (1) a firm alternative diagnosis was established; (2) there was no microbiological evidence of TB; and (3) there was absence of symptoms of TB at the time of follow-up or assignation of an alternative cause of death on verbal autopsy (Figure 1). Individuals who did not have TB and did not fulfill criteria for OD—e.g., failed to attend follow-up and with an unknown 6-mo outcome—were categorized as “TB uncertain” (i.e., TB uncertain). During the recruitment period 437 patients were recruited. Of these 254 had definite TB, 77 had a confirmed OD, and 106 were categorized as TB cannot be excluded. The first 60 HIV-infected and 59 HIV-uninfected patients with TB, along with all the OD patients were included in the RNA expression study (Figure 2). In Malawi, 13.3% (8/60) HIV-infected patients with TB were smear-negative and 10.2% (6/59) HIV-uninfected patients were smear-negative.

Oversight and Conduct of the Study

Patients were recruited by FZ and a team of research assistants in Karonga, Malawi, and by TO and hospital staff in Cape Town, South Africa. Assignment of patients to clinical groups was made by consensus of two experienced clinicians at each site (independent of those managing the patient clinically) after review of the investigation results. Testing for HIV status was conducted after appropriate counseling. Clinical data were anonymised and patient samples identified only by study number. Statistical analysis was conducted only after the RNA expression data and the clinical databases had been locked and deposited for independent verification.

Peripheral Blood RNA Expression by Microarray

Whole blood was collected at the time of recruitment (before or within 24 h of commencing TB treatment in suspected patients) in PAXgene blood RNA tubes (PreAnalytiX), frozen within 3 h of collection, and later extracted using PAXgene blood RNA kits (PreAnalytiX). RNA was shipped frozen to the Genome Institute of Singapore for analysis on HumanHT-12 v.4 expression Beadarrays (Illumina). Additional details of the microarray method, quality control, and analysis are provided in Text S1.

Statistical Analysis

Expression data were analysed using ‘R’ Language and Environment for Statistical Computing (R) 2.12.1 (Text S1). To identify transcript signatures applicable across geographic locations and in patients with differing HIV status, we combined HIV-infected and -uninfected patient cohorts from South Africa and Malawi. The recruited participants were randomly assigned to a training cohort (80% of the participants) and a test cohort (20%) with no overlap, using the “sample( )” function without replacement in ‘R’, which obtains a subset of a given set [32]. For additional validation we used the whole blood expression dataset from Berry et al. [25] comparing TB with LTBI and other infectious diseases in an African case-control study (accession GSE19491) (i.e., the “validation” dataset) (Text S1).

To detect transcripts that were differentially expressed between patients with TB and comparator groups, a linear model was fitted and moderated t-statistics calculated for each transcript with correction for false discovery using Benjamini and Hochberg's method [33]. Significantly differentially expressed transcripts in the training cohort with a |log2 fold change| (FC)>0.5 were subjected to variable selection using elastic net [34] (Text S1) in order to identify the smallest number of transcripts distinguishing TB from the comparator groups. These minimal transcript selected sets for TB versus LTBI and TB versus OD (Tables S1 and S2) were assessed in the test cohort and further evaluated in the validation dataset [25].

A Simplified Method for Identifying Individual Patient's Risk of Active TB

Current whole genome array-based technologies are not well suited for use in resource poor settings as they are costly and require sophisticated technology as well as bioinformatics expertise. We therefore developed a method for translation of multiple transcript RNA signatures into a disease risk score (DRS), which could form the basis of a simple, low cost, diagnostic test requiring basic laboratory facilities and minimal bioinformatics analysis. For each individual, we calculated (on normalized intensities) the DRS using the minimal transcript selected sets for TB versus LTBI and TB versus OD. The score is derived by adding the total intensity at up-regulated transcripts, and subtracting the total intensity at all down-regulated transcripts (Text S1). The threshold for the classification was calculated as the weighted average of risk score within each class (group of patients), with weights given as the inverse of the standard deviation of the score within each class (Text S1). The information that the DRS requires for classification (i.e., the expression values of the transcripts of the signatures) can be derived from the dataset itself, which allows its unbiased application using expression data acquired using other array platforms or non-array technologies. The sensitivity and specificity of the score in disease classification were evaluated on the test cohort and validation dataset.

Accession Numbers

The data discussed in this publication have been deposited in NCBI's Gene Expression Omnibus and are accessible through GEO Series accession number GSE37250 (http://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE37250).

Results

We recruited 311 adults to the South African cohort and 273 to the Malawi cohort meeting the definitions for TB or OD, after screening a total of 314 in South Africa and 437 patients in Malawi (Figures 1 and 2; Table 1). After including samples from LTBI controls that were recruited separately (98 and 77 patients in South Africa and Malawi, respectively) and removing technical failures (48 samples), 536 consecutive patient samples remained for microarray analysis (Figure 2). The spectrum of infectious and malignant diseases in the OD cohorts reflected the range of conditions with similar clinical manifestations to TB at each site (Table 2).

Tab. 1. Clinical and diagnostic features of South Africa and Malawi patients recruited to the study with active tuberculosis, latent TB infection, or other diseases.
Clinical and diagnostic features of South Africa and Malawi patients recruited to the study with active tuberculosis, latent TB infection, or other diseases.
BMI, body mass index; HIV−, HIV-uninfected; HIV+, HIV-infected; IQR, inter quartile range; LTBI, latent TB infection; NA, not applicable; ND, not done; OD, other diseases (see Table 2); SA, South Africa; TB, active TB;

Tab. 2. Major clinical diagnoses in other diseases cohorts.
Major clinical diagnoses in other diseases cohorts.
Bronchial carcinoma (14), lymphoma (4), cervical carcinoma (1), ovarian carcinoma (1), mesothelioma (1), gastric carcinoma (1), metastatic carcinoma of unknown origin (4), benign salivary tumour (1), dermatological tumour (1).

TB Specific RNA Signature That Is Independent of Geographic Location and HIV Status

We performed quality control on the microarray data in order to examine the effect of disease state on transcript expression and to check for assignment errors. Inspection revealed that the primary clustering was based on disease state (TB, LTBI, OD) rather than geographical location or HIV status (Figure S1). There was substantial correlation of TB versus LTBI differential expression across different geographic locations and HIV status, which was also seen for TB versus OD (Figures S2 and S3). This indicates the presence of a robust underlying signature of TB, independent of HIV status or geographical location.

Identification and Validation of Minimal Transcript Sets

To find minimal transcript sets required to discriminate TB from other groups, we applied the variable selection algorithm elastic net [34] to the training cohort (Methods; Text S1). A 27 transcript model was identified for discriminating TB from LTBI in the South Africa/Malawi training and test set (Figure 3A and 3B; Table S1), whilst a 44 transcript model was identified for discriminating TB from OD (Figure 3C and 3D; Table S2). These models were also applied to data from the South Africa validation dataset [25], which, unlike our cohort, included only HIV-uninfected participants (Figure S4).

Heatmaps showing clustering of training and test cohorts using transcriptional signatures.
Fig. 3. Heatmaps showing clustering of training and test cohorts using transcriptional signatures.
Clustering of training (A/C) and test (B/D) cohorts using transcripts identified by elastic net for TB versus LTBI (A/B) and TB versus OD (C/D) (training: nTB = 157 nLTBI = 128/nTB = 153 nOD = 140, test: nTB = 37 nLTBI = 39/nTB = 42 nOD = 34). Rows are transcripts (transcripts shown in red are up-regulated, those in green are down-regulated) and columns are patients regardless of HIV status (purple, patients with TB; green, patients with LTBI; light blue, patients with OD).

Evaluation of a Simplified Disease Risk Score for TB

To evaluate the feasibility of using a simplified diagnostic test based on our transcript sets for TB diagnosis in low resource settings, we applied the DRS to our test cohort, which includes patients that were not used to discover the signatures, and to the South Africa validation dataset [25]. In our combined HIV-infected and -uninfected test set, the 27 transcript DRS discriminated TB from LTBI with sensitivity and specificity of 95%, 95% CI (87–100), and 90%, 95% CI (80–97), respectively, whilst achieving perfect classification in the HIV-uninfected cohorts and a slightly reduced accuracy in the HIV-infected cohorts (Figures 4A, 5A, and 5B; Table 3). In the validation dataset, the DRS achieved a sensitivity of 95%, 95% CI (85–100), and a specificity of 94%, 95% CI (84–100) (Figure 4B; Table 3). As for the discrimination between TB and OD, the 44 transcript DRS's sensitivity and specificity were 93%, 95% CI (83–100), and 88%, 95% CI (74–97), respectively, with consistent accuracy in the HIV-infected and -uninfected test cohorts (Figures 4C, 5C, and 5D; Table 3). In the validation dataset, the patients were classified with 100% sensitivity, 95% CI (100–100), and 96% specificity, 95% CI (93–100) (Figure 4D; Table 3). Similar values for sensitivity and specificity were obtained when the DRS was evaluated in the training dataset, demonstrating the robustness of our approach to avoid overfitting (Table S5). In order to evaluate the classificatory power of the DRS, we compared its performance with the regression model derived from the elastic net based on the same signatures (Table S5). We found that our DRS had similar accuracy in distinguishing TB from LTBI and OD to the weighted regression model.

Classification using the disease risk score on the test cohort and validation dataset.
Fig. 4. Classification using the disease risk score on the test cohort and validation dataset.
Disease risk score and receiver operating characteristic curves based on the TB/LTBI 27 transcript signature (A/B) and the TB/OD 44 transcript signature (C/D) applied to the South African (SA)/Malawi HIV+/− test cohort (A/C) (nTB = 37 nLTBI = 39/nTB = 42 nOD = 34) and independent validation dataset comprising South African patients (B/D) (nTB = 20 nLTBI = 31 nOD = 82). Sensitivity, specificity are reported in Table 3. HIV+, HIV-infected; HIV−, HIV-uninfected. Classification cut-offs: (A) 138.98; (B) 107.76; (C) 154.44; (D) 99.94.

Application of the transcript signatures to the South African and Malawi test cohorts by HIV status.
Fig. 5. Application of the transcript signatures to the South African and Malawi test cohorts by HIV status.
Disease risk score and receiver operating characteristic curves based on the TB/LTBI 27 transcript signature (A/B) and the TB/OD 44 transcript signature (C/D) applied to the HIV-uninfected (HIV−) (A/C) and HIV-infected (HIV+) (B/D) test cohort. Area under the curve, sensitivities, and specificities are reported in Table 3. Classification cut-offs: (A) 131.37; (B) 142.84; (C) 151.10; (D) 142.84.

Tab. 3. Classification achieved using the disease risk score.
Classification achieved using the disease risk score.
The TB/LTBI 27 transcript signature and TB/OD 44 transcript signature were applied to the South African/Malawi HIV-uninfected (HIV−) and HIV-infected (HIV+) test cohort and the independent validation dataset. Sensitivity and specificity calculated using the weighted threshold for classification. The actual numbers of patients that were DRS negative and positive are shown in Table S4.

In order to assess the predictive value of our DRS in a cohort of patients undergoing investigation for persistent symptoms such as cough, fever, and weight loss, i.e., where TB was included in the differential diagnosis, we used the prevalence of TB in our prospective Malawi cohort (58%; 254 confirmed TB cases of 437 patients with suspected TB) to calculate the positive and negative predictive value (PPV/NPV). The DRS for TB versus OD had a PPV of 92%, 95% CI (84–99), and a NPV of 90%, 95% CI (80–100) (Table S7). Using a 20% prevalence, which may be more reflective of a general primary care setting in a high-burden African country, NPV for TB versus OD is higher (98%, 95% CI [96–100]), but PPV decreases (66%, 95% CI [46–87]), emphasizing the value of DRS as a rule-out test, with those patients with positive DRS selected for further investigation (Table S7).

We also explored the effect of adjusting the threshold for the DRS in assigning individual patients to TB or LTBI/OD. By accepting a percentage of patients as “non-classifiable,” the majority of patients under investigation are accurately assigned. These “non-classifiable” patients could then be selected for more detailed investigation (Figure S5).

As it would be advantageous to have a single signature that distinguished TB from non-TB, we assessed the performance of a signature in distinguishing TB from both TB and LTBI. A 53 transcript signature was identified (Table S3) that distinguished TB from both LTBI and OD with sensitivity/specificity 91%/82%—a lower performance than TB/LTBI and TB/OD signatures alone. We also explored whether a smaller number of transcripts could be used to distinguish TB from LTBI and from OD, which would aid in manufacturing of a test (Text S1), resulting in a 21 and 29 transcript signature for distinguishing TB from LTBI and OD, respectively. The sensitivity of the smaller models was 6%–10% lower than the original models, while retaining the same specificity for TB versus OD (Table S8).

In contrast to our approach, previous studies of RNA expression as a diagnostic tool for TB have excluded HIV-infected patients, and have used other disease controls that were not recruited concurrently with TB cases or from the same population of patients undergoing investigation for TB [19],[21],[22],[24],[25]. To establish how these differences in biomarker study design might affect performance of biomarker signatures, we compared the performance of our 27 transcript TB/LTBI signature and our 44 transcript TB/OD signature with the performance of the signatures of Berry et al. [25] for discrimination of TB versus LTBI (393 transcripts) and TB versus OD (86 transcripts). While the 393 TB/LTBI signature achieved a sensitivity of 88%, 95% CI (80–94), and a specificity of 84%, 95% CI (76–92), on our TB HIV-uninfected cohorts, the performance on the HIV-infected group was 74%, 95% CI (65–82), and 80%, 95% CI (71–87), respectively (Figure 6; Table 4). Furthermore, the Berry et al. TB/OD 86 transcript signature had a lower performance on our cohorts (sensitivity 71%, 95% CI (62–80), specificity 76%, 95% CI (67–84), in HIV-uninfected; sensitivity 67%, 95% CI (58–75), specificity 69%, 95% CI (59–78), in HIV-infected) (Figure 6; Table 4). Thus our minimal transcript signatures and the DRS method show better performance in distinguishing TB from LTBI and OD (especially in the HIV-infected cohorts) than the much larger number of transcripts identified by Berry et al. [25]. (Table 5)

Application of transcript signatures <em class=&quot;ref&quot;>[<b>25</b>]</em> to the combined South Africa and Malawi cohorts.
Fig. 6. Application of transcript signatures [25] to the combined South Africa and Malawi cohorts.
Disease risk score and receiver operating characteristic curves based on transcript signatures of Berry et al. [25] for TB versus LTBI (A/B/C) and TB versus OD (D/E/F) applied to the combined training and test cohorts in HIV-uninfected (HIV−) and HIV-infected (HIV+) (A/D), HIV− (B/E), and HIV+ (C/F) cohorts (Table 4 for sensitivities, specificities, and area under the curve). Classification cut-offs: (A) 1,847.73; (B) 1,777.65; (C) 1,898.97; (D) 172.12; (E) 170.30; (F) 173.70.

Tab. 4. Application of published signatures to the South Africa and Malawi cohorts.
Application of published signatures to the South Africa and Malawi cohorts.
Sensitivities, specificities, and area under curve based on transcript signatures of Berry et al. [25] for TB versus LTBI (393 transcripts), and TB versus OD (86 transcripts) applied to the South African/Malawi HIV-uninfected (HIV−) and HIV-infected (HIV+) cohorts.

Tab. 5. Performance of the TB/LTBI 27 and TB/OD 44 transcript signatures and the transcript signatures of Berry et al. [25] when applied to our test cohort.
Performance of the TB/LTBI 27 and TB/OD 44 transcript signatures and the transcript signatures of Berry et al. <em class=&quot;ref&quot;>[25]</em> when applied to our test cohort.
Comparison of the statistical measures of performance of disease classification using our TB/LTBI 27 and TB/OD 44 transcript signatures with the classification using the 393 (−6 transcript) and 86 (−1 transcript) transcript signatures from Berry et al. [25]. The marked improvement shown for HIV+ individuals in both TB versus LTBI and TB versus OD comparisons suggests that transcript signatures must be derived from both HIV-infected and -uninfected individuals in order to have a diagnostic value in these populations. The performance of our signatures in TB versus OD comparison highlights the need for real world “other disease” controls when deriving biomarkers from clinical cohorts.

Finally, we evaluated the performance of our signatures in the smear-negative sub-group of patients with TB, the majority of whom were HIV-infected (31 smear-negative TB patients with definite negative smear status; seven TB HIV-uninfected and 24 TB HIV-infected). In the smear-negative patients the DRS showed a sensitivity for detecting TB of 68%, 95% CI (52–84), when using the TB versus LTBI signature and a sensitivity of 90%, 95% CI (81–100), with the TB/OD signature, both of which are comparable to results obtained in the larger HIV-infected cohort of smear-positive and -negative patients. As we used the same LTBI and OD patients from the test set, the specificity was unchanged (90%, 95% CI (80–97), for TB versus LTBI and 88%, 95% CI (74–97), for TB versus OD) (Table S9).

Discussion

We have identified a host blood transcriptomic signature that distinguishes TB from a wide range of OD prevalent in HIV-infected and -uninfected African patients. We found that patients with TB can be distinguished from LTBI with only 27 transcripts and from OD with 44 transcripts. Our findings appear robust as the results are reproducible in both HIV-infected and -uninfected cohorts, in different geographic locations, and in an independent TB patient dataset. The high sensitivity and specificity of the signatures in distinguishing TB from OD, even in the HIV-infected patients that have differing levels of T cell depletion and a wide spectrum of opportunistic infections as well as HIV-related complications, suggests that the signatures are promising biomarkers of TB. The relatively small number of transcripts in our signatures may increase the potential for using transcriptional profiling as a clinical diagnostic tool from a single peripheral blood sample (i.e., using a multiplex assay [35],[36]).

The major challenge for diagnosis of TB in Africa is how to distinguish this disease from the range of other conditions that show similar symptoms in countries where TB and HIV are co-endemic. Previous TB biomarker studies have focused on distinguishing patients with TB from healthy controls, or from LTBI [21],[22],[24], or have used other disease controls that may not represent the “real world” disease spectra from which TB should be clinically differentiated [19],[25]. Furthermore, these TB biomarker studies have also excluded HIV co-infected patients who are the group that most need new diagnostics. Our study design should ensure that our signatures are applicable in TB/HIV endemic countries as we recruited patients with TB concurrently with patients with a range of conditions that present with similar clinical features to TB, as well as recruiting both HIV-infected and -uninfected individuals.

We have identified separate signatures for distinguishing TB/OD and TB/LTBI, which only overlap in three transcripts. In practice the clinical applications of these signatures might be distinct as the TB/LTBI signature would be of value in contact screening, where the concern is distinguishing active disease from previous exposure in minimally symptomatic individuals. The TB/OD signature would be of most value in evaluating symptomatic patients presenting to medical services with symptoms of TB. We have also explored whether a single signature might be used to distinguish TB from both LTBI and OD. The combined signature showed lower performance to the separate TB/LTBI and TB/OD signatures. Further exploration of the operational performance of a combined signature or separate signatures is needed to establish the best strategy.

Although our signatures and DRS distinguished the majority of patients with TB from those with LTBI or OD, a proportion of patients were not correctly classified. There is increasing recognition that TB and LTBI may represent a dynamically evolving continuum, particularly in HIV-infected patients and thus failure to culture M.TB is not absolute proof that TB is not present. Some false assignment by our current “gold standard” is to be expected as noted by post mortem studies at which undiagnosed TB is confirmed [14],[15]. All patients in the OD group presented with symptoms for which TB was included in the differential diagnosis, and it is possible that TB may have been misdiagnosed in a small proportion of OD patients despite the extensive clinical investigation used to assign each patient to each diagnostic group. Some improvement in sensitivity and specificity of our DRS may also be achieved by weighting the signal from the most discriminatory transcripts, and this could be explored in subsequent refinements of the method.

A major concern in using transcriptional signatures as a clinical diagnostic tool in resource poor settings is the complexity, as well as cost, of the current methodologies. Our results have shown that transcriptional signatures can be used to distinguish TB from OD in an African setting. We explored the feasibility of a simplified method for disease categorization that may facilitate development of a diagnostic test based on our signatures. Our DRS provides a new approach that enables the use of multi-transcript signatures for individual disease risk assignment without the requirement for complex analysis. Our method could be used to develop a simple test in which the transcripts comprising the diagnostic signature (separated into those that are either up- or down-regulated in TB relative to controls) are each measured using a suitable detection system [35], and the combined signature used to identify each patient's risk of TB. For example, a simple test using the TB/OD signature probes that show increased transcript expression in TB relative to OD could be located in a single well or tube, and those probes that show reduced transcript expression in TB located in a second well or tube. Binding of RNA from a patient's blood to these probes could be detected as a combined signal from each tube using one of the aforementioned detection systems. To allow normalization, expression of up- or down-regulated transcripts in an individual patient could be compared with that of housekeeping genes, which do not show variation between healthy and disease states. There are methods for rapid detection of multi-transcript signatures including lateral flow reverse transcription (RT)-PCR based systems, nano-pore technology [37], nano-particle enzyme linked detection [38],[39], and detection using nano-wires and electrical impedance [40]. Some of these may be suitable for direct analysis of multiple transcript signatures in blood and at a relatively low cost.

While this study provides a proof of principle that relatively small numbers of RNA transcripts can be used to discriminate active TB from latent TB infection and OD in Africa, limitations remain that need to be addressed in order to translate these results into a clinical test. One such limitation is that our study has not assessed performance of our DRS in patients treated for TB solely on the basis of clinical suspicion, without any microbiological confirmation. Amongst these “probable/possible” patients with TB, there is no gold standard to evaluate any new biomarker. Exclusion of probable/possible patients with TB may have produced better estimates of sensitivity and specificity than would be achieved in a prospective “all comers” study including the entire cohort of patients in whom TB is included in the differential diagnosis. Thus, further evaluation using a prospective population based study in which the decision whether and when to initiate TB treatment is evaluated against the new biomarker is required. Future studies will also be required to refine the use of these biomarkers in a clinical decision process either as an initial screening tool, or in conjunction with more detailed culture based diagnostics.

From a clinical perspective a simple transcriptome-based test that reliably diagnoses or excludes TB in the majority of patients undergoing investigation for suspected TB, using a single blood sample, would be of great value, allowing scarce hospital resources to be focused on the small proportion of patients where the result was indeterminate. The challenge for the academic research community and for industry is to develop innovative methods to translate multi-transcript signatures into simple, cheap tests for TB suitable for use in African health facilities.

Supporting Information

Attachment 1

Attachment 2

Attachment 3

Attachment 4

Attachment 5

Attachment 6

Attachment 7

Attachment 8

Attachment 9

Attachment 10

Attachment 11

Attachment 12

Attachment 13

Attachment 14

Attachment 15

Attachment 16


Zdroje

1. MunthaliL, MwaunguluJN, MunthaliK, BowieC, CrampinAC (2006) Using tuberculosis suspects to identify patients eligible for antiretroviral treatment. Int J Tuberc Lung Dis 10: 199–202.

2. LawnSD, WoodR (2011) Tuberculosis in antiretroviral treatment services in resource-limited settings: addressing the challenges of screening and diagnosis. J Infect Dis 204 Suppl 4: S1159–S1167.

3. AabyeMG, RavnP, PrayGodG, JeremiahK, MugomelaA, et al. (2009) The impact of HIV infection and CD4 cell count on the performance of an interferon gamma release assay in patients with pulmonary tuberculosis. PLoS One 4: e4220 doi:10.1371/journal.pone.0004220

4. ChamieG, LuetkemeyerA, Walusimbi-NantezaM, OkweraA, WhalenCC, et al. (2010) Significant variation in presentation of pulmonary tuberculosis across a high resolution of CD4 strata. Int J Tuberc Lung Dis 14: 1295–1302.

5. KwanCK, ErnstJD (2011) HIV and tuberculosis: a deadly human syndemic. Clin Microbiol Rev 24: 351–376.

6. VittorAY, GarlandJM, SchlossbergD (2011) Improving the diagnosis of tuberculosis: From QuantiFERON to new techniques to diagnose tuberculosis infections. Curr HIV/AIDS Rep 8: 153–163.

7. WoodR, MaartensG, LombardCJ (2000) Risk factors for developing tuberculosis in HIV-1-infected adults from communities with a low or very high incidence of tuberculosis. J Acquir Immune Defic Syndr 23: 75–80.

8. PerkinsMD, CunninghamJ (2007) Facing the crisis: improving the diagnosis of tuberculosis in the HIV era. J Infect Dis 196 Suppl 1: S15–27.

9. LangeC, PaiM, DrobniewskiF, MiglioriGB (2009) Interferon-gamma release assays for the diagnosis of active tuberculosis: sensible or silly? Eur Respir J 33: 1250–1253.

10. RangakaMX, GideonHP, WilkinsonKA, PaiM, Mwansa-KambafwileJ, et al. (2012) Interferon release does not add discriminatory value to smear-negative HIV-tuberculosis algorithms. Eur Respir J 39: 163–171.

11. BoehmeCC, NabetaP, HillemannD, NicolMP, ShenaiS, et al. (2010) Rapid molecular detection of tuberculosis and rifampin resistance. N Engl J Med 363: 1005–1015.

12. BoehmeCC, NicolMP, NabetaP, MichaelJS, GotuzzoE, et al. (2011) Feasibility, diagnostic accuracy, and effectiveness of decentralised use of the Xpert MTB/RIF test for diagnosis of tuberculosis and multidrug resistance: a multicentre implementation study. Lancet 377: 1495–1505.

13. PronykPM, KahnK, HargreavesJR, TollmanSM, CollinsonM, et al. (2004) Undiagnosed pulmonary tuberculosis deaths in rural South Africa. Int J Tuberc Lung Dis 8: 796–799.

14. CoxJA, LukandeRL, LucasS, NelsonAM, Van MarckE, et al. (2010) Autopsy causes of death in HIV-positive individuals in sub-Saharan Africa and correlation with clinical diagnoses. AIDS Rev 12: 183–194.

15. AnsariNA, KombeAH, KenyonTA, HoneNM, TapperoJW, et al. (2002) Pathology and causes of death in a group of 128 predominantly HIV-positive patients in Botswana, 1997–1998. Int J Tuberc Lung Dis 6: 55–63.

16. ChaussabelD, PascualV, BanchereauJ (2010) Assessing the human immune system through blood transcriptomics. BMC Biol 8: 84.

17. van 't VeerLJ, DaiH, van de VijverMJ, HeYD, HartAA, et al. (2002) Gene expression profiling predicts clinical outcome of breast cancer. Nature 415: 530–536.

18. RamiloO, AllmanW, ChungW, MejiasA, ArduraM, et al. (2007) Gene expression patterns in blood leukocytes discriminate patients with acute infections. Blood 109: 2066–2077.

19. MaertzdorfJ, WeinerJ, MollenkopfHJ, NetworkT, BauerT, et al. (2012) Common patterns and disease-related signatures in tuberculosis and sarcoidosis. Proc Natl Acad Sci U S A 109: 7853–7858.

20. LeshoE, ForestieroFJ, HirataMH, HirataRD, CeconL, et al. (2011) Transcriptional responses of host peripheral blood cells to tuberculosis infection. Tuberculosis (Edinb) 91: 390–399.

21. MaertzdorfJ, OtaM, RepsilberD, MollenkopfHJ, WeinerJ, et al. (2011) Functional correlations of pathogenesis-driven gene expression signatures in tuberculosis. PLoS ONE 6: e26938 doi:10.1371/journal.pone.0026938

22. MaertzdorfJ, RepsilberD, ParidaSK, StanleyK, RobertsT, et al. (2011) Human gene expression profiles of susceptibility and resistance in tuberculosis. Genes Immun 12: 15–22.

23. MistryR, CliffJM, ClaytonCL, BeyersN, MohamedYS, et al. (2007) Gene-expression patterns in whole blood identify subjects at risk for recurrent tuberculosis. J Infect Dis 195: 357–365.

24. JacobsenM, RepsilberD, GutschmidtA, NeherA, FeldmannK, et al. (2007) Candidate biomarkers for discrimination between infection and disease caused by Mycobacterium tuberculosis.. J Mol Med (Berl) 85: 613–621.

25. BerryMP, GrahamCM, McNabFW, XuZ, BlochSA, et al. (2010) An interferon-inducible neutrophil-driven blood transcriptional signature in human tuberculosis. Nature 466: 973–977.

26. LuC, WuJ, WangH, WangS, DiaoN, et al. (2011) Novel biomarkers distinguishing active tuberculosis from latent infection identified by gene expression profile of peripheral blood mononuclear cells. PLoS One 6: e24290 doi/10.1371/journal.pone.0024290.

27. WHO (2011) Global tuberculosis control: WHO report. Geneva: WHO.

28. KranzerK, van SchaikN, KarmueU, MiddelkoopK, SebastianE, et al. (2011) High prevalence of self-reported undiagnosed HIV despite high coverage of HIV testing: a cross-sectional population based sero-survey in South Africa. PLoS One 6: e25244 doi/10.1371/journal.pone.0025244.

29. CrampinAC, FloydS, MwaunguluF, BlackG, NdhlovuR, et al. (2001) Comparison of two versus three smears in identifying culture-positive tuberculosis patients in a rural African setting with high HIV prevalence. Int J Tuberc Lung Dis 5: 994–999.

30. HussainR, KaleemA, ShahidF, DojkiM, JamilB, et al. (2002) Cytokine profiles using whole-blood assays can discriminate between tuberculosis patients and healthy endemic controls in a BCG-vaccinated population. J Immunol Methods 264: 95–108.

31. FrankenKL, HiemstraHS, van MeijgaardenKE, SubrontoY, den HartighJ, et al. (2000) Purification of his-tagged proteins by immobilized chelate affinity chromatography: the benefits from the use of organic solvent. Protein Expr Purif 18: 95–99.

32. Ripley BD (1987) Stochastic simulation. New York: Wiley & Sons.

33. BenjaminiY, HochbergY (1995) Controlling the false discovery rate - a practical and powerful approach to multiple testing. J R Stat Soc Series B Stat Methodol 57: 289–300.

34. ZouH, HastieT (2005) Regularization and variable selection via the elastic net. J R Stat Soc Series B Stat Methodol 67: 301–320.

35. JoostenSA, GoemanJJ, SutherlandJS, OpmeerL, de BoerKG, et al. (2012) Identification of biomarkers for tuberculosis disease using a novel dual-color RT-MLPA assay. Genes Immun 13: 71–82.

36. ElderingE, SpekCA, AbersonHL, GrummelsA, DerksIA, et al. (2003) Expression profiling via novel multiplex assay allows rapid assessment of gene regulation in defined signalling pathways. Nucleic Acids Res 31: e153.

37. WangY, ZhengD, TanQ, WangMX, GuLQ (2011) Nanopore-based detection of circulating microRNAs in lung cancer patients. Nat Nanotechnol 6: 668–674.

38. LaromaineA, KohL, MurugesanM, UlijnRV, StevensMM (2007) Protease-triggered dispersion of nanoparticle assemblies. J Am Chem Soc 129: 4156–4157.

39. LoweSB, DickJA, CohenBE, StevensMM (2012) Multiplex sensing of protease and kinase enzyme activity via orthogonal coupling of quantum dot-peptide conjugates. ACS Nano 6: 851–857.

40. MorrowTJ, LiM, KimJ, MayerTS, KeatingCD (2009) Programmed assembly of DNA-coated nanowire devices. Science 323: 352.

Štítky
Interní lékařství

Článek vyšel v časopise

PLOS Medicine


2013 Číslo 10
Nejčtenější tento týden
Nejčtenější v tomto čísle
Kurzy Podcasty Doporučená témata Časopisy
Přihlášení
Zapomenuté heslo

Zadejte e-mailovou adresu, se kterou jste vytvářel(a) účet, budou Vám na ni zaslány informace k nastavení nového hesla.

Přihlášení

Nemáte účet?  Registrujte se

#ADS_BOTTOM_SCRIPTS#