Integrative machine learning approaches with genomic data for predicting antitubercular drug resistance: A systematic review and meta-analysis.

Rohan Shrivastava, Kashmi Sharma, Somesh Mishra, Poonam Parihar, Gobardhan Das, Khushhali Menaria Pandey, Anand Kumar Maurya, Vineet Kumar Sharma, et al. (16 authors)

Journal of global antimicrobial resistance · 2026-01

Abstract

OBJECTIVES: Tuberculosis (TB) remains a leading infectious cause of death, with drug-resistant TB threatening control gains worldwide Rapid, accurate prediction of resistance to first-line agents is essential to guide therapy. The objective of this meta-analysis is to evaluate the diagnostic performance of machine learning (ML) algorithms trained on genomic data for predicting phenotypic resistance to the antitubercular drugs.

METHODS: We searched 4 databases for original studies applying ML to whole-genome or targeted sequencing for resistance prediction against rifampicin (RIF), isoniazid (INH), ethambutol (EMB), streptomycin (STM), and other routinely tested drugs. Random-effects meta-analyses (REML) pooled sensitivity and specificity; Area Under Curve (AUC) was meta-analysed after logit transformation. Publication bias was examined via funnel plots plus Egger's and Begg's tests with trim-and-fill adjustment.

RESULTS: Seven eligible studies were included. Pooled performance was strongest for RIF and INH (RIF: sensitivity 0.90, specificity 0.95; INH: sensitivity 0.88, specificity 0.93). EMB and STM showed lower sensitivities despite reasonable AUCs. Forest plots for AUC (including logit scale), sensitivity, and specificity demonstrated drug-wise variation. Across studies, specificity (mean ≈ 0.92) exceeded sensitivity (mean ≈ 0.75). Bias diagnostics revealed marked funnel plot asymmetry; trim-and-fill imputation added 22, 29, and 23 studies for AUC, sensitivity, and specificity, respectively, yielding adjusted pooled estimates of ∼0.89 (AUC), 0.70 (sensitivity), and 0.93 (specificity).

CONCLUSIONS: ML models trained on genomic data demonstrate high diagnostic accuracy and robust discriminative ability for predicting first-line drug resistance-particularly for RIF and INH-although sensitivity remains variable across drugs and model types. Standardized external validation and calibration are needed before broad clinical deployment.

MeSH terms

Antitubercular Agents
Machine Learning
Humans
Mycobacterium tuberculosis
Genomics
Tuberculosis, Multidrug-Resistant
Microbial Sensitivity Tests
Rifampin
Drug Resistance, Bacterial
Sensitivity and Specificity
Isoniazid

Integrative machine learning approaches with genomic data for predicting antitubercular drug resistance: A systematic review and meta-analysis.

Abstract

MeSH terms

Also indexed in

Related papers