TB Research

Construction of a diagnostic model for tuberculosis based on long non-coding RNA

Xiuxiu Ji, Siyu Yao, Hongyan Jia, Qi Sun, Yingchao Wang, Xuetian Shang, Zeqi Wang, Mailing Huang, et al. (12 authors)

Annals of Medicine · 2026-01

Abstract

BACKGROUND: The World Health Organization encourages the development of novel diagnostic tools based on 'non-sputum' samples to meet global goals for tuberculosis (TB) control. We aimed to develop a machine learning-driven model for TB diagnosis, using long non-coding RNAs (lncRNAs) as biomarkers. METHODS: Peripheral blood mononuclear cells (PBMCs) from 10 TB patients, 10 latent TB infection individuals (LTBI), and 10 healthy controls (HCs) underwent microarray analysis, and the TB-related lncRNA modules were identified by weighted gene co-expression network analysis (WGCNA). Key lncRNAs were validated by qPCR and selected using LASSO regression. Five machine learning algorithms were employed to construct a diagnostic model, with the ROC analysis assessing its performance. RESULTS: = 206) showed an AUC of 0.92 (95%CI: 0.88-0.95). CONCLUSIONS: This study established a novel, blood-based diagnostic model incorporating five host-derived lncRNAs with an AdaBoost algorithm, offering a non-sputum approach to enhance TB diagnosis.

MeSH terms

  • Tuberculosis
  • Medicine
  • Computational biology
  • RNA
  • Virology
  • Computer science
  • AdaBoost
  • Bioinformatics
  • Disease
  • Mycobacterium tuberculosis
  • Biology