TB Research

Identification of Misdiagnosis Factors in Allergic Bronchopulmonary Mycosis Using Explainable Machine Learning

Chen X, Yu Z, Gong X, Cai R, Ge H, Jia Y, Tang J, Huang L, et al. (10 authors)

DOAJ (DOAJ: Directory of Open Access Journals) · 2026-05

Abstract

Xuemei Chen,1,2,* Zekai Yu,3,* Xiaoxiao Gong,1,2,* Runjin Cai,1,2,* Huan Ge,1,2 Yunbing Jia,1,2 Jiale Tang,1,2 Leng Huang,1,2 Xiaozhao Li,2,4 Juntao Feng1,2 1Department of Respiratory Medicine, National Key Clinical Specialty, Branch of National Clinical Research Center for Respiratory Disease, Xiangya Hospital, Central South University, Changsha, Hunan, 410008, People’s Republic of China; 2National Clinical Research Center for Geriatric Disorders, Xiangya Hospital, Central South University, Changsha, Hunan, 410008, People’s Republic of China; 3School of Computer Science and Technology, Hangzhou Dianzi University, Hangzhou, Zhejiang Province, 310018, People’s Republic of China; 4Department of Nephrology, Xiangya Hospital, Central South University, Changsha, People’s Republic of China*These authors contributed equally to this workCorrespondence: Juntao Feng, Department of Respiratory Medicine, National Key Clinical Specialty, Branch of National Clinical Research Center for Respiratory Disease, Xiangya Hospital, Central South University, Changsha, Hunan, People’s Republic of China, Email jtfeng1976@csu.edu.cn Xiaozhao Li, Department of Nephrology, Xiangya Hospital, Central South University, Changsha, Hunan, People’s Republic of China, Email lixiaozhao@csu.edu.cnBackground: Allergic bronchopulmonary aspergillosis/mycosis (ABPA/ABPM) is frequently misdiagnosed as pulmonary tuberculosis due to overlapping clinical and radiological features.Methods: In a retrospective cohort of 89 multidisciplinary team (MDT)–confirmed ABPA/ABPM patients, we investigated determinants of misdiagnosis and disentangled causal drivers from spurious associations. Machine learning models with SHAP interpretation were utilized alongside a Double Machine Learning framework to estimate the average treatment effects of five key features while adjusting for major confounders.Results: Immunological markers were identified as the dominant contributors to misdiagnosis. Total serum IgE showed the strongest protective causal effect against misdiagnosis, followed by Aspergillus fumigatus–specific IgE. Bronchiectasis demonstrated a modest protective effect. These findings were robust across covariate balance, overlap, and placebo analyses.Conclusion: Our results indicate that ABPA/ABPM misdiagnosis is driven primarily by underrecognition of immunological features rather than imaging findings, underscoring the importance of systematic immunologic assessment to reduce diagnostic delay and unnecessary anti-tuberculosis treatment, particularly in tuberculosis-endemic settings. Infographic on ABPA/M diagnosis: data, methods, clinical impact via machine learning.The infographic is divided into three sections: Data and Processing, Methodology and Results and Clinical Implications. The Data and Processing section describes the patient groups, with 89 patients diagnosed with ABPA/M by MDT, including 55 correctly diagnosed and 34 misdiagnosed. It mentions data standardization using z-standardization and feature selection through LASSO with cross-validation. A map of China is shown, highlighting the lack of diagnostic criteria for ABPA/M and its frequent misdiagnosis as tuberculosis. The Methodology section outlines the use of LASSO regression with cross-validation, model selection involving six machine learning models and SHAP analysis for model interpretation. It includes causal inference using a Double Machine Learning framework and Average Treatment Effect estimation. The Results and Clinical Implications section highlights feature selection results, noting the best performance by the Glmnet model with an AUC of 0.808. It identifies slgE and tIgE as key factors, with low levels leading to misdiagnosis and confirms their protective effect. Clinical implications include identifying patients at high risk of misdiagnosis, suggesting combined immunological and radiological assessments and reducing inappropriate treatment.Keywords: allergic bronchopulmonary aspergillosis, allergic bronchopulmonary mycosis, machine learning, SHAP, diagnosis

MeSH terms

  • Medicine
  • Allergic bronchopulmonary aspergillosis
  • Bronchiectasis
  • Artificial intelligence
  • Machine learning
  • Identification (biology)
  • Respiratory system
  • Tuberculosis
  • Respiratory infection
  • Asthma
  • Cohort
  • Retrospective cohort study