Leveraging explainable artificial intelligence and spatial analysis for communicable diseases in Asia (2000-2022) based on health, climate, and socioeconomic factors.
Md Siddikur Rahman, Md Abu Bokkor Shiddik
International journal of health geographics · 2025-12
Abstract
BACKGROUND: Communicable diseases remain a significant public health challenge in Asia, driven by diverse climatic, socioeconomic, and healthcare-related factors. Despite reductions in diseases such as tuberculosis and malaria, persistent hotspots highlight the need for deeper investigation. This study applies machine learning and spatial analysis techniques to examine patterns and determinants of communicable diseases across 41 countries from 2000 to 2022.
METHODS: Data were sourced from global repositories, including WHO, CRU TS, WDI, and UNICEF, covering disease cases (e.g., tuberculosis, dengue, malaria), climaticvariables (e.g., precipitation, humidity), and healthcare metrics (e.g., hospital bed density). Missing values were imputed using random forest methods. Outlier detection was conducted using Mahalanobis distances, identifying and addressing significant deviations to ensure data consistency. Models like XGBoost and Random Forest were assessed using RMSE, MAE, and R². SHAP and XAI frameworks improved interpretability, while Gi* spatial statistics revealed disease hotspots and disparities.
RESULTS: Tuberculosis cases declined from 8.01 million (2000) to 7.54 million (2022), with hotspots in India (Gi* = 3.07) and Nepal (Gi* = 4.67). Malaria cases dropped from 27.00 million (2000) to 7.96 million (2022), yet Bangladesh (Gi* = 4.13) and Pakistan (Gi* = 4.17) exhibited sustained risk. Dengue peaked at 2.71 million cases in 2019, with current hotspots in Malaysia (Gi* = 2.4) and Myanmar (Gi* = 0.79). Spatial disparities underscore the influence of precipitation, relative humidity, and healthcare gaps. XGBoost achieved remarkable accuracy (e.g., tuberculosis: RMSE = 0.94, R² = 0.91), and SHAP analysis revealed critical predictors such as climatic factors.
CONCLUSION: This study demonstrates the effectiveness of integrating machine learning, spatial analysis, and XAI to uncover disease determinants and guide targeted interventions. The findings offer actionable insights for improving disease surveillance, resource allocation, and public health strategies across Asia.
MeSH terms
- Humans
- Spatial Analysis
- Asia
- Communicable Diseases
- Artificial Intelligence
- Socioeconomic Factors
- Climate