TB Research

Analysis code and results table for a replicate-anchored calibration of intra-host single nucleotide variant detection in Mycobacterium tuberculosis whole genome sequencing

Marie Nancy Séraphin

Zenodo (CERN European Organization for Nuclear Research) · 2026-01

Abstract

This repository contains the analysis pipeline for calibration of low-frequency variant (iSNV) detection thresholds in short-read whole-genome sequencing of Mycobacterium tuberculosis sputum cultures. Thresholds are anchored using within-patient replicate concordance via a lexicographic selection rule, with bootstrap-based stability assessment. The pipeline applies the calibrated rule to a cohort of patients to estimate per-patient prevalence and longitudinal persistence of within-host diversity.

MeSH terms

  • Mycobacterium tuberculosis
  • Whole genome sequencing
  • Replicate
  • Calibration
  • Biology
  • Concordance
  • Selection (genetic algorithm)
  • Single-nucleotide polymorphism
  • Tuberculosis
  • Computational biology
  • Genetics
  • Pipeline (software)
  • Sputum
  • Genotyping
  • Genome
  • Pyrosequencing