TB Research

About TB Research

tbresearch.orgis the open, machine-readable archive of the world's tuberculosis knowledge. We pull every published TB paper from the open biomedical literature, normalize it to markdown, embed it, and serve hybrid keyword + semantic search with cited RAG answers. The full vision is in docs/02_prd.md.

Current corpus

What you can do today

What we don't do yet

Architecture

[ PubMed · Europe PMC · ClinicalTrials.gov · OpenAlex ]
                       ↓
              [ ingest + normalize ]
                       ↓
          [ Postgres + pgvector + tsvector ]
                       ↓
                 [ hybridSearch ]
                       ↓
           ┌───────────┴───────────┐
           ↓                       ↓
     Search results       Ask TB Research (RAG)
                                   ↓
                       Claude Sonnet 4.6
                       + strict citations

Stack

Licensing

Code: Apache 2.0. Corpus metadata: CC0. Document content preserves its source license. Bulk exports surface license information via the X-TBRsch-License header so downstream consumers can honor it.