TBsim Data Modules

This module provides data extraction, processing, and management utilities for TBsim simulations.

Available Data Modules

extract_south_africa

Extract South Africa demographic data from UN World Population Prospects (WPP)

extract_gtb_data

Extract Global TB data for calibration and validation

Key Features

  • Demographic Data: Population data from authoritative sources

  • TB Epidemiology: Global TB statistics and trends

  • Data Processing: Automated extraction and formatting

  • Calibration Support: Data for model parameter fitting

  • Validation Data: Real-world data for model validation

Usage Examples

Extracting South Africa data:

from tbsim.data.extract_south_africa import (
    extract_south_africa_cbr,
    extract_south_africa_asmr,
    extract_south_africa_asfr
)

# Extract crude birth rates
cbr_data = extract_south_africa_cbr()

# Extract age-sex-specific mortality rates
asmr_data = extract_south_africa_asmr()

# Extract age-specific fertility rates
asfr_data = extract_south_africa_asfr()

Extracting Global TB data:

from tbsim.data.extract_gtb_data import extract_gtb_data

# Extract global TB statistics
gtb_data = extract_gtb_data(
    countries=['South Africa', 'India', 'China'],
    years=[2010, 2020]
)

Data Sources

UN World Population Prospects (WPP)
  • Life tables by age and sex

  • Demographic indicators

  • Fertility rates by age

  • Population projections

Global TB Report
  • TB incidence and prevalence

  • Treatment outcomes

  • Mortality statistics

  • Drug resistance data

Required Files
  • WPP2024_Life_Table_Complete_Medium_Female_1950-2023.csv

  • WPP2024_Life_Table_Complete_Medium_Male_1950-2023.csv

  • WPP2024_Demographic_Indicators_Medium.csv

  • WPP2024_Fertility_by_Age1.csv

Output Files

Demographic Data
  • South_Africa_CBR.csv: Crude birth rates by year

  • South_Africa_ASMR.csv: Age-sex-specific mortality rates

  • South_Africa_ASFR.csv: Age-specific fertility rates

TB Data
  • Country-specific TB statistics

  • Time series data for calibration

  • Validation datasets

These data modules provide the foundation for realistic TBsim simulations with real-world demographic and epidemiological data.