AI teams building models that depend on economic conditions need diverse, realistic macro environments for training and testing. WorldSim generates thousands of structurally coherent country-year trajectories across 195 countries, available as raw paths or via API.
Every trajectory respects 100+ structural coupling rules. Every dataset comes with full audit trail and deterministic reproducibility. This is synthetic macro data that behaves like the real world because it's built on the same structural relationships.
The synthetic data market is projected to reach $2.7B by 2030 (39% CAGR). WorldSim serves two distinct segments.
Banks under CCAR, Basel, and EBA frameworks need macroeconomic stress scenarios to test portfolio resilience. Traditional approaches use fixed scenario sets from regulators or generate synthetic data with GANs/VAEs that lack structural coherence.
ML teams building credit scoring, demand forecasting, insurance pricing, or economic prediction models need diverse macro environments for training and robustness testing. Current synthetic data generators (GANs, VAEs) produce statistically plausible data but without causal structure.
Not just numbers. Structurally coherent, causally consistent, reproducible synthetic macro environments.
Up to 10,000 individual simulation paths per country per scenario, each containing 26 KPI values per year from 2025 to 2050. Delivered as CSV or Parquet. Each path is a complete, internally consistent synthetic country trajectory.
P10/P50/P90 quantile outputs per KPI per year, plus full histograms at any target year. Use the distributions directly for model calibration, or sample from the raw paths for training data diversity.
This is what separates WorldSim from GAN-generated synthetic data. Every path respects 100+ structural coupling rules: when GDP falls, unemployment rises, migration outflows increase, fiscal revenue drops, and debt accumulates. The causal structure is preserved across every trajectory.
Programmatic access to submit scenario configurations, retrieve raw paths, and run batch sweeps. Submit 1,000 scenario configs, each generating 5,000+ paths across any set of countries. Enterprise-grade throughput for production ML pipelines.
A single Enterprise batch run can generate:
All structurally coherent, causally consistent, and fully reproducible.
GANs and VAEs generate statistically plausible data. WorldSim generates causally coherent data. The difference matters.
Does your credit scoring model degrade when unemployment doubles? When inflation hits 8%? Generate thousands of macro environments and test model robustness across the full spectrum of plausible economic conditions.
Consumer demand depends on GDP, inflation, employment, and housing costs. WorldSim generates macro environments where these variables interact realistically, providing better training data than historical time series from a single country.
Insurance claims correlate with macro conditions: recessions increase defaults, energy crises affect health costs, crime rates respond to unemployment. WorldSim provides the structurally coherent macro scenarios that actuarial models need.
Move beyond historical VaR. Generate thousands of forward-looking macro scenarios with structural coupling, then map them to asset returns. The distributional output gives you the tail risk that historical data doesn't cover.
Does your model perform differently for users in high-inflation vs low-inflation countries? Generate controlled macro environments where only specific variables change, isolating the effect on model predictions for fairness testing.
Historical macro data is limited: ~25 years of clean data for most countries, with only 2-3 recession episodes. WorldSim generates thousands of recession, recovery, and crisis scenarios that respect structural coupling, massively expanding your training set.
WorldSim covers 195 countries globally. Every simulation produces structured, multi-layered output that AI teams can consume as training data, test environments, or contextual inputs for LLM-based systems.
WorldSim isn't EU-only. Here's a US baseline simulation: TI 0.43, with 26 KPIs across all 9 structural domains. Every country in the database produces the same schema and depth of output, from the US to Bangladesh to Nigeria. AI teams training global models get globally diverse macro environments with consistent structure.
This histogram shows the distribution of US GDP per capita outcomes at 2050 across 500 stored sample paths. Each path is a complete 25-year trajectory with 26 KPIs. For Enterprise buyers, all paths (up to 10,000) are available as raw CSV/Parquet downloads. This is the underlying data that powers every chart, quantile, and regime classification. Your ML model trains on the paths; the distributions are the validation target.
Every simulation path comes with a full log of which structural rules fired, when, and why. For LLM and descriptive AI systems, this is high-value contextual metadata: "GDP fell because Energy Vulnerability triggered in 2025, which cascaded to Fuel Pressure in 2027, which triggered Monetary Tightening in 2029." Your model doesn't just learn the numbers; it learns the causal narrative. Rules can serve as structured inputs, outputs, or training labels depending on your architecture.
Each simulation path is a complete 25-year time series (2025-2050) with annual values for all 26 KPIs. The fan chart shows P10/P50/P90 quantile envelopes, but the underlying raw paths are individual sequences. For LSTM, Transformer, and other sequential ML architectures, this is native training format: thousands of structurally coherent multivariate time series per country, each following different but causally consistent trajectories.
The same 26 KPIs produce fundamentally different structural profiles across countries. Romania (TI 0.53) and Sweden (TI 0.45) show opposite patterns: Romania leads on Income, Housing, and Demographics; Sweden leads on Fiscal and Energy. Training on a single country's data produces models that overfit to one structural pattern. WorldSim gives you 195 structurally distinct environments with the same schema, maximising training diversity without sacrificing consistency.
Every dataset is controlled by a scenario configuration: country, path (Better/Average/Shock), KPI tilts (sigma shifts with persistence and decay), and simulation count. Via API, your system can programmatically generate specific macro environments: "Give me 5,000 paths for Germany where inflation is +3σ and unemployment is +2σ for 5 years." This enables iterative learning: your model can request increasingly extreme scenarios, test its own boundaries, and even integrate WorldSim's engine as a structured environment for reinforcement learning or agent-based exploration.
Historical macro data is scarce: ~25 years of clean data per country, with 2-3 recession episodes. You can't train a robust model on 3 recessions. WorldSim generates thousands of structurally coherent recession, recovery, crisis, and boom scenarios across 195 countries, each respecting the causal macro relationships that your model needs to learn. The coupling rules provide structured causal narratives that LLMs can consume as context. The raw paths provide multivariate time series that sequential models can train on. The parameterised API enables your system to generate its own training environments on demand. This is the synthetic macro data infrastructure that the $2.7B synthetic data market has been missing.
Raw Monte Carlo paths, quantile distributions, and full audit trails across 195 countries. Available as CSV/Parquet downloads or via API.