Calibration Surface — Predicted vs Actual Win Rate
loading…
Diagonal = perfect calibration
—
Calibration Over Time3D Surface · drag to rotate · scroll to zoom
Aggregate Stats—
Status…loading backtest results
A perfectly calibrated model traces a diagonal ridge: predicted 70% → actual 70%, regardless of year.
Bumps above the diagonal = model is under-confident; dips below = over-confident.
Gaps = no fights landed in that (predicted prob × year) cell, or the cell was filtered out by the min-n toggle.
Sparse cells (≤10 fights) carry high binomial noise — toggle Min n=20 or Min n=40 to drop them.
Source: api/calibration-surface.json · 8,000+ walk-forward predictions