Calibration — Definition & Guide | MarketsPrediction Glossary

TL;DR

A well-calibrated forecaster's confidence levels reliably match real-world outcomes. It is the foundation of trustworthy probabilistic prediction.

Key Points

✓Perfect calibration means events predicted at 60% occur 60% of the time, events at 80% occur 80% of the time, and so on across all probability levels.

✓Overconfidence is the most common calibration error: forecasters assign probabilities too close to 0 or 1 and are wrong more often than expected.

✓Calibration is measured visually with reliability diagrams and numerically via metrics like the Brier score and log score.

✓Prediction markets tend to be well-calibrated because mispriced contracts create profit opportunities that traders quickly arbitrage away.

✓Regular feedback and deliberate practice are the most reliable ways to improve personal calibration over time.

What Calibration Means in Practice

Calibration is the degree of agreement between a forecaster's stated probabilities and observed frequencies of outcomes. If you say an event has a 30% chance and you make 100 such predictions, good calibration means roughly 30 of those events actually occur. In prediction markets, contract prices are continuously driven toward calibrated values by competing traders seeking Edge. A price of 0.65 on a Binary Market ideally reflects a true 65% probability of the event resolving YES. Poor calibration in a market signals a potential Mispricing that informed traders can exploit for profit.

How Calibration Is Measured and Improved

The most common numerical tools for assessing calibration are the Brier Score and Log Score, both of which penalize forecasters more heavily for confidently wrong predictions than for modest misses. Reliability diagrams plot predicted probabilities against actual outcome rates; a perfectly calibrated forecaster produces a straight diagonal line. Research from the Good Judgment Project shows that top crowd forecasters achieve significantly better calibration than domain experts by updating incrementally on new evidence using prior probabilities and adjusting to form posterior probabilities. Regular scoring feedback is the single most effective tool for training better calibration.

Sources & References

1Calibration (statistics) - Wikipedia ↗

2Evidence on good forecasting practices from the Good Judgment Project - AI Impacts ↗

3Log probability scoring rule - The Book of Statistical Proofs ↗

Last updated: June 24, 2026

Related Terms

Find the best odds on every market

Compare live prices across Kalshi, Polymarket, and more — spot arbitrage and trade the sharpest line on any event.

Compare Markets