Brier score
0.207
Validation set · N=—
Lower is better · vs 0.222 uniform baseline
MatchMind publishes how its calibrated football model actually performs against real outcomes. This page is public — no signup required — because trust in a probability model has to be earned with numbers, not promises.
Brier score
0.207
Validation set · N=—
Lower is better · vs 0.222 uniform baseline
Log-loss
1.034
Validation set · N=—
Lower is better · vs 1.099 uniform baseline
Calibration error (ECE)
—
Validation set
Lower is better
Model version
—
Active champion
Need ≥ 50 pre-match evaluations to plot calibration. Currently 0 evaluations recorded.
When the live sample reaches the threshold this page will display the actual curve. Until then, the validation-set metrics above are the honest reference.
Each dot is a probability bin — x-position is the average predicted home-win probability, y-position is how often the home team actually won. The dashed diagonal is perfect calibration. Dots above = model is under-confident in that range; below = over-confident.
Analytical estimates only.The numbers on this page describe the model's historical performance — they do not guarantee future outcomes, and MatchMind does not provide betting tips, picks, or guaranteed predictions. Past calibration is the best evidence available for trusting probabilities, but it is not a promise.