验证
首场比赛结束后将开始模型准确性追踪。此前本页面展示指标定义与方法说明。
状态
pre_tournament_placeholderpre_tournament
指标定义
- Brier
- Measures the mean squared difference between predicted probabilities and actual outcomes. Lower is better. Range: 0 (perfect) to 1 (worst).
- Log-loss
- Penalizes confident wrong predictions more heavily. Lower is better.
- Calibration
- Measures whether predicted probabilities match observed frequencies across buckets.