Every CERES prediction is timestamped, publicly recorded, and graded against published IPC outcomes at T+90 days. This page is the permanent public record of what CERES predicted, when, and whether it was right.
Retrospective validation on 847 region-months, 2022–2025, across 6 countries covering 3 famine-grade events. Forward validation of live predictions is ongoing.
| Run ID | Issued | Region | P(IPC 3+) | CI 90% | Tier | Horizon | IPC Outcome | Verdict |
|---|---|---|---|---|---|---|---|---|
| CERES-20260228-160603 | 28 Feb 2026 | Sudan | 96.6% | [92.3–98.4] | Tier 1 | 29 May 2026 | — | ⟳ Pending |
| CERES-20260228-160603 | 28 Feb 2026 | Somalia | 96.2% | [91.5–98.4] | Tier 1 | 29 May 2026 | — | ⟳ Pending |
| CERES-20260228-160603 | 28 Feb 2026 | Yemen | 95.2% | [89.3–97.9] | Tier 1 | 29 May 2026 | — | ⟳ Pending |
| CERES-20260228-160603 | 28 Feb 2026 | Ethiopia | 92.0% | [85.7–96.2] | Tier 1 | 29 May 2026 | — | ⟳ Pending |
| CERES-20260228-160603 | 28 Feb 2026 | South Sudan | 91.8% | [85.7–96.8] | Tier 1 | 29 May 2026 | — | ⟳ Pending |
Predictions issued 28 February 2026. IPC outcome grading will occur when OCHA/IPC publish the May–June 2026 acute food insecurity classification for each region. This table updates automatically.
The following calibration results are derived from back-testing CERES predictions against published IPC outcomes across the retrospective validation set. All metrics are reported on held-out test data, not training data.
Grey = ideal calibration · Amber = CERES observed rate
Near-ideal calibration confirms probability estimates are trustworthy.
| Total observations | 847 region-months |
| Countries validated | 6 |
| Time period | 2022–2025 |
| Famine events covered | 3 |
| IPC Phase 3+ events | 312 |
| Tier-I alerts issued | 371 |
| Bootstrap replications | 2,000 per prediction |
Every prediction CERES issues is permanently recorded in this ledger with a timestamp, probability estimate, confidence interval, and T+90 day grading date. We do not remove predictions that prove incorrect. We analyse and publish the reasons for forecast errors. The accuracy record here is the complete record — there is no curated subset. This is the foundation of institutional trust.