aratea.dashboard

Predictor — learning loop

Aratea is a weather-factor discovery engine. Every named feature here is a hypothesis; every training run measures whether it carries signal. The bench is the same row-set kalshi_mid Brier — beat the market, on its own ground.

What goes into Aratea's number

Aratea runs several forecasting models in parallel. Each card below is an independent estimate of the same question: "what's the probability this happens?". The numbers do not add up to 100% — they're different ways of guessing the same answer. Only one of them — the champion — actually places the bet; the others run in shadow.

Hybrid effective sample (N_eff)
86.4
= 6 live + 0.3 × 268 backtest. Secondary decisions only — the Phase 1 gate uses live only.

Aratea's parallel estimates

Champion = the model whose probability is actually used to place the bet. Challengers run in parallel as shadow forecasts; one only becomes champion after beating the current one on a rolling window of resolved trades. This is why the “champion” probability shown in the public view can differ from the “learned model” probability here.

  • Multi-model ensemblechampion
    25.6%

    The mean of four vendor models (ECMWF, GraphCast, GFS, JMA). Useful as a smoothing baseline.

    vendor_ensemble

  • Learned modelchallenger
    14.4%

    A small regression that learns how much weight to give each component based on past resolutions. This is the one Aratea actually bets with.

    learned_v2

What the market is paying

Market (Kalshi mid)
45.5%

What the Kalshi order book is implying right now. The yardstick Aratea must beat.

Shown for comparison. This is the benchmark Aratea is trying to beat — not one of its inputs.

Track record so far

The chart below shows the Brier score of two forecasters across every training pass. Brier scores accuracy: 0 = perfect, 1 = always wrong, lower is better.
blue line = Aratea's learned model. yellow line = the Kalshi market mid on the exact same events. When blue stays under yellow, Aratea has signal the order book doesn't.

0.0000.050.100.15v2bench (0.1305)05-1105-1205-14run (UTC date)Brier score (lower = better)learned (test)kalshi_mid (test)
Show me the full registry →