--:--:-- UTC ⬡ Polymarket 🔒
▸ POLYQUANT
▸ about this lab tmax live positions f1 live positions MONACO ⬡ huggingface ↗ ⬡ join polymarket ↗ ★ hire me — build your bot
🔒

This section is private.
Click the lock icon in the top bar to authenticate.

polymarket-lab — bash — 220×48
██████╗  ██████╗ ██╗     ██╗   ██╗███╗   ███╗ █████╗ ██████╗ ██╗  ██╗███████╗████████╗    ██╗      █████╗ ██████╗
██╔══██╗██╔═══██╗██║     ╚██╗ ██╔╝████╗ ████║██╔══██╗██╔══██╗██║ ██╔╝██╔════╝╚══██╔══╝    ██║     ██╔══██╗██╔══██╗
██████╔╝██║   ██║██║      ╚████╔╝ ██╔████╔██║███████║██████╔╝█████╔╝ █████╗     ██║       ██║     ███████║██████╔╝
██╔═══╝ ██║   ██║██║       ╚██╔╝  ██║╚██╔╝██║██╔══██║██╔══██╗██╔═██╗ ██╔══╝     ██║       ██║     ██╔══██║██╔══██╗
██║     ╚██████╔╝███████╗   ██║   ██║ ╚═╝ ██║██║  ██║██║  ██║██║  ██╗███████╗   ██║       ███████╗██║  ██║██████╔╝
╚═╝      ╚═════╝ ╚══════╝   ╚═╝   ╚═╝     ╚═╝╚═╝  ╚═╝╚═╝  ╚═╝╚═╝  ╚═╝╚══════╝   ╚═╝       ╚══════╝╚═╝  ╚═╝╚═════╝
artbreguez@lab:~/polymarket-lab$ status --all --live
F1 champion loading…
TMAX champion loading…
ECMWF backfill loading…
TMAX tracking loading…
F1 paper loading…
Next GP loading…
updated every 6h ·
F1 Walk-Forward PnL
F1 ROI
multitask_qr r4
TMAX Gate PnL
TMAX Hit Rate
TMAX Live Positions
ECMWF Backfill
30 cities · 2602 markets
// next grand prix
LOADING
FP1: —
--
days
:
--
hrs
:
--
min
:
--
sec
● champion r4 ready auto-scheduled paper trading only
// process status
GFS Backfill — 30 cities
2602/2602 · complete
DONE
ECMWF Backfill (ifs025 + aifs025)
loading…
F1 Autoresearch v3
450 candidates · r4 champion held
DONE
TMAX Champion Publish + HF Upload
lgbm_emos → huggingface/artbreguez
DONE
TMAX Paper Trading (every 4h)
loading…
F1 Paper Trading (auto-scheduled)
loading…
AUTO
Post-GP Retrain Cron (Sundays)
auto sync + retrain + HF publish
ACTIVE
🔒

This section is private.
Click the lock icon in the top bar to authenticate.

🔒

This section is private.
Click the lock icon in the top bar to authenticate.

🔒

This section is private.
Click the lock icon in the top bar to authenticate.

// cumulative pnl by gp (r4 champion)
36 GPs
Unitized PnL (1 unit = 1 share). Multiply × bet_size for dollars.
2024 R122025 R12025 R6 →
// per-family cumulative pnl
FamilyTotal PnLROIBetsHit Rate
head_to_head+$14.1347%60
race_winner+$5.62166%56
driver_pole+$2.16257%28
TOTAL+$25.1717.1%654
🔒

This section is private.
Click the lock icon in the top bar to authenticate.

🔒

This section is private.
Click the lock icon in the top bar to authenticate.

// champion model performance — 30 cities · 2,602 markets
tuned_ensemble
PnL is dollar-denominated (real market prices). Backtest uses walk-forward with stride=30 days. Quote-proxy PnL simulates the Polymarket bid/ask spread.
+$381.67
Backtest PnL · 1,245 trades
+$138.89
Quote-proxy PnL (live simulation)
+$201.54
Recent-core gate · 218 trades
29.6%
Hit Rate
31.1%
Avg Edge
// BACKTEST PNL BY MODEL
lgbm_emos CHAMPION
+$381
gaussian_emos no live edge
+$461
lgbm_emos (v1)
+$206
det2prob_nn skip
+$25
// model comparison — historical backtest
stride=30
ModelBacktest PnLqp_PnLHit RateTradesBrierStatus
tuned_ensemble (lgbm_emos)+$381.67+$138.8929.6%1,2450.1137CHAMPION
gaussian_emos+$461.85+$3.9221.7%1,2420.1119NO LIVE EDGE
lgbm_emos (v1)+$206.73-$12.1325.0%RETIRED
det2prob_nn+$25.76-$173.9420.8%1,2540.1176SKIP
// why gaussian_emos loses despite higher backtest PnL
qp_pnl is the true signal. Backtest assumes perfect execution at mid-price. Quote-proxy simulates the real Polymarket spread (2%). gaussian_emos: +$461 backtest vs +$3.92 qp_pnl — edge evaporates at execution.
tuned_ensemble has real edge. +$138.89 qp_pnl means consistently beating the spread. 29.6% hit rate on binary markets signals real mispricings.
ECMWF upgrade pending. neighbor_spread feature (|ifs025 − aifs025|) expected to improve calibration. Auto-triggers at 100% backfill.
// recent-core gate — 3 city validation
GO ✓
CityTradesPnLHit RateGate
Atlanta76+$162.45PASS ✓
Buenos Aires76+$28.84PASS ✓
Dallas66+$10.25PASS ✓
AGGREGATE218+$201.5420.2%GO ✓
🔒

This section is private.
Click the lock icon in the top bar to authenticate.

// model evolution — versions & decisions
Unlike F1 (neural network hill-climbing), TMAX uses LGBM + EMOS statistical calibration. Each version is benchmarked on the full historical backtest and a recent-core gate (3 cities, last ~3 months). Champion auto-promoted when qp_pnl > 0 and gate GO.
v1 (lgbm_emos): GFS-only, 20 cities. +$206 backtest PnL. Baseline established April 2026.
v2 (tuned_ensemble): GFS backfill to 30 cities + tuned head weights. +$381 backtest, +$138 qp_pnl. Champion since 2026-04-29.
v3 (pending): ECMWF IFS025 + AIFS025 neighbor_spread feature. Retrain auto-triggers at 100% ECMWF backfill.
// full leaderboard — all candidates
historical_real · stride=30
ModelBacktest PnLqp_PnLHit RateBrierTradesScore
tuned_ensemble+$381.67+$138.8929.6%0.11371,2451.85
gaussian_emos+$461.85+$3.9221.7%0.11191,2421.15
lgbm_emos (v1)+$206.73-$12.1325.0%
det2prob_nn+$25.76-$173.9420.8%0.11761,2540.0
// promotion gate criteria
CriterionThresholdRationale
qp_pnl> $0Must beat spread in live simulation
recent-core gateGO (3 cities pass)Validates on most recent market data
hit_rate> 18%Minimum statistical signal
sample_adequacypassedEnough trades for significance
// next research directions
ECMWF neighbor_spread. |ifs025 − aifs025_single| as uncertainty signal. Improves calibration on high-variance days.
Expand to 40+ cities. London, NYC, Paris have higher Polymarket volume. Currently 30 covered.
Horizon-specific models. morning_of vs market_open have different dynamics. Splitting may improve edge consistency.
// F1 LAB · FORMULA 1 WORLD CHAMPIONSHIP 2026
MONACO GRAND PRIX Round 8 · Standard
Circuit de Monaco · Monte Carlo · Jun 5–7, 2026
NEXT SESSION
FP1
--:--:--
01 May 16:00 UTC
FP1 Fri 11:30
SPRINT QUALIFYING Fri 20:30
SPRINT Sat 16:00
QUALIFYING Sat 14:00
RACE Sun 13:00
Loading...
// POLYQUANT · QUANTITATIVE RESEARCH TERMINAL
Arthur Breguez
Arthur Breguez // @breguez-ai
QUANT RESEARCHER · PREDICTION MARKET SYSTEMS
I build end-to-end quantitative systems that find and trade real edges on Polymarket — from raw data ingestion through live CLOB execution. Currently running two live systems: an F1 multitask neural network and a 30-city weather ensemble. All models are paper-validated before any capital is deployed.
● LIVE 450+ models evaluated paper-validated only
// F1 LAB · FORMULA 1 WORLD CHAMPIONSHIP
F1 Prediction System
CHAMPION r4 gate=PASS
Multitask quantile regression neural network trained on 27 GPs of historical F1 data. Scores race winner, pole position, and head-to-head driver markets against live Polymarket CLOB prices. Champion model selected via autoresearch hill-climbing with rigorous paper trading gate.
17.1%
ROI
654
BETS
0.084
BRIER
27
GPS
$ champion --system f1 --verbose
model MultitaskQR r4 (c21e58b0)
architecture multitask quantile regression · PyTorch
markets race winner · pole · H2H driver matchups
live PnL +$157.13 (60 trades · Miami 2026)
status CHAMPION gate=PASS HF=synced
▸ Live positions 🤗 Model ↗
// TMAX LAB · WEATHER · 30 CITIES
TMAX Weather System
tuned_ensemble gate=PASS
Ensemble model (LightGBM + isotonic calibration) trained on GFS and ECMWF forecast features to predict daily temperature maximum markets on Polymarket. Covers 30 cities globally, scoring 2,602 active markets every 4 hours. Champion gated on 3-city recent-core eval.
35.0%
HIT RATE
+$426
QP_PNL
2,602
MARKETS
30
CITIES
$ champion --system tmax --verbose
model tuned_ensemble
features GFS + ECMWF IFS025/AIFS025 · 31 features · 98% fill
cities Atlanta · Buenos Aires · Dallas + 27 more
gate PnL +$126 (223 trades · ATL+BA+DAL)
status CHAMPION gate=PASS HF=synced scoring=4h
▸ Live positions 🤗 Model ↗
// UPCOMING EVENTS
🏎 CANADIAN GP — 22–24 MAY 🌡 TMAX scoring — loading… Next retrain: post-Canadian GP
// TMAX LAB · WEATHER · LIVE TRACKING
TMAX Live Positions
Loading...
🔒

This section is private.
Click the lock icon in the top bar to authenticate.

tuned_ensemble (lgbm_emos) [alias: champion]
promoted: 2026-04-29 14:51 UTC · gate: GO ✓ · 3 cities passed · 30 cities covered
-$5479.37
Backtest PnL
-$48119.07
qp_pnl (live sim)
+$201.54
gate aggregate
0.1070
Brier Score
43.9%
Hit Rate
30
Coverage
// champion model — full metrics
LIVE · 2026-04-29
MetricValue
Modeltuned_ensemble (lgbm_emos)
Backtest PnL+$381.67
Backtest Trades1820
Hit Rate29.6%
Avg Edge31.1%
Brier Score0.1137
Quote-Proxy PnL+$138.89
Gate DecisionGO ✓
Gate PnL+$201.54
Gate Trades218
Markets Covered2,602
Cities30
Published2026-04-29
// model leaderboard
historical backtest
ModelPnLqp_pnlBrierTradesStatus
lgbm_emos (tuned_ensemble)+$381+$1380.11371245CHAMPION
gaussian_emos+$461+$40.11191242NO EDGE
det2prob_nn+$25-$1730.11761254SKIP
// recent-core gate — city results
GO
CityTradesPnLok_ratioGate
Atlanta76+$162.45100%PASS ✓
Buenos Aires76+$28.84100%PASS ✓
Dallas66+$10.25100%PASS ✓
// key metric: qp_pnl > brier
!
gaussian_emos has better Brier (0.1119) but qp_pnl ≈ $4. A well-calibrated model that matches Polymarket prices offers no tradeable edge. Always select champion by quote_proxy_pnl, not raw Brier score.
London/NYC failed the gate (negative PnL, z-scores -5.57 and -8.08). Not noise. Switched to Atlanta/Buenos Aires/Dallas for recent-core validation. Madrid later replaced by Dallas (May 2026) — persistent negative PnL (-$44) in recent benchmarks.
🔒

This section is private.
Click the lock icon in the top bar to authenticate.

🔒

This section contains private infrastructure details.
Click the lock icon in the top bar to authenticate.

🔒

This section contains private cron job IDs and infrastructure details.
Click the lock icon in the top bar to authenticate.