Methodology · Audit · Open

Real methodology. Not marketing.

Bet Hero's docs page is an EV explainer for beginners. OddsJam doesn't publish methodology at all. This page is the actual math we run, the data we capture, and the cap rules we apply — so when our scanner says "+2.8% executable EV" you can verify exactly what that means.

Cross-venue triangulation

When the same outcome trades on multiple venues, no single quote is the true probability — each venue carries its own vig, its own flow bias, and its own structural fee. We combine them with weights tied to how vig-free each venue is by design.

Kalshiweight 0.45CFTC-regulated DCM, ~1% structural fee

Polymarketweight 0.35P2P AMM + CLOB, near-zero overround on liquid markets

Sportsbooks (combined)weight 0.205-7% vig; split across all SBs present

Per-venue no-vig prob: Kalshi p_raw / 1.01, Polymarket p_raw / 1.01, SB uses the opposite-side decimal odds if known via the multiplicative method (else assumes 5% overround). The final fair probability is the weighted average of all per-venue no-vig probs. Confidence is 1 − std(noVigProbs) / 10. Divergence is fired when the absolute gap between PM and Kalshi no-vig probs exceeds 200bps.

›src/lib/data/fair-price.ts

Slippage walked at $1k / $5k / $25k

For Polymarket and Kalshi, we walk the live order book and compute the fill price you'd actually pay at three stake tiers. This is the foundation of our "real fillable EV vs paper math" claim. OddsJam and Bet Hero quote the headline best price; we quote the fill at YOUR stake size.

$1k filldepth_1kask-walk to fill $1k notional

$5k filldepth_5kask-walk to fill $5k notional

$25k filldepth_25kask-walk to fill $25k notional

Slippage cap2%legs slipping > 2% from best ask flagged

Hard cap10%legs over 10% slip auto-tagged paper

For sportsbooks (which don't expose order books) we use a crowdsourced stake-limits database (159 bootstrap entries × 14 books, growing via user reports). A book is considered fillable at the quoted price up to its known limit; above the limit, the leg is flagged.

›src/lib/data/pm-depth.ts

›src/lib/data/stake-limits.json

Executable / Partial / Paper verdicts

Every arbitrage row carries a verdict that combines slippage and stake limits across all legs. This is the line between an arb that you can actually trade vs an arb that exists only on paper because no order book has the depth.

Executablegreenevery leg fills cleanly at the quoted price for the recommended stake

Partialamber≥1 leg slips 2-10% or hits a stake limit; positive EV preserved, headline edge eaten

Paperred≥1 leg can't fill (slip > 10%, no depth data, or stake limit too low). Math correct, trade impossible.

The default leaderboard view hides PAPER rows because in a raw-edge sort they dominate (worst liquidity ≈ widest spreads). Toggle ?paper=1 to inspect them.

›src/lib/data/arb.ts

+EV scoring with cross-venue peer set

For every outcome quoted by 3+ platforms, we form a peer set of the OTHER platforms (excluding the venue we'd take) and compute the median no-vig fair probability across them. If the venue we'd take offers a payout above that peer-set fair value, the pick is +EV.

Min peer count2reject picks with single-peer signal

Min EV2%below this is estimation noise

Edge cap30%picks above cap are filtered as data corruption (stale lines, dead markets)

Exclude low-liq takenyesnever recommend a leg with thin depth on the take side

Require ≥1 liquid peeryesreject when ALL peers are low-liq (peer signal is junk)

The cross-venue peer set is what makes this scanner sharper than OddsJam's or Bet Hero's: their peer median is computed across sportsbooks only. We add Polymarket and Kalshi to the peer set, which on average shaves 50-150bps off the fair-prob drift because PM/Kalshi carry less vig and less book-cartel bias than the SB consensus.

›src/lib/data/positive-ev.ts

Low-hold detection

A market is low-hold when the cross-platform combination of best prices (one per outcome from whichever venue is cheapest) leaves almost no margin. Negative hold = active arbitrage.

Hold floor4%markets above 4% combined margin filtered out

Per-leg liquiditynon-lowlegs with thin depth excluded from the combo

Arb thresholdhold < 0rows where combined implied prob sums to < 100%

›src/lib/data/low-hold.ts

Settlement risk (Polymarket UMA)

Polymarket markets settle via UMA optimistic oracle proposals. Disputes are rare but exist (the $150M BTC-by-Dec-2025 case being the most public). Our tool estimates dispute probability and expected resolution latency by category and adjusts EV accordingly.

Sportsdispute ~1%objective outcomes, low UMA risk

Politicsdispute 2-3%moderate, depends on outcome objectivity

Weatherdispute 4-5%data-source ambiguity drives disputes

Geopoliticsdispute 5-8%subjective wording, multiple interpretations

The model also factors capital-cost (locked while pending resolution) and flip-risk (overturned outcome on dispute). The settlement-adjusted EV can be markedly lower than the raw EV on high-risk categories.

›src/lib/data/settlement-risk.ts

Tick archive — what we capture every second

Since 2026-04-29 we capture every quote from 2000 Polymarket tokens continuously into Cloudflare R2. Schema below — this is the actual parquet shape, not a marketing approximation.

v1/dt=YYYY-MM-DD/venue=polymarket/hour=HH/<uuid>.parquet

Fields (19):
  ts_capture_utc      timestamp[us, UTC]   capture wall-clock
  venue               string               polymarket / kalshi / book key
  event_id            string               canonical OB event id
  market_id           string               venue-side market id
  outcome_name        string               canonical OB outcome name
  outcome_side        string               YES / NO (PM/Kalshi only)
  raw_implied_prob    float64              0-100, vig included
  decimal_odds        float64              1.01 - 1000
  best_ask            float64              top of ask book
  best_bid            float64              top of bid book
  depth_1k            float64              ask-walk to fill $1k notional
  depth_5k            float64              ask-walk to fill $5k notional
  depth_25k           float64              ask-walk to fill $25k notional
  liquidity_flag      string               normal / low (heuristic)
  spread_bps          int32                (ask - bid) × 10000 / mid
  source_lag_ms       int32                latency from venue clock to our capture
  ws_or_rest          string               polling source
  collector_version   string               schema version pin
  segment_id          string               UUID grouping a single connection window

Total throughput in steady state: ~89 rows/min × 2000 tokens. Storage is partitioned by date + venue + hour for cheap range queries against R2's S3-compatible API.

›oddsbridge_collector/lib/parquet_writer.py

Refresh cadence + cache

Live odds pipeline1.4s cycleWS for PM/Kalshi, REST poll for sportsbooks

Site-wide KV cache90s TTLRedis-backed, shared across Vercel instances

Page ISR (homepage / leaderboard / event)300sall event-listing pages aligned to prevent stale-link 404s

Tracker statsdynamicrecomputed every request from your bet ledger

Limits & disclosures

A few things we are NOT yet:

Tick-archive only covers Polymarket continuously. Kalshi capture is in pre-production. Sportsbook capture has not started.
Backtest engine accepts strategy specs but the runner returns DATA_INSUFFICIENT until 30 contiguous days of archive land (target: mid-June 2026).
No ML fair-value model in production yet — the triangulation engine is hand-tuned, not trained. Calling it "Pinnacle for prediction markets" is aspirational, not shipped.
We do not accept bets, do not execute trades, do not custody funds. We are a comparison layer over public order books.
18+ only. Gambling involves risk. Past performance does not guarantee future results.

Spot a methodology bug or want a tier we don't yet publish? Tell us.