ForesightFlow

Datasets

Open datasets released alongside ForesightFlow research. All datasets are published under CC-BY-4.0 unless otherwise noted.

ForesightFlow Insider Cases

A curated validation set mapping eight publicly documented episodes of suspected informed trading on Polymarket to concrete on-chain market identifiers. Cases span 2023–2026 and cover 24 individual markets across military and geopolitical actions, corporate proprietary disclosures, and regulatory decisions.

CC-BY-4.0
PMXT Behavioral Clusters v1 — Non-Retail Polymarket Microstructure Dataset

Fill-side behavioral clusters, feature tiers, and per-market microstructure signatures from 13.36M OrderFilled events on Polymarket CTFExchange (43,116 markets, 77,203 addresses, 2026-04-21 to 2026-04-27). k-means k=5 archetypes, 6 feature tiers, Spearman bilateral analysis with BH-FDR correction. Bundle 3 of the PMXT family; companion to Paper 4 of the Event-Linked Perpetuals research programme.

CC-BY-4.0
PMXT Counterfactual Replay v1 — Engine and Resolution-Zone Protocol Comparison

Per-(engine/mechanic, leverage, class) summary statistics from two counterfactual simulation experiments on 13,000+ resolved Polymarket binary-event markets. E2 compares three margin-engine variants on survivability and drawdown; E3 compares four resolution-zone mechanics (R0–R3) on final-hour liquidation rates, bad-debt frequency, and average PnL. Bundle 2 of the PMXT family; companion to Paper 1 of the Event-Linked Perpetuals research programme.

CC-BY-4.0
Polymarket Per-Category Exponential Hazard Rates

Per-category maximum-likelihood exponential hazard fits for the time-to-event distribution on Polymarket deadline-resolved contracts. Headline: military_geopolitics λ=0.241 events/day, 95% CI [0.143, 0.365], half-life 2.9 days (KS p=0.425, n=18, adequate). Baseline survival function for the deadline-ILS framework.

CC-BY-4.0
Polymarket ILS Corpus

Population-scale Information Leakage Score (ILS) computations for 4,801 resolved Polymarket markets. Anchored at t_resolve − 24h proxy for 99.9% of records (4,796 markets); 5 markets carry a genuinely event-anchored timestamp. Includes multi-window ILS variants (30min, 2h, 6h, 24h, 7d), top-wallet concentration (HHI), and scope-condition flags. For event-anchored analysis on a smaller, deeper sample, use polymarket-deadline-ils-v3.

CC-BY-4.0
Polymarket T_news / T_event Recovery

Public-event and news-arrival timestamps for 2,052 resolved Polymarket markets, recovered across three methodological tiers: UMA Oracle proposer evidence (Tier 1, n=12, confidence 0.95), GDELT proxy (Tier 2, n=1,993, confidence 0.60), and LLM-assisted multi-source verification (Tier 3, n=47, confidence 0.80–0.90). Snapshot: 2022-12–2026-04.

CC-BY-4.0