Skip to content

Current Data Inventory

Everything this model consumes lives in Supabase project sstmupvotyzrjygqoany (ap-northeast-1) and is served through the MCP gateway (Railway, 84 tools, 19 datasets).

TableRowsCoverageRefresh
ohlcv_1d~23 90881 symbolsDaily 06:00 UTC
ohlcv_1h40 symbolsHourly 0 0-21 * * *
ohlcv_15m40 symbolsHourly
ohlcv_5m40 symbolsHourly
tickers11615-min quotes*/15 * * * *
economic_calendar18High-impact eventsDaily
economic_indicators~25 76885 FRED seriesDaily
central_bank_ratesG10 ratesDaily
cot_reports~1 420Commitment of TradersWeekly
options_chains~2 492Select equities + cryptoDaily
equity_fundamentalsS&P 500 subsetQuarterly
earnings_estimates~360S&P 500 subsetQuarterly
news_articles~13 8556 RSS feeds · VADER*/30 * * * *
ml_predictionsKronos outputsEvery cron tick
signal_evaluationsPrediction labelsHourly / daily

Per project memory, 19 registered datasets. The model is directly powered by:

  • ohlcv_daily, ohlcv_intraday — candle source for Kronos input
  • economic_calendar — event flags
  • economic_indicators — macro context
  • predictions — cached Kronos outputs served to UI
flowchart LR
subgraph Existing
OHLCV[(ohlcv_1h<br/>ohlcv_1d)] --> BATCH[kronos-batch-predict.py]
CAL[(economic_calendar<br/>10-ch placeholder)] --> BATCH
end
BATCH --> ML[(ml_predictions)]
ML --> GW[Gateway<br/>/showcase/ml-prediction]
GW --> UI[prediction.datfxlabs.com]

What the model will need (Phase 1 · 20 channels)

Section titled “What the model will need (Phase 1 · 20 channels)”

See Data Gaps and Backfill Plan — both populated from the data-gap researcher report.

The short version:

  • Have: economic_calendar with actual / forecast fields for primary events.
  • Partial: surprise z-scores (need to compute rolling-20 std per event type at query time or materialise a view).
  • Missing: FOMC hawkish score (requires NLP over statement text not currently stored), cross-asset leader OHLCV for DXY + VIX at 1 h resolution (verify coverage), FOMC statement text archive.
  • No fake data — enforced by CLAUDE.md Data Integrity rules plus DB triggers blocking sources like 'fake', 'mock', 'simulated'.
  • Source whitelist — only Yahoo Finance, FRED, Alpha Vantage, CoinGecko, Binance, Bybit, Finnhub, FMP, Marketaux.
  • External verification — cross-checks against live market prices on data entry.