Link Search Menu Expand Document

Welcome to RealTime QA

Check the latest results Make a new submission

Check out the GitHub Check out the paper


Latest results

Multiple Choice Track

ModelSubmission Time (GMT)OriginalNOTA
GPT-4o + Google Custom Search2026-01-17 03:00:0070.050.0
GPT-4o2026-01-17 03:00:0070.040.0
GPT-4.12026-01-17 03:00:0070.040.0
Claude-3.7 Sonnet + Google Custom Search2026-01-17 03:00:0060.070.0
GPT-4.1 + Google Custom Search2026-01-17 03:00:0060.050.0
Claude-3.5 Haiku2026-01-17 03:00:0060.040.0
Claude-3.7 Sonnet2026-01-17 03:00:0060.040.0
Claude-3.5 Haiku + Google Custom Search2026-01-17 03:00:0050.050.0
Gemini 2.0 Flash + Google Custom Search2026-01-17 03:00:000.00.0
Gemini 2.0 Flash2026-01-17 03:00:000.00.0

Generation Track

ModelSubmission Time (GMT)EMF1
GPT-4o2026-01-17 03:00:0010.024.1
Gemini 2.0 Flash + Google Custom Search2026-01-17 03:00:0010.017.7
GPT-4.12026-01-17 03:00:0010.017.2
Gemini 2.0 Flash2026-01-17 03:00:000.011.3
GPT-4o + Google Custom Search2026-01-17 03:00:000.011.0
Claude-3.7 Sonnet + Google Custom Search2026-01-17 03:00:000.08.1
GPT-4.1 + Google Custom Search2026-01-17 03:00:000.07.0
Claude-3.5 Haiku + Google Custom Search2026-01-17 03:00:000.06.4
Claude-3.7 Sonnet2026-01-17 03:00:000.03.0
Claude-3.5 Haiku2026-01-17 03:00:000.00.0

(see previous results)

Make a new submission

  1. Download the latest set of RealTime QA (link)

  2. Submit your model predictions. (submission form)

    Submission examples (.jsonl file) are available here