Welcome to RealTime QA

Check the latest results | Make a new submission | Check out the GitHub | Check out the paper


Latest results

Multiple Choice Track

| Model | Submission Time (GMT) | Original | NOTA |
|---|---|---|---|
| claude-sonnet-4.6 | 2026-06-06 03:00:00 | 25.0 | 25.0 |
| llama-4-maverick | 2026-06-06 03:00:00 | 25.0 | 25.0 |
| gpt-5.3-chat | 2026-06-06 03:00:00 | 25.0 | 10.0 |
| gemini-2.5-pro | 2026-06-06 03:00:00 | 25.0 | 5.0 |
| claude-opus-4.6 | 2026-06-06 03:00:00 | 20.0 | 30.0 |
| gpt-5.4 | 2026-06-06 03:00:00 | 15.0 | 15.0 |
| llama-4-scout | 2026-06-06 03:00:00 | 15.0 | 10.0 |
| llama-4-scout + Google Custom Search | 2026-06-06 03:00:00 | 15.0 | 10.0 |
| llama-4-maverick + Google Custom Search | 2026-06-06 03:00:00 | 15.0 | 5.0 |
| claude-opus-4.6 + Google Custom Search | 2026-06-06 03:00:00 | 10.0 | 15.0 |
| gpt-5.3-chat + Google Custom Search | 2026-06-06 03:00:00 | 10.0 | 10.0 |
| gpt-5.4 + Google Custom Search | 2026-06-06 03:00:00 | 10.0 | 5.0 |
| gemini-3.1-pro-preview | 2026-06-06 03:00:00 | 10.0 | 5.0 |
| gemini-3.1-pro-preview + Google Custom Search | 2026-06-06 03:00:00 | 10.0 | 5.0 |
| claude-sonnet-4.6 + Google Custom Search | 2026-06-06 03:00:00 | 10.0 | 5.0 |
| gemini-2.5-pro + Google Custom Search | 2026-06-06 03:00:00 | 10.0 | 0.0 |

Generation Track

| Model | Submission Time (GMT) | EM | F1 |
|---|---|---|---|
| gemini-3.1-pro-preview + Google Custom Search | 2026-06-06 03:00:00 | 35.0 | 35.0 |
| gpt-5.4 | 2026-06-06 03:00:00 | 30.0 | 36.0 |
| gpt-5.3-chat | 2026-06-06 03:00:00 | 30.0 | 33.0 |
| llama-4-scout + Google Custom Search | 2026-06-06 03:00:00 | 25.0 | 33.3 |
| claude-sonnet-4.6 | 2026-06-06 03:00:00 | 25.0 | 28.6 |
| gpt-5.4 + Google Custom Search | 2026-06-06 03:00:00 | 25.0 | 26.8 |
| llama-4-scout | 2026-06-06 03:00:00 | 25.0 | 25.0 |
| gpt-5.3-chat + Google Custom Search | 2026-06-06 03:00:00 | 20.0 | 34.0 |
| llama-4-maverick + Google Custom Search | 2026-06-06 03:00:00 | 20.0 | 28.1 |
| claude-opus-4.6 | 2026-06-06 03:00:00 | 20.0 | 24.4 |
| claude-opus-4.6 + Google Custom Search | 2026-06-06 03:00:00 | 15.0 | 25.5 |
| claude-sonnet-4.6 + Google Custom Search | 2026-06-06 03:00:00 | 15.0 | 24.8 |
| llama-4-maverick | 2026-06-06 03:00:00 | 15.0 | 20.2 |
| gemini-3.1-pro-preview | 2026-06-06 03:00:00 | 15.0 | 19.2 |
| gemini-2.5-pro + Google Custom Search | 2026-06-06 03:00:00 | 10.0 | 18.5 |
| gemini-2.5-pro | 2026-06-06 03:00:00 | 10.0 | 10.5 |
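
For readers unfamiliar with the EM and F1 columns of the Generation Track, here is a minimal sketch of how exact match and token-level F1 are commonly computed for a single question. It assumes SQuAD-style answer normalization (lowercasing, stripping punctuation and English articles); this page does not state the exact normalization used by the official scorer, and the function names are illustrative only.

```python
import re
import string
from collections import Counter


def normalize(text: str) -> str:
    """Lowercase, drop punctuation and English articles, collapse whitespace.

    Assumption: SQuAD-style normalization, not confirmed by this page.
    """
    text = text.lower()
    text = "".join(ch for ch in text if ch not in string.punctuation)
    text = re.sub(r"\b(a|an|the)\b", " ", text)
    return " ".join(text.split())


def exact_match(prediction: str, gold: str) -> float:
    """Return 1.0 if the normalized strings are identical, else 0.0."""
    return float(normalize(prediction) == normalize(gold))


def f1_score(prediction: str, gold: str) -> float:
    """Harmonic mean of token precision and recall after normalization."""
    pred_tokens = normalize(prediction).split()
    gold_tokens = normalize(gold).split()
    if not pred_tokens or not gold_tokens:
        # Both empty counts as a match; one empty counts as a miss.
        return float(pred_tokens == gold_tokens)
    overlap = sum((Counter(pred_tokens) & Counter(gold_tokens)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred_tokens)
    recall = overlap / len(gold_tokens)
    return 2 * precision * recall / (precision + recall)
```

Under these assumptions, a leaderboard-style score would be the mean of the per-question values multiplied by 100. F1 gives partial credit for overlapping tokens, which is why a model's F1 can exceed its EM.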

(see previous results)

Make a new submission

  1. Download the latest set of RealTime QA questions (link).

  2. Submit your model predictions (submission form).

    Submission examples (.jsonl file) are available here.
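
As a rough illustration of step 2, the sketch below serializes predictions in JSON Lines form, one JSON object per line. The field names `question_id` and `prediction`, the example IDs, and the output file name are hypothetical; consult the official submission examples for the actual schema before submitting.

```python
import json


def to_jsonl(predictions: dict) -> str:
    """Serialize {question_id: answer} pairs as JSON Lines text.

    Assumption: the keys "question_id" and "prediction" are placeholders,
    not the confirmed RealTime QA submission schema.
    """
    lines = []
    for question_id, answer in predictions.items():
        record = {"question_id": question_id, "prediction": answer}
        lines.append(json.dumps(record, ensure_ascii=False))
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    # Hypothetical question IDs and answers, for illustration only.
    example = {"q_0": "Paris", "q_1": "42"}
    with open("predictions.jsonl", "w", encoding="utf-8") as f:
        f.write(to_jsonl(example))
```

Writing one object per line keeps the file easy to stream and to validate line by line with `json.loads`.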