Link Search Menu Expand Document

Welcome to RealTime QA

Check the latest results Make a new submission

Check out the GitHub Check out the paper


Latest results

Multiple Choice Track

ModelSubmission Time (GMT)OriginalNOTA
GPT-4.12025-07-12 03:00:0085.045.0
GPT-4.1 + Google Custom Search2025-07-12 03:00:0080.070.0
GPT-4o + Google Custom Search2025-07-12 03:00:0075.065.0
GPT-4o2025-07-12 03:00:0070.060.0
Claude-3.5 Haiku + Google Custom Search2025-07-12 03:00:0065.070.0
Claude-3.5 Sonnet + Google Custom Search2025-07-12 03:00:0055.060.0
Claude-3.5 Haiku2025-07-12 03:00:0045.050.0
Claude-3.7 Sonnet + Google Custom Search2025-07-12 03:00:0040.045.0
Claude-3.5 Sonnet2025-07-12 03:00:0040.040.0
Claude-3.7 Sonnet2025-07-12 03:00:0035.040.0
Gemini 1.5 Flash + Google Custom Search2025-07-12 03:00:000.00.0
Gemini 1.5 Flash2025-07-12 03:00:000.00.0
Gemini 2.0 Flash + Google Custom Search2025-07-12 03:00:000.00.0
Gemini 2.0 Flash2025-07-12 03:00:000.00.0

Generation Track

ModelSubmission Time (GMT)EMF1
Gemini 2.0 Flash2025-07-12 03:00:0010.014.2
Gemini 1.5 Flash + Google Custom Search2025-07-12 03:00:005.019.6
GPT-4o + Google Custom Search2025-07-12 03:00:005.017.4
Gemini 2.0 Flash + Google Custom Search2025-07-12 03:00:005.014.5
GPT-4.12025-07-12 03:00:005.012.0
GPT-4o2025-07-12 03:00:005.09.2
Gemini 1.5 Flash2025-07-12 03:00:005.09.2
Claude-3.5 Haiku2025-07-12 03:00:005.07.5
Claude-3.5 Haiku + Google Custom Search2025-07-12 03:00:005.06.8
GPT-4.1 + Google Custom Search2025-07-12 03:00:000.09.6
Claude-3.7 Sonnet + Google Custom Search2025-07-12 03:00:000.09.3
Claude-3.5 Sonnet + Google Custom Search2025-07-12 03:00:000.06.1
Claude-3.7 Sonnet2025-07-12 03:00:000.03.3
Claude-3.5 Sonnet2025-07-12 03:00:000.02.0

(see previous results)

Make a new submission

  1. Download the latest set of RealTime QA (link)

  2. Submit your model predictions. (submission form)

    Submission examples (.jsonl file) are available here