Welcome to RealTime QA
Check the latest results Make a new submission
Check out the GitHub Check out the paper
Latest results
Multiple Choice Track
| Model | Submission Time (GMT) | Original | NOTA |
|---|---|---|---|
| llama-4-scout | 2026-05-16 03:00:00 | 20.0 | 15.0 |
| llama-4-scout + Google Custom Search | 2026-05-16 03:00:00 | 15.0 | 15.0 |
| gemini-2.5-pro | 2026-05-16 03:00:00 | 15.0 | 10.0 |
| gpt-5.3-chat | 2026-05-16 03:00:00 | 15.0 | 0.0 |
| claude-sonnet-4.6 + Google Custom Search | 2026-05-16 03:00:00 | 15.0 | 0.0 |
| llama-4-maverick + Google Custom Search | 2026-05-16 03:00:00 | 10.0 | 10.0 |
| gpt-5.3-chat + Google Custom Search | 2026-05-16 03:00:00 | 10.0 | 5.0 |
| gpt-5.4 + Google Custom Search | 2026-05-16 03:00:00 | 10.0 | 5.0 |
| claude-sonnet-4.6 | 2026-05-16 03:00:00 | 10.0 | 5.0 |
| gemini-2.5-pro + Google Custom Search | 2026-05-16 03:00:00 | 10.0 | 0.0 |
| gemini-3.1-pro-preview | 2026-05-16 03:00:00 | 10.0 | 0.0 |
| gemini-3.1-pro-preview + Google Custom Search | 2026-05-16 03:00:00 | 10.0 | 0.0 |
| gpt-5.4 | 2026-05-16 03:00:00 | 5.0 | 5.0 |
| claude-opus-4.6 | 2026-05-16 03:00:00 | 5.0 | 5.0 |
| llama-4-maverick | 2026-05-16 03:00:00 | 0.0 | 15.0 |
| claude-opus-4.6 + Google Custom Search | 2026-05-16 03:00:00 | 0.0 | 0.0 |
Generation Track
| Model | Submission Time (GMT) | EM | F1 |
|---|---|---|---|
| gpt-5.3-chat + Google Custom Search | 2026-05-16 03:00:00 | 15.0 | 39.8 |
| claude-opus-4.6 + Google Custom Search | 2026-05-16 03:00:00 | 15.0 | 33.9 |
| claude-sonnet-4.6 | 2026-05-16 03:00:00 | 15.0 | 30.8 |
| claude-opus-4.6 | 2026-05-16 03:00:00 | 10.0 | 27.7 |
| gpt-5.3-chat | 2026-05-16 03:00:00 | 10.0 | 25.5 |
| claude-sonnet-4.6 + Google Custom Search | 2026-05-16 03:00:00 | 10.0 | 25.2 |
| gemini-3.1-pro-preview + Google Custom Search | 2026-05-16 03:00:00 | 10.0 | 24.8 |
| llama-4-maverick | 2026-05-16 03:00:00 | 10.0 | 18.5 |
| gemini-3.1-pro-preview | 2026-05-16 03:00:00 | 10.0 | 16.0 |
| gpt-5.4 + Google Custom Search | 2026-05-16 03:00:00 | 5.0 | 31.0 |
| llama-4-scout | 2026-05-16 03:00:00 | 5.0 | 19.3 |
| llama-4-maverick + Google Custom Search | 2026-05-16 03:00:00 | 5.0 | 19.0 |
| gpt-5.4 | 2026-05-16 03:00:00 | 5.0 | 19.0 |
| gemini-2.5-pro | 2026-05-16 03:00:00 | 5.0 | 18.3 |
| llama-4-scout + Google Custom Search | 2026-05-16 03:00:00 | 0.0 | 20.6 |
| gemini-2.5-pro + Google Custom Search | 2026-05-16 03:00:00 | 0.0 | 7.8 |
Make a new submission
Download the latest set of RealTime QA (link)
Submit your model predictions. (submission form)
Submission examples (
.jsonl file) are available here