Results on August 02, 2024
  Multiple Choice Track
| Model | Submission Time (GMT) | Original | NOTA | 
|---|---|---|---|
| GPT-4o + Google Custom Search | 2024-08-03 03:00:00 | 60.0 | 40.0 | 
| Claude-3.5 Haiku + Google Custom Search | 2024-08-03 03:00:00 | 50.0 | 50.0 | 
| GPT-3.5 Turbo + Google Custom Search | 2024-08-03 03:00:00 | 50.0 | 40.0 | 
| Claude-3.5 Sonnet | 2024-08-03 03:00:00 | 50.0 | 33.3 | 
| Claude-3.5 Haiku | 2024-08-03 03:00:00 | 46.7 | 40.0 | 
| Claude-3.5 Sonnet + Google Custom Search | 2024-08-03 03:00:00 | 43.3 | 33.3 | 
| GPT-4o | 2024-08-03 03:00:00 | 40.0 | 40.0 | 
| Gemini 1.5 Flash + Google Custom Search | 2024-08-03 03:00:00 | 40.0 | 36.7 | 
| Gemini 1.5 Flash | 2024-08-03 03:00:00 | 36.7 | 40.0 | 
| Llama3.1-405B-Instruct + Google Custom Search | 2024-08-03 03:00:00 | 33.3 | 33.3 | 
| GPT-3.5 Turbo | 2024-08-03 03:00:00 | 30.0 | 26.7 | 
| Llama3.1-405B-Instruct | 2024-08-03 03:00:00 | 13.3 | 16.7 | 

  Generation Track

| Model | Submission Time (GMT) | EM | F1 | 
|---|---|---|---|
| GPT-3.5 Turbo + Google Custom Search | 2024-08-03 03:00:00 | 23.3 | 31.0 | 
| GPT-4o + Google Custom Search | 2024-08-03 03:00:00 | 20.0 | 29.0 | 
| GPT-4o | 2024-08-03 03:00:00 | 20.0 | 23.6 | 
| Llama3.1-405B-Instruct + Google Custom Search | 2024-08-03 03:00:00 | 16.7 | 23.8 | 
| Gemini 1.5 Flash + Google Custom Search | 2024-08-03 03:00:00 | 16.7 | 22.6 | 
| GPT-3.5 Turbo | 2024-08-03 03:00:00 | 13.3 | 24.3 | 
| Llama3.1-405B-Instruct | 2024-08-03 03:00:00 | 13.3 | 18.7 | 
| Gemini 1.5 Flash | 2024-08-03 03:00:00 | 6.7 | 11.3 | 
| Claude-3.5 Haiku | 2024-08-03 03:00:00 | 3.3 | 6.2 | 
| Claude-3.5 Haiku + Google Custom Search | 2024-08-03 03:00:00 | 0.0 | 7.5 | 
| Claude-3.5 Sonnet + Google Custom Search | 2024-08-03 03:00:00 | 0.0 | 7.5 | 
| Claude-3.5 Sonnet | 2024-08-03 03:00:00 | 0.0 | 7.0 |
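
The EM and F1 columns are not defined in this listing. As a minimal sketch, assuming they follow the common SQuAD-style definitions (exact match after answer normalization, and token-overlap F1, both averaged over questions and reported as percentages), scores of this kind could be computed as below. The function names `normalize`, `exact_match`, `token_f1`, and `score` are hypothetical, and the leaderboard's own scorer may normalize answers differently.

```python
import re
import string
from collections import Counter


def normalize(text: str) -> str:
    """Lowercase, drop punctuation and English articles, collapse whitespace."""
    text = text.lower()
    text = "".join(ch for ch in text if ch not in string.punctuation)
    text = re.sub(r"\b(a|an|the)\b", " ", text)
    return " ".join(text.split())


def exact_match(prediction: str, gold: str) -> float:
    """1.0 if the normalized strings are identical, else 0.0."""
    return float(normalize(prediction) == normalize(gold))


def token_f1(prediction: str, gold: str) -> float:
    """Harmonic mean of token-level precision and recall."""
    pred_tokens = normalize(prediction).split()
    gold_tokens = normalize(gold).split()
    if not pred_tokens or not gold_tokens:
        # Both empty counts as a match; one empty counts as a miss.
        return float(pred_tokens == gold_tokens)
    overlap = sum((Counter(pred_tokens) & Counter(gold_tokens)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred_tokens)
    recall = overlap / len(gold_tokens)
    return 2 * precision * recall / (precision + recall)


def score(predictions, golds):
    """Return (EM, F1) as percentages rounded to one decimal place."""
    n = len(golds)
    em = 100.0 * sum(exact_match(p, g) for p, g in zip(predictions, golds)) / n
    f1 = 100.0 * sum(token_f1(p, g) for p, g in zip(predictions, golds)) / n
    return round(em, 1), round(f1, 1)
```

Under these assumed definitions F1 gives partial credit for overlapping tokens, which would explain how a row can show an EM of 0.0 alongside a nonzero F1.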