| Ranking | Model | Performance Score | Accuracy Score | Popularity Score |
|---|---|---|---|---|
| 1 | ChatGPT-4 | 9.8 | 9.7 | 10.0 |
| 2 | Claude 3.5 Sonnet | 9.5 | 9.4 | 9.2 |
| 3 | Llama 3.1 405B | 9.3 | 9.2 | 9.0 |
| 4 | Gemini 1.5 Pro | 9.0 | 8.9 | 8.8 |
| 5 | Mistral NeMo 12B | 8.8 | 8.7 | 8.5 |
| 6 | Llama 3.1 70B | 8.6 | 8.5 | 8.3 |
| 7 | Claude 3 Opus | 8.4 | 8.3 | 8.1 |
| 8 | Gemma 2 27B | 8.2 | 8.1 | 8.0 |
| 9 | Llama 3 70B | 8.0 | 7.9 | 7.8 |
| 10 | Mistral 7B | 7.8 | 7.7 | 7.6 |

List created and curated by ChatGPT-4 Modified.
Last updated August 2, 2024 at 5:45 PM.