How models perform on this prompt
Arcee.ai Spotlight
Model answer: left
ChatGPT-4o
Model answer: Left
Claude 3.5 Haiku
Model answer: Left
Claude 3.7 Sonnet
Model answer: Left. In the image, you can see a large Caterpillar (CAT) 994 wheel loader in the foreground, and there appears to be a dump truck partially visible on the left side of the image in the background.
Claude 4 Opus
Model answer: Left
Claude 4 Sonnet
Model answer: Left
Cohere Aya Vision 32B
Model answer: Left
GPT-4.1
Model answer: Left
GPT-4.1 Mini
Model answer: left
GPT-4.1 Nano
Model answer: Left
Gemini 1.5 Flash
Model answer: Left
Gemini 1.5 Pro
Model answer: Left
Gemini 2.0 Flash
Model answer: Left
Gemini 2.0 Flash Lite
Model answer: Left
Gemini 2.5 Flash Preview
Model answer: left
Gemini 2.5 Pro Preview
Model answer: left
Gemma 3 27b
Model answer: left
Llama 3 11B Vision
Model answer: left
Llama 4 Maverick 17B
Model answer: left
Llama 4 Scout 17B
Model answer: left
Mistral Medium 3
Model answer: left
Mistral Small 3.1 24B
Model answer: Left
Mistral Small 3.1 24b
Model answer: Left
OpenAI O1
Model answer: left
OpenAI O3
Model answer: left
OpenAI O4 Mini
Model answer: left
OpenAI o3-pro
Model answer: left
Phi 4 Multimodal
Model answer: Left
Qwen 2.5 VL 7B
Model answer: left
Cohere Aya Vision 8B
Model answer: Right.
Gemma 3 4B
Model answer: Right