How models perform on this prompt
Arcee.ai Spotlight
Model answer: top center
ChatGPT-4o (High Reasoning)
Model answer: Top center
ChatGPT-4o (Medium Reasoning)
Model answer: Top center
Claude 3.7 Sonnet
Model answer: Based on the image, I can see what appears to be a small white mark or defect near the top edge of the gray panel/device. The defect is located at the top center of the product.
Cohere Aya Vision 32B
Model answer: top center
GPT-4.1
Model answer: Top center
GPT-4.1 Mini
Model answer: top center
GPT-5 (high reasoning)
Model answer: top center
GPT-5 Mini
Model answer: top center
GPT-5 with High Reasoning
Model answer: top center
Gemini 1.5 Pro
Model answer: top center
Gemini 2.0 Flash
Model answer: top center
Gemini 2.0 Flash Lite
Model answer: top center
Gemini 2.5 Flash
Model answer: top center
Gemini 2.5 Flash Lite
Model answer: top center
Gemini 2.5 Pro
Model answer: top center
Gemma 3 27b
Model answer: top center
Llama 4 Scout 17B
Model answer: top center
Mistral Medium 3
Model answer: top center
OpenAI O1
Model answer: top center
OpenAI O3 (High Reasoning)
Model answer: top center
OpenAI O3 (Medium Reasoning)
Model answer: top center
OpenAI O4 Mini (High Reasoning)
Model answer: top center
OpenAI O4 Mini (Medium Reasoning)
Model answer: top center
OpenAI o3-pro
Model answer: top center
Qwen 2.5 VL 7B
Model answer: top center
Claude 3.5 Haiku
Model answer: bottom right
Claude 4 Opus
Model answer: top
Claude 4 Sonnet
Model answer: center
Claude 4.1 Opus
Model answer: top
Cohere Aya Vision 8B
Model answer: The defect in the product is located at the top left corner.
GPT-4.1 Nano
Model answer: center
GPT-5
Model answer: top
GPT-5 Nano
Model answer: top
Gemini 1.5 Flash
Model answer: Top left
Gemma 3 4B
Model answer: top left
Llama 4 Maverick 17B
Model answer: center top
Mistral Small 3.1 24B
Model answer: top left
Phi 4 Multimodal
Model answer: top