adjustable glasses illustration eye chart illustration clipboard illustration lenses illustration

Prompt: Relation identification: left or right

39 out of 41 models answer this prompt correctly.
Prompt: Is the dump truck to the left or right of the wheel loader? Answer only left or right.
Answer: left
Submitted by: Roboflow
Relation identification: left or right

Contribute a Prompt

How models perform on this prompt

Arcee.ai Spotlight
ChatGPT-4o (High Reasoning)

Model answer: {"answer": "left"}

View more tests run on this model

ChatGPT-4o (Medium Reasoning)

Model answer: {"answer": "left"}

View more tests run on this model

Claude 3.5 Haiku
Claude 3.7 Sonnet

Model answer: Left. In the image, you can see a CAT 994 wheel loader in the foreground, and there appears to be a dump truck partially visible on the left side of the frame in the background.

View more tests run on this model

Claude 4 Opus
Claude 4 Sonnet
Claude 4.1 Opus
Claude 4.6 Opus

Model answer: {"answer": "left"}

View more tests run on this model

Cohere Aya Vision 32B
GPT-4.1

Model answer: {"answer": "left"}

View more tests run on this model

GPT-4.1 Mini

Model answer: {"answer": "left"}

View more tests run on this model

GPT-4.1 Nano

Model answer: {"answer": "left"}

View more tests run on this model

GPT-5

Model answer: {"answer":"left"}

View more tests run on this model

GPT-5 (high reasoning)

Model answer: {"answer": "left"}

View more tests run on this model

GPT-5 Mini

Model answer: {"answer": "left"}

View more tests run on this model

GPT-5 Nano

Model answer: {"answer": "left"}

View more tests run on this model

GPT-5.1

Model answer: {"answer": "left"}

View more tests run on this model

GPT-5.2

Model answer: {"answer":"left"}

View more tests run on this model

Gemini 2.0 Flash

Model answer: {"answer": "left"}

View more tests run on this model

Gemini 2.0 Flash Lite

Model answer: { "answer": "left" }

View more tests run on this model

Gemini 2.5 Flash

Model answer: {"answer": "left"}

View more tests run on this model

Gemini 2.5 Flash Lite

Model answer: {"answer": "left"}

View more tests run on this model

Gemini 2.5 Pro

Model answer: {"answer": "left"}

View more tests run on this model

Gemini 3 Flash

Model answer: {"answer": "left"}

View more tests run on this model

Gemini 3 Flash (Tools)

Model answer: {"answer": "left"}

View more tests run on this model

Gemini 3 Pro Preview

Model answer: {"answer": "left"}

View more tests run on this model

Gemma 3 27b
Llama 4 Maverick 17B
Llama 4 Scout 17B

Model answer: left.

View more tests run on this model

Mistral Medium 3
Mistral Small 3.1 24B

Model answer: Left.

View more tests run on this model

OpenAI O1

Model answer: {"answer": "left"}

View more tests run on this model

OpenAI O3 (High Reasoning)

Model answer: {"answer": "left"}

View more tests run on this model

OpenAI O3 (Medium Reasoning)

Model answer: {"answer": "left"}

View more tests run on this model

OpenAI O4 Mini (High Reasoning)

Model answer: {"answer": "left"}

View more tests run on this model

OpenAI O4 Mini (Medium Reasoning)

Model answer: {"answer": "left"}

View more tests run on this model

Phi 4 Multimodal
Qwen 2.5 VL 7B
Cohere Aya Vision 8B

Model answer: right

View more tests run on this model

Gemma 3 4B

Model answer: Right

View more tests run on this model