adjustable glasses illustration eye chart illustration clipboard illustration lenses illustration

Prompt: Missing pill in pack classification

38 out of 52 models answer this prompt correctly.
Prompt: Are there any missing pills? Return only yes or no.
Answer: Yes
Submitted by: Roboflow
Missing pill in pack classification

Contribute a Prompt

How models perform on this prompt

Arcee.ai Spotlight
ChatGPT-4o (Medium Reasoning)

Model answer: {"answer": "yes"}

View more tests run on this model

Claude 3.5 Haiku
Claude 3.7 Sonnet
Claude 4 Sonnet
Claude 4.6 Opus

Model answer: {"answer": "yes"}

View more tests run on this model

Cohere Aya Vision 32B
Cohere Aya Vision 8B
GLM 4.6v

Model answer: {"answer": "yes"}

View more tests run on this model

GPT-4.1

Model answer: {"answer": "yes"}

View more tests run on this model

GPT-4.1 Mini

Model answer: {"answer": "yes"}

View more tests run on this model

GPT-5 Mini

Model answer: {"answer": "yes"}

View more tests run on this model

GPT-5 Nano

Model answer: {"answer": "yes"}

View more tests run on this model

GPT-5.4

Model answer: {"answer":"yes"}

View more tests run on this model

Gemini 2.0 Flash

Model answer: {"answer": "yes"}

View more tests run on this model

Gemini 2.0 Flash Lite

Model answer: { "answer": "yes" }

View more tests run on this model

Gemini 2.5 Flash Lite

Model answer: {"answer": "yes"}

View more tests run on this model

Gemini 2.5 Pro

Model answer: {"answer": "yes"}

View more tests run on this model

Gemini 3 Flash

Model answer: {"answer": "yes"}

View more tests run on this model

Gemini 3 Flash (Tools)

Model answer: {"answer": "yes"}

View more tests run on this model

Gemini 3.1 Pro

Model answer: {"answer": "yes"}

View more tests run on this model

Gemini 3.1 Pro (Tools)

Model answer: {"answer": "yes"}

View more tests run on this model

Gemma 3 27b
Kimi k2.5

Model answer: {"answer": "yes"}

View more tests run on this model

Llama 4 Maverick 17B
Llama 4 Scout 17B
Mistral Medium 3
Molmo2 8B

Model answer: {"answer": "yes"}

View more tests run on this model

OpenAI O4 Mini (Medium Reasoning)

Model answer: {"answer": "yes"}

View more tests run on this model

Qwen 2.5 VL 7B
Qwen 3.5 0.8B
Qwen 3.5 122B (A10B)

Model answer: {"answer": "yes"}

View more tests run on this model

Qwen 3.5 27B

Model answer: { "answer": "yes" }

View more tests run on this model

Qwen 3.5 2B
Qwen 3.5 35B (A3B)

Model answer: {"answer": "yes"}

View more tests run on this model

Qwen 3.5 4B

Model answer: {"answer": "yes"}

View more tests run on this model

Qwen 3.5 9B

Model answer: {"answer": "yes"}

View more tests run on this model

Qwen 3.5 Plus

Model answer: {"answer": "yes"}

View more tests run on this model

Claude 4 Opus
Claude 4.1 Opus
Cosmos Reason2 2B

Model answer: {"answer": "no"}

View more tests run on this model

Cosmos Reason2 8B

Model answer: {"answer": "no"}

View more tests run on this model

GPT-4.1 Nano

Model answer: {"answer": "no"}

View more tests run on this model

Gemini 2.5 Flash

Model answer: {"answer": "no"}

View more tests run on this model

Gemma 3 4B
Grok 4

Model answer: {"answer": "no"}

View more tests run on this model

Grok 4.1 Fast

Model answer: {"answer": "no"}

View more tests run on this model

Mistral Small 3.1 24B

Model answer: It is impossible to determine from the image since we don't know how many pills should be on the pack.

View more tests run on this model

OpenAI O1

Model answer: {"answer": "no"}

View more tests run on this model

Phi 4 Multimodal
Reka Edge

Model answer: No missing pills are visible in the blister pack. **Answer:** yes

View more tests run on this model

SmolVLM2 2.2B