Branching Conversation Demo — Tiny Threads

TAGS:

Select Prompt:

Average key point coverage extent for each model across all prompts.

Click on any result cell to open a detailed view.

Advanced view

Highlight best performers

Sort prompts by

Sort models by

Color Scale - Simplified View (Avg. Coverage)

Perfect

Excellent

Good

Fair

Poor

Bad

Not Met

	Prompts vs. Models	Claude 3.5 Haiku	Gemini 2.5 Flash	Mistral Large 2411	GPT 4.1 Mini	GPT 4o Mini
Score		2nd 82.0%	1st 91.0%	4th 74.0%	3rd 80.0%	5th 72.0%
79.8%		82%	91%	74%	80%	72%