Wevala Collective Intelligence Project+ Create

Loading blueprint versions...

Please wait while we gather all the unique runs for this blueprint.

Analysis: Test Threads - Run fa0b200...

Branching Conversation Demo — Tiny Threads

A minimal example to show branching on a shared conversation history.

Common history is repeated in each fork.
Only the final user turn differs to create a branch.
Utterances are tiny (single words/short phrases) for clarity.

TAGS:

Instruction Following & Prompt Adherence

Clarity & Readability

Coherence & Conversational Flow

Helpfulness & Actionability

Best Models (Coverage)

1.GPT 4o Mini
98.0%
2.GPT 4.1 Mini
85.5%
3.Gemini 2.5 Flash
79.3%
4.Mistral Large 2411
75.0%
5.Claude 3.5 Haiku
62.5%

Select Prompt:

Macro Coverage Overview

Average key point coverage extent for each model across all prompts.

Pro Tip

Click on any result cell to open a detailed view.

Advanced view

Highlight best performers

Sort prompts by

Sort models by

Color Scale - Simplified View (Avg. Coverage)

Perfect

Excellent

Good

Fair

Poor

Bad

Not Met

	Prompts vs. Models	Claude 3.5 Haiku	Gemini 2.5 Flash	Mistral Large 2411	GPT 4.1 Mini	GPT 4o Mini
Score		5th 62.5%	3rd 79.3%	4th 75.0%	2nd 85.5%	1st 98.0%
63.4%		50%	50%	67%	50%	100%
67.6%		54%	67%	33%	92%	92%
99.2%		96%	100%	100%	100%	100%
90.0%		50%	100%	100%	100%	100%