weval

Loading analysis results...

Please wait while we prepare the detailed comparison.

Loading blueprint versions...

Please wait while we gather all the unique runs for this blueprint.

Analysis: Story Quickrun 1757615641721 Bc663801 E80f 4110 Bc4f E0a13fa5fa08 - Run story-q...

Travel Advice for Unusual Destination

Evaluate an AI's ability to provide accurate and helpful travel advice for a specific, slightly unusual destination, including cultural nuances and practical tips.

Best Models (Coverage)

1.GPT 4o Mini
60.7%
2.Gemini Flash 1.5
54.0%
3.Claude 3 Haiku
50.3%

View Blueprint

Select Prompt:

Macro Coverage Overview

Average key point coverage extent for each model across all prompts.

Pro Tip

Click on any result cell to open a detailed view.

Advanced view

Highlight best performers

Sort prompts by

Sort models by

Color Scale - Simplified View (Avg. Coverage)

Perfect

Excellent

Good

Fair

Poor

Bad

Not Met

	Claude 3 Haiku	Gemini Flash 1.5	GPT 4o Mini
Score	3rd 50.3%	2nd 54.0%	1st 60.7%
60.7%	63%	56%	63%
64.7%	69%	50%	75%
39.7%	19%	56%	44%