Showing all evaluation blueprints that have been tagged with...
Showing all evaluation blueprints that have been tagged with "creative".
This blueprint evaluates a model's ability to generate SVG code to draw various shapes, from simple geometric figures to more complex objects. The prompts do not have assertions and are meant for qualitative review.
Avg. Hybrid Score
Latest:
Unique Versions: 1
This blueprint evaluates a model's ability to generate SVG code to draw various shapes, from simple geometric figures to more complex objects. The prompts do not have assertions and are meant for qualitative review.
Avg. Hybrid Score
Latest:
Unique Versions: 1