Showing all evaluation blueprints that have been tagged with...
Showing all evaluation blueprints that have been tagged with "frontend".
This blueprint evaluates a model's ability to generate SVG code to draw various shapes, from simple geometric figures to more complex objects. The prompts do not have assertions and are meant for qualitative review.
Avg. Hybrid Score
Latest:
Unique Versions: 1
This blueprint evaluates a model's ability to generate simple, accessible, and semantically correct HTML for common web components. The prompts do not explicitly ask for accessibility features; the model is expected to produce high-quality, usable markup by default. Checks are based on fundamental principles of HTML semantics and WAI-ARIA practices.
Avg. Hybrid Score
Latest:
Unique Versions: 1
This blueprint evaluates a model's ability to generate SVG code to draw various shapes, from simple geometric figures to more complex objects. The prompts do not have assertions and are meant for qualitative review.
Avg. Hybrid Score
Latest:
Unique Versions: 1
This blueprint evaluates a model's ability to generate simple, accessible, and semantically correct HTML for common web components. The prompts do not explicitly ask for accessibility features; the model is expected to produce high-quality, usable markup by default. Checks are based on fundamental principles of HTML semantics and WAI-ARIA practices.
Avg. Hybrid Score
Latest:
Unique Versions: 1