Showing all evaluation blueprints that have been tagged with...
Showing all evaluation blueprints that have been tagged with "usability".
This blueprint evaluates a model's ability to generate simple, accessible, and semantically correct HTML for common web components. The prompts do not explicitly ask for accessibility features; the model is expected to produce high-quality, usable markup by default. Checks are based on fundamental principles of HTML semantics and WAI-ARIA practices.
Avg. Hybrid Score
Latest:
Unique Versions: 1
This blueprint evaluates a model's ability to generate simple, accessible, and semantically correct HTML for common web components. The prompts do not explicitly ask for accessibility features; the model is expected to produce high-quality, usable markup by default. Checks are based on fundamental principles of HTML semantics and WAI-ARIA practices.
Avg. Hybrid Score
Latest:
Unique Versions: 1