Loading blueprint versions...
Please wait while we gather all the unique runs for this blueprint.
Please wait while we gather all the unique runs for this blueprint.
Please wait while we find all executions for this version.
This blueprint evaluates a model's ability to generate simple, accessible, and semantically correct HTML for common web components. The prompts do not explicitly ask for accessibility features; the model is expected to produce high-quality, usable markup by default. Checks are based on fundamental principles of HTML semantics and WAI-ARIA practices.
Showing all recorded executions for Run Label 8e312db09f25d375.