Showing all evaluation blueprints tagged with "foia".
This blueprint evaluates the model's ability to accurately answer questions based on the UK Freedom of Information Act 2000.
Unique Versions: 1