Loading blueprint versions...
Please wait while we gather all the unique runs for this blueprint.
Please wait while we gather all the unique runs for this blueprint.
Please wait while we find all executions for this version.
Evaluates a model's ability to review verifiable scientific literature, explore novel theories, and integrate philosophy of science, metascience, interdisciplinary perspectives, and complex systems/second-order cybernetics principles.
Showing all recorded executions for Run Label sandbox-run.