Repositories list
24 repositories
- Vivaria is METR's tool for running evaluations and conducting agent elicitation research.
- viv-task-dev (Public)
- inspect_ai (Public)
- task-standard (Public)
- ai-rd-tasks (Public)
- agent-fork-of-eval-analysis-public (Public archive)
- llm-foundry (Public)
- metr-task-boilerplate (Public)
- task-protected-scoring (Public)
- task-artifacts (Public)
- SWE-bench-fork (Public)
- inspect_k8s_sandbox (Public)
- public-tasks (Public)
- worktest-sw-eng-deps (Public)
- task-assets (Public)
- .github (Public)
- nanoGPT (Public)
- autonomy-evals-guide (Public)
- task-legacy-verifier (Public)
- task-aux-vm-helpers (Public)
- pyhooks (Public archive)
- vivaria-mentat (Public archive)
- task-template (Public template)