Change the repository type filter
All
Repositories list
22 repositories
- Vivaria is METR's tool for running evaluations and conducting agent elicitation research.
eval-analysis-public
Publicllm-foundry
Publicmetr-task-boilerplate
Publictask-protected-scoring
Publictask-artifacts
PublicSWE-bench-fork
Publicinspect_k8s_sandbox
Publicviv-task-dev
Publicpublic-tasks
Publictask-standard
Publicworktest-sw-eng-deps
Publictask-assets
Public.github
PublicnanoGPT
Publicautonomy-evals-guide
Publictask-legacy-verifier
Publictask-aux-vm-helpers
Publicpyhooks
Public archivevivaria-mentat
Public archivetask-template
Public template