Skip to content
Change the repository type filter

All

    Repositories list

    • Python
      Other
      65522Updated Jan 30, 2025Jan 30, 2025
    • vivaria

      Public
      Vivaria is METR's tool for running evaluations and conducting agent elicitation research.
      TypeScript
      MIT License
      257722715Updated Jan 30, 2025Jan 30, 2025
    • Public repository containing METR's DVC pipeline for eval data analysis
      Python
      2030Updated Jan 28, 2025Jan 28, 2025
    • LLM training code for Databricks foundation models
      Python
      Apache License 2.0
      541000Updated Jan 28, 2025Jan 28, 2025
    • A Cookiecutter template for developing tasks according to the METR Task Standard
      TypeScript
      0100Updated Jan 22, 2025Jan 22, 2025
    • Python
      1032Updated Jan 21, 2025Jan 21, 2025
    • Python
      0000Updated Jan 21, 2025Jan 21, 2025
    • [ICLR 2024] SWE-bench: Can Language Models Resolve Real-world Github Issues?
      Python
      MIT License
      401001Updated Jan 15, 2025Jan 15, 2025
    • A Kubernetes sandbox environment for use with inspect_ai
      Python
      MIT License
      3000Updated Jan 14, 2025Jan 14, 2025
    • Shell
      1070Updated Jan 10, 2025Jan 10, 2025
    • TeX
      Other
      68202Updated Jan 9, 2025Jan 9, 2025
    • METR Task Standard
      TypeScript
      MIT License
      3213763Updated Jan 3, 2025Jan 3, 2025
    • Dockerfile
      0000Updated Dec 29, 2024Dec 29, 2024
    • Python
      0010Updated Dec 28, 2024Dec 28, 2024
    • .github

      Public
      0000Updated Nov 24, 2024Nov 24, 2024
    • nanoGPT

      Public
      The simplest, fastest repository for training/finetuning medium-sized GPTs.
      Python
      MIT License
      6.3k000Updated Nov 22, 2024Nov 22, 2024
    • SCSS
      MIT License
      4301Updated Nov 21, 2024Nov 21, 2024
    • Python
      0000Updated Nov 8, 2024Nov 8, 2024
    • Python
      1000Updated Nov 2, 2024Nov 2, 2024
    • pyhooks

      Public archive
      A library that METR agents use to communicate with Vivaria.
      Python
      1010Updated Sep 22, 2024Sep 22, 2024
    • vivaria-mentat

      Public archive
      Vivaria is METR's tool for running evaluations and conducting agent elicitation research.
      TypeScript
      MIT License
      25011Updated Sep 19, 2024Sep 19, 2024
    • task-template

      Public template
      TypeScript
      6923Updated Aug 6, 2024Aug 6, 2024