Pinned Loading
-
multimodal-interpretability/maia
multimodal-interpretability/maia PublicOfficial implementation of MAIA, A Multimodal Automated Interpretability Agent
-
multimodal-interpretability/FIND
multimodal-interpretability/FIND PublicOfficial implementation of FIND (NeurIPS '23) Function Interpretation Benchmark and Automated Interpretability Agents
-
Nix07/belief_tracking
Nix07/belief_tracking PublicThis repository contains the code used for the experiments in the paper "Language Models use Lookbacks to Track Beliefs".
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.