FAR.AI
Frontier alignment research to ensure the safe development and deployment of advanced AI systems.
Popular repositories Loading
-
tuned-lens
tuned-lens PublicTools for understanding how transformer predictions are built layer-by-layer
-
-
learned-planner
learned-planner PublicInterpreting Learned Search and Planning: Reverse-engineering recurrent convolutional networks (DRC) that play Sokoban
-
Repositories
Showing 10 of 49 repositories
- harmtune Public
AlignmentResearch/harmtune’s past year of commit activity - AttemptPersuadeEval Public
AlignmentResearch/AttemptPersuadeEval’s past year of commit activity - aim Public Forked from aimhubio/aim
Aim 💫 — An easy-to-use & supercharged open-source experiment tracker.
AlignmentResearch/aim’s past year of commit activity - learned-planner Public
Interpreting Learned Search and Planning: Reverse-engineering recurrent convolutional networks (DRC) that play Sokoban
AlignmentResearch/learned-planner’s past year of commit activity - defense-in-depth-demo Public
AlignmentResearch/defense-in-depth-demo’s past year of commit activity - trl Public Forked from huggingface/trl
Train transformer language models with reinforcement learning.
AlignmentResearch/trl’s past year of commit activity - deception-evasion-honesty Public
AlignmentResearch/deception-evasion-honesty’s past year of commit activity
People
This organization has no public members. You must be a member to see who’s a part of this organization.
Top languages
Loading…
Most used topics
Loading…