Athena

Evals and environment packages (Prime / verifiers–style RLMs).

Environments

Directory	Description
`lhaw/`	LHAW RLM — underspecified prompts, `ask_user` clarification, reconstruction judge / native reward (`lhaw_rlm`)
`loca_bench_rlm/`	LOCA Bench RLM
`long_context_retrieval/`	Long-context retrieval
`discover_gsm8k/`	Discover GSM8K
`autoresearch/`	Autoresearch
`advanced_if/`	Advanced IF — `facebook/AdvancedIF` rubric induction via `RLMEnv` (file-backed trajectories + partial judge tool, `advanced_if`)

Each subdirectory is a self-contained package with its own pyproject.toml, uv.lock, and docs. Work inside that folder for install, eval, and packaging.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Athena

Environments

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 109 Commits
advanced_if		advanced_if
discover_gsm8k		discover_gsm8k
lhaw		lhaw
loca_bench_rlm		loca_bench_rlm
long_context_retrieval		long_context_retrieval
.gitignore		.gitignore
README.md		README.md

Folders and files

Latest commit

History

Repository files navigation

Athena

Environments

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages