GitHub - epoch-research/epochai-python: Epoch AI client library for Python

This repository contains the Python client library of Epoch AI. At the moment, only one feature is supported: reading from our database of ML models and benchmark results.

Installation

pip install epochai

Usage

Reading from our Airtable database of ML models and benchmark results

A few preparatory steps are required:

Open our Airtable base
Airtable doesn't allow public API access, so you'll have to make a copy of the base (unless you are an Epoch AI team member).
Define the AIRTABLE_BASE_ID environment variable with the ID of the base you just copied. (The ID is in the URL and starts with app.)
Create an Airtable API key with access to the base, and the following scopes: data.records:read, schema.bases:read. Define the AIRTABLE_PERSONAL_ACCESS_TOKEN environment variable with the key.

You're now ready to use the library. The database models are defined in epochai.airtable.models.

You can get started with our example script examples/airtable.py, or try the snippets below.

from epochai.airtable.models import MLModel, Task, Score, Organization, BenchmarkRun

# Get everything at the start to minimize API calls
scores = Score.all(memoize=True)
runs = BenchmarkRun.all(memoize=True)
models = MLModel.all(memoize=True)
tasks = Task.all(memoize=True)
organizations = Organization.all(memoize=True)

Print information about a model:

print_model_info("claude-3-5-sonnet-20240620")

Print the highest scores for a benchmark and scorer:

print_high_scores(
    task_path="bench.task.hendrycks_math.hendrycks_math_lvl_5",
    scorer="model_graded_equiv",
    scores=scores
)

Track the best-performing model to date over time:

print_performance_timeline(
    task_path="bench.task.gpqa.gpqa_diamond",
    scorer="choice",
    scores=scores
)

Name		Name	Last commit message	Last commit date
Latest commit History 29 Commits
assets		assets
epochai		epochai
examples		examples
tests		tests
.bumpversion.cfg		.bumpversion.cfg
.env.example		.env.example
.pre-commit-config.yaml		.pre-commit-config.yaml
.tool-versions		.tool-versions
LICENSE		LICENSE
README.md		README.md
poetry.lock		poetry.lock
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Installation

Usage

Reading from our Airtable database of ML models and benchmark results

About

Uh oh!

Releases

Packages

Uh oh!

Contributors 2

Uh oh!

Languages

License

epoch-research/epochai-python

Folders and files

Latest commit

History

Repository files navigation

Installation

Usage

Reading from our Airtable database of ML models and benchmark results

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 2

Uh oh!

Languages

Packages