Skip to content

Milestones

List view

  • 0.1.0 is the first usable openinfer release: a Rust + CUDA serving engine with an OpenAI-compatible API, Qwen-family coverage, documented benchmarks, and clear project packaging. Scope is stabilization and presentation, not broad model support.

    No due date
    0/8 issues closed