Skip to content

New record submissions for review (#1583, #1584, #1585) #1587

@codemath3000

Description

@codemath3000

@valerio-oai @0hq @cocohearts @openai/parameter-golf-team Hi! I've submitted three related record PRs applying systems-level performance optimizations (fused Muon kernel, batched EMA, loader prealloc) to different base stacks, each improving val_bpb over the respective baseline:

They're submitted against multiple bases so a ready-to-merge option exists regardless of how the pending PRs are resolved.

This is my first submission to the challenge, so I apologize in advance if I've gotten anything wrong — I've done my best to follow the submission guidelines. Happy to address any issues. Thank you for taking the time to review these, I know you're busy!

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions