Skip to content

Non-Stanford (following along): Is the Qwen 2.5 Math 1.5B different to HF? #3

@jashvira

Description

@jashvira

Hi,

I’m following along with the course materials as a non-Stanford participant. I noticed that when I run the Qwen 2.5 Math 1.5B model locally via Hugging Face, I get slightly different greedy decoding outputs compared to the tests.
Could you confirm:

Are the model weights used in the course identical to the Hugging Face Qwen/Qwen2.5-Math-1.5B release?

Or is there a Stanford-hosted version / checkpoint with any fine-tuning or other changes?

Thanks in advance!

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions