Skip to content

Conversation

@dcw02
Copy link

@dcw02 dcw02 commented Oct 10, 2025

Motivation

** Draft PR. This is currently WIP **

Add eagle3 support for qwen3_vl and qwen3_vl_moe models.

Modifications

Related Issues

Accuracy Test

Benchmark & Profiling

Checklist

@dcw02 dcw02 closed this Oct 13, 2025
@dcw02 dcw02 reopened this Oct 13, 2025
@dcw02
Copy link
Author

dcw02 commented Oct 13, 2025

Training finishes for tp size 1 and both sdpa/flex attention backends. Loss/acc curves look ok, going to add qwen3_vl_moe eagle3 support into sglang/vllm (so I can eval) before adding tp size > 1 support.

@dcw02 dcw02 changed the title [Feature] Qwen3-VL-30B-A3B-Instruct eagle3 support [Feature] Qwen3 VL eagle3 support Oct 20, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant