Skip to content

Pull requests: quic/efficient-transformers

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Added MIN_MASKED_ATTENTION_VALUE
#513 opened Jul 10, 2025 by quic-amitraj Loading…
Llama4 VLM Continuous Batching Support
#510 opened Jul 9, 2025 by mohiso22 Loading…
[Olmo2]: Add Support for Olmo2 CausalLM Model in QEff 1.21.0 enhancement New feature or request
#509 opened Jul 9, 2025 by vbaddi Loading…
Hybrid chunked cache update
#500 opened Jul 8, 2025 by quic-amitraj Draft
default NPI file added 1.20.0
#498 opened Jul 7, 2025 by quic-akuruvil Loading…
[Llama4]: Add support for padding num_patches 1.20.0 enhancement New feature or request
#486 opened Jul 1, 2025 by vbaddi Loading…
Unit Tests for On Device Sampling 1.20.0
#463 opened Jun 18, 2025 by quic-sanising Loading…
Updated get_available_device_id logic 1.21.0
#445 opened Jun 11, 2025 by quic-rishinr Loading…
Addition of MIN_MASKED_ATTN_VALUE
#433 opened Jun 6, 2025 by quic-amitraj Loading…
Added Prompt length check for VLMs
#422 opened May 21, 2025 by asmigosw Loading…
Dependency package upgrade 1.21.0
#407 opened May 15, 2025 by qcdipankar Loading…
Qwen3moe model-enablement
#406 opened May 15, 2025 by qcdipankar Loading…
ProTip! Updated in the last three days: updated:>2025-07-09.