quic / efficient-transformers Public

Notifications You must be signed in to change notification settings
Fork 45
Star 72

Code
Issues
Pull requests 31
Actions
Projects
Security
Insights

Additional navigation options

Code
Issues
Pull requests
Actions
Projects
Security
Insights

Pull requests: quic/efficient-transformers

Labels 22 Milestones 0

New pull request New

31 Open 477 Closed

Author

Filter by author

Uh oh!

There was an error while loading. Please reload this page.

Label

Filter by label

Uh oh!

There was an error while loading. Please reload this page.

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Uh oh!

There was an error while loading. Please reload this page.

Milestones

Filter by milestone

Uh oh!

There was an error while loading. Please reload this page.

Reviews

Filter by reviews

No reviews Review required Approved review Changes requested

Assignee

Filter by who’s assigned

Assigned to nobody

Uh oh!

There was an error while loading. Please reload this page.

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Pull requests list

Logger module in Efficient Transformers 1.21.0 wip

Work in progress

#517 opened Jul 11, 2025 by quic-hemagnih • Draft

[QEff Finetune] : fix task_type variable in configs

#514 opened Jul 10, 2025 by quic-mamta • Draft

Added MIN_MASKED_ATTENTION_VALUE

#513 opened Jul 10, 2025 by quic-amitraj

Loading…

Added --iteration and --automation flags

#512 opened Jul 10, 2025 by asmigosw • Draft

Llama4 VLM Continuous Batching Support

#510 opened Jul 9, 2025 by mohiso22

Loading…

[Olmo2]: Add Support for Olmo2 CausalLM Model in QEff 1.21.0 enhancement

New feature or request

#509 opened Jul 9, 2025 by vbaddi

Loading…

[QEff Finetune]: Fix for padding.

#503 opened Jul 8, 2025 by quic-swatia • Draft

Jina model support [experimental]

#502 opened Jul 8, 2025 by quic-amitraj • Draft

[Docs]: Add Release Documentation for Version 1.20.0

#501 opened Jul 8, 2025 by abukhoy

Loading…

Hybrid chunked cache update

#500 opened Jul 8, 2025 by quic-amitraj • Draft

Reading mxfp6_matmul for QNN Compilation path from compile API arguments 1.21.0

#499 opened Jul 7, 2025 by shubhagr-qc

Loading…

default NPI file added 1.20.0

#498 opened Jul 7, 2025 by quic-akuruvil

Loading…

[QEff. Finetune] Fixed reporting of single value of loss and ppl across devices.

#496 opened Jul 7, 2025 by quic-meetkuma • Draft

Added env var- DYNAMIC_CACHE to switch from HCC to DC

#489 opened Jul 3, 2025 by asmigosw • Draft

[Llama4]: Add support for padding num_patches 1.20.0 enhancement

New feature or request

#486 opened Jul 1, 2025 by vbaddi

Loading…

Changing the hashing methodology for cache folder creation of models. 1.21.0

#481 opened Jun 24, 2025 by quic-dhirajku • Draft

adding Context Length Specialization (CCL) 1.21.0

#466 opened Jun 19, 2025 by quic-vjanfaza

Loading…

Unit Tests for On Device Sampling 1.20.0

#463 opened Jun 18, 2025 by quic-sanising

Loading…

Updated get_available_device_id logic 1.21.0

#445 opened Jun 11, 2025 by quic-rishinr

Loading…

[QEff Finetune] : Made fixes to training script fine-tuning

#439 opened Jun 10, 2025 by quic-mamta • Draft

Addition of MIN_MASKED_ATTN_VALUE

#433 opened Jun 6, 2025 by quic-amitraj

Loading…

[Tests]: Adding dummy causal models for testing in regular CI run 1.21.0 ready for review

#427 opened May 29, 2025 by abukhoy

Loading…

Added Prompt length check for VLMs

#422 opened May 21, 2025 by asmigosw

Loading…

Dependency package upgrade 1.21.0

#407 opened May 15, 2025 by qcdipankar

Loading…

Qwen3moe model-enablement

#406 opened May 15, 2025 by qcdipankar

Loading…

Previous 1 2 Next

Previous Next

ProTip! Updated in the last three days: updated:>2025-07-09.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Uh oh!