-
Notifications
You must be signed in to change notification settings - Fork 59
Pull requests: quic/efficient-transformers
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Prefill+decode gpt oss
1.21.0
enhancement
New feature or request
#608
opened Nov 5, 2025 by
ochougul
Loading…
Extend on-device sampling support for dual QPC VLMs
#597
opened Oct 24, 2025 by
quic-xiyushi
Loading…
Modified qwen_2.5 modelling file to allow replicate_kv_script to work for custom num_kv_heads.
#595
opened Oct 18, 2025 by
quic-dhirajku
Loading…
[Upgradation]: onnx opset version updated from 13 to 17
#587
opened Oct 14, 2025 by
abukhoy
Loading…
Example walk through on how to onboard a Causal LM on Qefficient Transformers.
#574
opened Sep 24, 2025 by
quic-dhirajku
Loading…
Logger Module For Efficient Transformers
1.21.0
wip
Work in progress
#555
opened Sep 10, 2025 by
quic-hemagnih
•
Draft
Previous Next
ProTip!
What’s not been updated in a month: updated:<2025-10-07.