-
Notifications
You must be signed in to change notification settings - Fork 113
Pull requests: intel/auto-round
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
add support for MiMo-V2-Flash
#1718
opened Apr 22, 2026 by
n1ck-guo
Contributor
Loading…
1 of 9 tasks
Reduce xpu memory usage with patch_xpu_sdpa_drop_causal_mask
#1716
opened Apr 21, 2026 by
xin3he
Contributor
Loading…
2 of 9 tasks
feat: support Nemotron-H / Nemotron-Cascade-2 (#1711)
#1712
opened Apr 20, 2026 by
michael-rabe
•
Draft
4 of 9 tasks
Add MLX format export support for Apple Silicon
#1706
opened Apr 19, 2026 by
wenhuach
Loading…
9 tasks
Continuously optimize AutoScheme RAM consumption
#1703
opened Apr 17, 2026 by
lvliang-intel
Contributor
Loading…
2 of 9 tasks
support model_free WOQ quantization
#1699
opened Apr 17, 2026 by
xin3he
Contributor
Loading…
4 of 9 tasks
Fix Qwen Omni quantization model issue for long form audio generation
#1698
opened Apr 17, 2026 by
lvliang-intel
Contributor
Loading…
2 of 9 tasks
Fix
module.to("meta") for models with plain Tensors
#1688
opened Apr 15, 2026 by
yiliu30
Contributor
Loading…
1 of 9 tasks
Security: HTTP requests are performed without timeout safeguards
#1683
opened Apr 15, 2026 by
tomaioo
Loading…
Feats: Quantize/save/evaluate the Wan-AI/WAN2.2 models in w4a16 format
#1678
opened Apr 14, 2026 by
lvliang-intel
Contributor
Loading…
2 of 9 tasks
Refactor: use get_submodule with manual traversal fallback in get_module
#1677
opened Apr 13, 2026 by
yael-shr
Loading…
5 tasks done
fix gguf issue in alg_ext.py
#1649
opened Apr 2, 2026 by
wenhuach21
Contributor
Loading…
2 of 9 tasks
[Draft] Support TurboQuant KV-cache quantization
#1634
opened Mar 27, 2026 by
lvliang-intel
Contributor
•
Draft
2 of 9 tasks
Support ByteDance-Seed/BAGEL-7B-MoT quantization in w4a16 format
#1633
opened Mar 27, 2026 by
lvliang-intel
Contributor
Loading…
2 of 9 tasks
[Step1 ]new architecture for auto_round
api/new
engineering
ready
only add when the PR is ready to merge
[WIP][refactor quanizers][step 1] refactor rtn and tuning
#1278
opened Jan 14, 2026 by
n1ck-guo
Contributor
Loading…
add per-task lm_eval args for exprimental usage
Stale
#1017
opened Nov 11, 2025 by
WeiweiZhang1
Contributor
Loading…
ProTip!
Follow long discussions with comments:>50.