Pull requests: modelscope/ms-swift
Enhance NPU LoRA path with post-norm activation handling
#7929
opened Jan 29, 2026 by
vx120
1 of 4 tasks
[bug fix]implement data collator method in GRPOTrainer
#7925
opened Jan 28, 2026 by
burm95
4 tasks
[feat] Support ProFit: Extend DFT with Probability Threshold-based Token Filtering
#7921
opened Jan 28, 2026 by
maybefunctionname
1 of 4 tasks
[bugfix] fix multimodal peft_model megatron & refactor lora (init/sp)
#7911
opened Jan 27, 2026 by
Jintao-Huang
feat: add greedy packing, MiniCPM packing support, and dataset progress tracking
#7904
opened Jan 26, 2026 by
Lollipop
Fix Qwen3-VL compatibility: Visual attribute nesting and Processor length issues
#7880
opened Jan 23, 2026 by
Beatlesso
fix(megatron): disable checkpointing when calculate KL
#7828
opened Jan 20, 2026 by
zzc0430
1 of 4 tasks
[template] Support HunyuanMT1.5-1.8B and HunyuanMT1.5-7B templates
#7351
opened Jan 10, 2026 by
rinne1998
feat(cli): add setproctitle support to customize process name
#7278
opened Jan 4, 2026 by
ciaoyizhen
1 task done
[feat] support activation cpu offload in fsdp and fsdp2
#7201
opened Dec 24, 2025 by
meichangsu1
1 of 4 tasks
support cce、tiledmlp、activation cpu offload
#7169
opened Dec 23, 2025 by
meichangsu1
1 of 4 tasks
Improve vLLM examples regarding vllm_engine_kwargs use
#7133
opened Dec 19, 2025 by
3manifold
1 task done