Pull requests: THUDM/slime

Pull requests list

feat: add Qwen3.5-4B model support
#1721 opened Mar 13, 2026 by shihaohou
small fix on qwen3-235b-a22b launch script
#1719 opened Mar 12, 2026 by Zhuohao-Li
Add Mooncake Backend for Rollout Data Transfer (label: run-ci-megatron)
#1709 opened Mar 11, 2026 by zxpdemonio
6 tasks done
fix: auto-detect GPUs in qwen3-4b script
#1700 opened Mar 10, 2026 by ailuntz
fix: make ray actor gpu fractions configurable
#1699 opened Mar 10, 2026 by ailuntz
fix: accept unboxed math answers
#1698 opened Mar 10, 2026 by ailuntz
fix: default reward for aborted samples
#1697 opened Mar 10, 2026 by ailuntz
fix: handle missing sglang cuda-graph constant
#1696 opened Mar 10, 2026 by ailuntz
PipelineRL -- keep cache on weight update
#1694 opened Mar 9, 2026 by hari-hm
fix: quote $MOE_LAYER_FREQ
#1689 opened Mar 8, 2026 by lawrence-harmonic
internv3.5 support
#1660 opened Mar 3, 2026 by samaritan1998
fix: normalize rewards per-group when sample counts are unequal
#1655 opened Mar 2, 2026 by dubin555
2 of 3 tasks
feat: Add knowledge distillation example with offline support
#1654 opened Mar 2, 2026 by tourzhao
3 tasks
Refactor code safety checks by removing patterns
#1643 opened Feb 28, 2026 by Rohan5commit
[Feature] Add modular tracking interface with MLflow backend
#1591 opened Feb 17, 2026 by mouad-hpc
4 tasks done