MoE Commnucator design doc by Binyang2014 · Pull Request #818 · microsoft/mscclpp

Binyang2014 · 2026-06-16T00:40:34Z

Add API doc for MoE communication

…high-level API Unify the high-level MoECommunicator to select its backend from MoECommunicatorConfig.mode: - mode="ll": low-latency (EXPERT_MAJOR) via MoERuntime (reused from binyli/ep, PR #818). The LL runtime is built lazily, so a build that only binds the HT Buffer can still use mode="ht" without MoERuntime being present. - mode="ht": high-throughput (FLAT) via the DeepEP-style Buffer; intranode vs internode is auto-selected from the RDMA buffer-size hint. dispatch() gains an optional previous_handle that reuses the routing layout from a prior dispatch with identical topk_ids (cached intranode dispatch also skips notify_dispatch's host-side counter wait), letting a benchmark isolate the on-GPU dispatch-kernel cost (NCCL-EP ep_bench convention). Rewrite the intranode/internode HT benchmark loops to drive the public MoECommunicator(mode="ht") API instead of raw Buffer calls. Export MoERuntime. Validated on 1 node x 4 GB200 GPUs: correctness PASS; dispatch/combine match the raw-Buffer baseline under identical env (no high-level overhead).

MoE Commnucator design doc

29847e6

Binyang2014 requested review from a team, caiomcbr, chhwang, mahdiehghazim and seagater June 16, 2026 00:41

Binyang2014 added 3 commits June 16, 2026 01:14

update

74dce95

WIP

dcd16d4

lint

57fb704

Binyang2014 marked this pull request as ready for review June 16, 2026 15:55

Binyang2014 added 3 commits June 16, 2026 08:55

Merge branch 'feature/ep' into binyli/ep

bb31c15

WIP

de91542

lint

f4fbd09

Binyang2014 marked this pull request as draft June 19, 2026 22:19

Binyang2014 added 14 commits June 21, 2026 01:40

WIP

125cb02

WIP

4c7e95a

WIP

5e0c1de

WIP

bd0f15b

WIP

354bc34

udpate

374e8b2

update

6ecb19c

update

cb3d553

WIP

e30d64e

rename

49601f9

WIP

2d0b8e2

WIP

7b25bd3

fix

cb04524

FIx

bee4ea9

WIP

c17eaf3

Binyang2014 added 3 commits June 26, 2026 04:22

update

3545225

WIP

a900f00

WIP

684903e

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

MoE Commnucator design doc#818

MoE Commnucator design doc#818
Binyang2014 wants to merge 25 commits into
feature/epfrom
binyli/ep

Binyang2014 commented Jun 16, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Uh oh!

Conversation

Binyang2014 commented Jun 16, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Binyang2014 commented Jun 16, 2026 •

edited

Loading