Support for SSD expert streaming is not being detected correctly.

https://github.com/SharpAI/SwiftLM/blob/d5a9d118910142ce092fc4357777884a61bb8137/Sources/SwiftLM/Server.swift#L348

It seems this gets (wrongly) triggered when running with:
 --model Qwen/Qwen3.5-397B-A17B (or an 8-bit MXFP8 quantized version thereof) --stream-experts
Resulting in:

```shell
"Model does not support SSD expert streaming (qwen3_5_moe is not MoE). Ignoring --stream-experts flag."
``` 

Along with a subsequent:
```shell
"zsh: killed ./SwiftLM....." 
``` 


Running with the same --model and ./SwiftLM parameters works fine when using the latest release binary (SwiftLM b648).

Example:
```shell
./SwiftLM \
  --model "/Users/user/models/Qwen3.5-397B-A17B-mxfp8-grp32" \
  --port 5413 --stream-experts --thinking --ssd-prefetch
``` 

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Support for SSD expert streaming is not being detected correctly. #112

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Uh oh!

Support for SSD expert streaming is not being detected correctly. #112

Description

Metadata

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Issue actions