add documents for llm-compressor fp8 quant

23b59e4
Open

Enable turbomind inference for fp8 models quantized by llm-compressor #4509

Annotations

1 warning
cuda-12.8: succeeded Apr 18, 2026 in 33m 30s