Fix a crash on multiple active LoRa (issue 18050) #18375
Closed
Split command line parameters and runtime adapter info into different structs (see the sketch after this list).
Bump max graph size according to LoRa count and tensor size.
Fixes bug #18050
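As a rough illustration of the split, here is a minimal sketch; the struct and field names below are hypothetical, not necessarily the ones used in this PR:

```cpp
// Minimal sketch of the split; names are illustrative, not the PR's actual types.
#include <string>

#include "llama.h"

// What the user asked for on the command line: plain, freely copyable data.
struct lora_cli_param {
    std::string path;         // path to the LoRa adapter file
    float       scale = 1.0f; // requested scale (negative values are valid, e.g. -1.0f)
};

// What exists at runtime after loading: the adapter handle plus its active scale.
// The handle is non-owning here; ownership stays with the init result (see below).
struct lora_runtime_info {
    llama_adapter_lora * adapter = nullptr;
    float                scale   = 1.0f;
};
```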
Some rationale on the changes. LoRa adapters can be as complex as the original model, hooking into every stage, i.e. Wk, Wq, Wv in attention and Wup, Wgate, Wdown in the FFN. With multiple LoRa adapters the graph size can grow to more than double that of the original graph, which leads to the #18050 crash; this means you cannot safely add arbitrary LoRa adapters unless you specifically initialized `llama_context` via `common_init_from_params` for these specific adapters.
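The graph-size bump is roughly of this shape; the function name and constants below are hypothetical and only meant to show that the node budget has to grow with the LoRa tensor count, not just the base model:

```cpp
// Hypothetical sketch: size the graph node budget for the base model *plus*
// all LoRa tensors, instead of for the base model alone.
#include <algorithm>
#include <cstdint>

static uint32_t graph_max_nodes(uint32_t n_model_tensors, uint32_t n_lora_tensors) {
    // every hooked weight adds extra matmul/add nodes per LoRa adapter,
    // so the budget must scale with the combined tensor count
    const uint32_t n_total = n_model_tensors + n_lora_tensors;
    return std::max<uint32_t>(8192u, 8u * n_total);
}
```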
I could have just passed the tensor count into the `llama_context` constructor without much other noise, but that would make the mess even harder to untangle later. And there are more issues to it; for example, with LoRa adapters enabled I get: […]. That's why the "build graph with all lora-s active?" question was left in `llama_context::graph_reserve`.
So in the end I moved LoRa adapter initialization into `common_init_from_params` and explicitly bound the adapters' lifetime to the `common_init_result`: the original code implicitly stored adapters in `common_init_result`, but most handling was done via raw pointers stored in `common_params` instead, which was a timebomb waiting to explode. In particular, `server_context` makes unrestricted deep copies/assignments of `common_params`, so it was all messed up anyway (and that's also why ngxson left TODO comments in the code).
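A sketch of the lifetime binding, assuming an owning smart pointer built on the public `llama_adapter_lora_*` API; the `init_result`/`load_lora` names are illustrative, not the actual types in `common/`:

```cpp
// Hypothetical sketch: the init result owns the adapters; params structs keep no raw pointers.
#include <memory>
#include <string>
#include <vector>

#include "llama.h"

struct lora_deleter {
    void operator()(llama_adapter_lora * a) const { llama_adapter_lora_free(a); }
};
using lora_ptr = std::unique_ptr<llama_adapter_lora, lora_deleter>;

struct init_result {
    std::vector<lora_ptr> loras; // adapters live exactly as long as the init result
};

// Loading during init; model and error handling are simplified for brevity.
static void load_lora(init_result & res, llama_model * model, const std::string & path) {
    if (llama_adapter_lora * a = llama_adapter_lora_init(model, path.c_str())) {
        res.loras.emplace_back(a);
    }
}
```

With this shape, `server_context` can copy its parameters as much as it likes without ever owning, or later dangling, an adapter pointer.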
Another small thing: `-1.0f` is a valid adapter scale, i.e. an inverted adapter, but `server_context` treated negative scales as "adapter disabled", which is incorrect.
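For the scale handling, the fix amounts to treating only an exactly-zero scale as "disabled". A hedged sketch, with a hypothetical `active_lora` struct around the real `llama_set_adapter_lora` call:

```cpp
// Hypothetical sketch: only scale == 0.0f means "disabled"; negative scales
// such as -1.0f (an inverted adapter) are valid and must still be applied.
#include <vector>

#include "llama.h"

struct active_lora {
    llama_adapter_lora * adapter; // non-owning
    float                scale;
};

static void set_active_loras(llama_context * ctx, const std::vector<active_lora> & loras) {
    llama_clear_adapter_lora(ctx);
    for (const auto & la : loras) {
        if (la.scale == 0.0f) {
            continue; // disabled only when the scale is exactly zero
        }
        llama_set_adapter_lora(ctx, la.adapter, la.scale);
    }
}
```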