[BUG] Manual Model ID models/gemma-4-31b-it is overridden by gemini-3.1-pro causing 429 Quota Errors

### Problem

When manually entering a Model ID (e.g., models/gemma-4-31b-it) using the Google Gemini provider, the extension appears to accept the input in the UI but internally overrides the request to use gemini-3.1-pro. This results in 429 Resource Exhausted errors because the request is being billed against the Gemini 3.1 Pro quota instead of the Unlimited Gemma quota. Even when the UI shows the custom model, the internal environment details and the resulting API error confirm that gemini-3.1-pro is being called instead.


### Screenshot of the settings for RooCode inside VS Code
<img width="348" height="494" alt="Image" src="https://github.com/user-attachments/assets/187aacda-2b6b-42a8-bd26-217a65ee19f3" /> 



### Gemma 4 31 and 26 - it's in portuguese, "Ilimitado" means unlimited
<img width="1269" height="147" alt="Image" src="https://github.com/user-attachments/assets/6267d226-c5c0-4390-a9d7-e1fe33ff9ebf" />

### Context

### Comparison: 
The exact same API key and model ID (models/gemma-4-31b-it) work flawlessly in OpenCode (a CLI-based agent), proving the API key and Google AI Studio quotas are valid.
### Motivation: 
I am switching to Roo Code because OpenCode becomes slow, unresponsive, and crashes on large-scale projects. This bug is currently the only blocker preventing me from using Roo Code for high-capacity refactoring.
### Quota Status: 
My Google AI Studio dashboard confirms Unlimited TPM for Gemma 4 models, while the gemini-3.1-pro model is subject to standard free-tier limits.
### Suspected Cause: 
Roo Code likely employs a regex or hardcoded mapping for the "Google Gemini" provider that doesn't recognize the models/gemma- prefix. This causes the extension to either revert to a default Flash model or force the request into a gemini-3.1-pro bucket, ignoring the user's manual input.

### Reproduction steps

1. Open Roo Code Settings and set API Provider to Google Gemini.
2. Paste a valid Google AI Studio API key.
3. Manually type models/gemma-4-31b-it into the Model ID field.
4. Set Reasoning Effort to Medium.
5. Return to chat and send a message (e.g., "Help me refactor this project").
6. Observe that the Model ID in the header/settings often reverts to a default Gemini Flash model.
7. Check the error log to see a 429 error citing gemini-3.1-pro as the exhausted resource.

### Expected result

The extension should persist the manually entered Model ID and successfully route the request to the Gemma 4 endpoint at Google AI Studio.

### Actual result

The selection reverts to a default model. When it does attempt to call the API with the custom ID, it returns a 429 error claiming the quota for gemini-3.1-pro is exceeded (even when the requested model is Gemma).

### Variations tried (optional)

I have attempted to use the following Model ID formats to bypass the internal mapping, but all resulted in the same fallback behavior or 429 error:

    models/gemma-4-31b-it:free
    gemma-4-31b-it:free
    models/gemma-4-31b-it
    gemma-4-31b-it

In all cases, the extension fails to persist the custom ID and defaults the actual API call to a Gemini-branded model.

### App Version

3.52.1

### API Provider (optional)

Google Gemini

### Model Used (optional)

models/gemma-4-31b-it | models/gemma-4-31b-it:free

### Roo Code Task Links (optional)

_No response_

### Relevant logs or errors (optional)

```shell
{
  "error": {
    "timestamp": "2026-04-22T18:16:46.035Z",
    "version": "3.52.1",
    "provider": "gemini",
    "model": "models/gemma-4-31b-it:free",
    "details": "Gemini generate context stream error: {\"error\":{\"message\":\"{\\n  \\\"error\\\": {\\n    \\\"code\\\": 429,\\n    \\\"message\\\": \\\"... Quota exceeded for metric: ... model: gemini-3.1-pro ...\\\",\\n    \\\"status\\\": \\\"RESOURCE_EXHAUSTED\\\"\\n  }\\n}\",\"code\":429,\"status\":\"Too Many Requests\"}}"
  },
  "environment_details_snippet": {
    "role": "user",
    "content": [
      {
        "type": "text",
        "text": "... <model>gemini-3.1-pro-preview</model> ..."
      }
    ]
  }
}
Log Highlights for Devs:

    Line 6: Shows the user-selected model is models/gemma-4-31b-it:free.
    Line 7: The API error response explicitly mentions model: gemini-3.1-pro.
    Environment Injection: The extension is injecting <model>gemini-3.1-pro-preview</model> into the prompt context (visible in the full log history), which confirms the internal state is not respecting the manual model ID override.
```

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[BUG] Manual Model ID models/gemma-4-31b-it is overridden by gemini-3.1-pro causing 429 Quota Errors #12166

Problem

Screenshot of the settings for RooCode inside VS Code

Gemma 4 31 and 26 - it's in portuguese, "Ilimitado" means unlimited

Context

Comparison:

Motivation:

Quota Status:

Suspected Cause:

Reproduction steps

Expected result

Actual result

Variations tried (optional)

App Version

API Provider (optional)

Model Used (optional)

Roo Code Task Links (optional)

Relevant logs or errors (optional)

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

[BUG] Manual Model ID models/gemma-4-31b-it is overridden by gemini-3.1-pro causing 429 Quota Errors #12166

Description

Problem

Screenshot of the settings for RooCode inside VS Code

Gemma 4 31 and 26 - it's in portuguese, "Ilimitado" means unlimited

Context

Comparison:

Motivation:

Quota Status:

Suspected Cause:

Reproduction steps

Expected result

Actual result

Variations tried (optional)

App Version

API Provider (optional)

Model Used (optional)

Roo Code Task Links (optional)

Relevant logs or errors (optional)

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions