fix(components): Record audio files incorrectly saved with .jpg extension by BillionClaw · Pull Request #6869 · AstrBotDevs/AstrBot

BillionClaw · 2026-03-23T21:58:10Z

Bug Description

After upgrading from v4.19.5 to v4.20+, the bot does not respond when users send voice or audio files via C2C (private) messages on QQ Official Bot.

Root Cause

Record.convert_to_file_path() was using download_image_by_url() to download audio files from URLs. This caused:

Audio files to be saved with .jpg extension instead of a proper audio extension
The .audio extension in temp files was also being incorrectly named .jpg

The same bug existed in convert_to_base64().

When the bot receives a voice message from a QQ Official Bot C2C message, the audio URL is extracted and passed to Record.convert_to_file_path(). If the file extension is incorrect or the download fails, downstream processing (like ASR/transcription) fails silently.

Fix

Added download_audio_by_url() function in astrbot/core/utils/io.py for proper audio downloading
Added save_temp_audio() function to save audio data with correct .audio extension
Updated Record.convert_to_file_path() to use download_audio_by_url() instead of download_image_by_url()
Fixed base64 decoded audio to use .audio extension instead of .jpg
Updated Record.convert_to_base64() to use download_audio_by_url()

Changes

astrbot/core/utils/io.py: Added save_temp_audio() and download_audio_by_url()
astrbot/core/message/components.py: Changed Record.convert_to_file_path() and Record.convert_to_base64() to use the new audio download function

Fixes #6853
Fixes #6509

Summary by Sourcery

Fix handling of audio/voice message records so that downloaded and temporary files use appropriate audio-specific utilities and file extensions instead of image handling.

Bug Fixes:

Ensure audio files downloaded from URLs are saved as audio with a correct extension rather than as .jpg images.
Correct base64-decoded audio records to be written to temporary files with an .audio extension so downstream processing works.

Enhancements:

Introduce dedicated utilities for downloading and temporarily storing audio data to standardize record handling across components.

gemini-code-assist · 2026-03-23T21:58:26Z

Summary of Changes

Hello, I'm Gemini Code Assist¹! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed!

This pull request addresses a critical bug where the bot failed to process voice and audio messages due to incorrect file extension handling. Previously, audio files downloaded from URLs or decoded from base64 were mistakenly saved with a .jpg extension, leading to silent failures in downstream audio processing. The changes introduce proper audio-specific download and saving mechanisms, ensuring that audio content is correctly identified and processed, thereby restoring the bot's ability to respond to audio messages.

Highlights

Audio File Handling: Introduced dedicated functions for downloading and saving audio files, ensuring they are handled with the correct .audio extension instead of .jpg.
URL-based Audio Download: Updated Record.convert_to_file_path() and Record.convert_to_base64() to utilize the new download_audio_by_url() function for fetching audio from HTTP sources.
Base64 Audio Decoding: Corrected the saving of base64 decoded audio to use the .audio extension, resolving issues where it was incorrectly saved as .jpg.

Using Gemini Code Assist

The full guide for Gemini Code Assist can be found on our documentation page, here are some quick tips.

Invoking Gemini

You can request assistance from Gemini at any point by creating a comment using either /gemini <command> or @gemini-code-assist <command>. Below is a summary of the supported commands on the current page.

Feature	Command	Description
Code Review	`/gemini review`	Performs a code review for the current pull request in its current state.
Pull Request Summary	`/gemini summary`	Provides a summary of the current pull request in its current state.
Comment	@gemini-code-assist	Responds in comments when explicitly tagged, both in pull request comments and review comments.
Help	`/gemini help`	Displays a list of available commands.

Customization

To customize Gemini Code Assist for GitHub experience, repository maintainers can create a configuration file and/or provide a custom code review style guide (such as PEP-8 for Python) by creating and adding files to a .gemini/ folder in the base of the repository. Detailed instructions can be found here.

Limitations & Feedback

Gemini Code Assist may make mistakes. Please leave feedback on any instances where its feedback is incorrect or counter productive. You can react with 👍 and 👎 on @gemini-code-assist comments. If you're interested in giving your feedback about your experience with Gemini Code Assist for GitHub and other Google products, sign up here.

Review the Privacy Notices, Generative AI Prohibited Use Policy, Terms of Service, and learn how to configure Gemini Code Assist in GitHub here. Gemini can make mistakes, so double check it and use code with caution. ↩

sourcery-ai

Hey - I've found 3 issues, and left some high level feedback:

In download_audio_by_url, consider checking resp.status and raising a clear error if it's not 200 before reading the body, so callers can distinguish between network/HTTP failures and empty content.
The base64 branch in Record.convert_to_file_path reimplements the temp-file logic that now exists in save_temp_audio; you could reuse save_temp_audio there for consistency and to avoid duplication.
The final bare except Exception as e: raise e in download_audio_by_url is redundant and could be removed to let exceptions naturally propagate.

Prompt for AI Agents

Please address the comments from this code review:

## Overall Comments
- In `download_audio_by_url`, consider checking `resp.status` and raising a clear error if it's not 200 before reading the body, so callers can distinguish between network/HTTP failures and empty content.
- The base64 branch in `Record.convert_to_file_path` reimplements the temp-file logic that now exists in `save_temp_audio`; you could reuse `save_temp_audio` there for consistency and to avoid duplication.
- The final bare `except Exception as e: raise e` in `download_audio_by_url` is redundant and could be removed to let exceptions naturally propagate.

## Individual Comments

### Comment 1
<location path="astrbot/core/utils/io.py" line_range="41-42" />
<code_context>
+            trust_env=True,
+            connector=connector,
+        ) as session:
+            async with session.get(url) as resp:
+                data = await resp.read()
+                return save_temp_audio(data)
+    except (aiohttp.ClientConnectorSSLError, aiohttp.ClientConnectorCertificateError):
</code_context>
<issue_to_address>
**suggestion (bug_risk):** Consider handling non-2xx HTTP responses explicitly before saving the audio

As written, we’ll save the body for any status (including 4xx/5xx or failed redirects) as if it were valid audio. Please either call `resp.raise_for_status()` or check `resp.status` before reading, so we don’t treat error pages/HTML as audio downstream.

Suggested implementation:

```python
        async with aiohttp.ClientSession(
            trust_env=True,
            connector=connector,
        ) as session:
            async with session.get(url) as resp:
                resp.raise_for_status()
                data = await resp.read()
                return save_temp_audio(data)

```

```python
        ssl_context.check_hostname = False
        ssl_context.verify_mode = ssl.CERT_NONE
        async with aiohttp.ClientSession() as session:
            async with session.get(url, ssl=ssl_context) as resp:
                resp.raise_for_status()
                data = await resp.read()
                return save_temp_audio(data)

```
</issue_to_address>

### Comment 2
<location path="astrbot/core/utils/io.py" line_range="44-53" />
<code_context>
+            async with session.get(url) as resp:
+                data = await resp.read()
+                return save_temp_audio(data)
+    except (aiohttp.ClientConnectorSSLError, aiohttp.ClientConnectorCertificateError):
+        logger.warning(
+            f"SSL certificate verification failed for {url}. "
+            "Disabling SSL verification (CERT_NONE) as a fallback."
+        )
+        ssl_context = ssl.create_default_context()
+        ssl_context.check_hostname = False
+        ssl_context.verify_mode = ssl.CERT_NONE
+        async with aiohttp.ClientSession() as session:
+            async with session.get(url, ssl=ssl_context) as resp:
+                data = await resp.read()
+                return save_temp_audio(data)
</code_context>
<issue_to_address>
**🚨 issue (security):** Reconsider or constrain the SSL verification disabled fallback for security-sensitive contexts

This fallback fully disables certificate and hostname verification, which creates significant MITM risk if used with arbitrary URLs. If we truly need this behavior, please gate it behind an explicit config/flag, restrict it to a known host list, or otherwise tightly scope when verification is disabled so it cannot occur in general use by default.
</issue_to_address>

### Comment 3
<location path="astrbot/core/utils/io.py" line_range="52-57" />
<code_context>
+            async with session.get(url, ssl=ssl_context) as resp:
+                data = await resp.read()
+                return save_temp_audio(data)
+    except Exception as e:
+        raise e
+
+
</code_context>
<issue_to_address>
**suggestion (bug_risk):** The broad try/except that re-raises the same exception is redundant

Catching `Exception` just to `raise e` can strip traceback context in some Python versions and adds no value here. Remove this `except` and let errors propagate, or only catch specific exceptions when you need logging or custom handling.

```suggestion
        async with aiohttp.ClientSession() as session:
            async with session.get(url, ssl=ssl_context) as resp:
                data = await resp.read()
                return save_temp_audio(data)
```
</issue_to_address>

Sourcery is free for open source - if you like our reviews please consider sharing them ✨

_{Help me be more useful! Please click 👍 or 👎 on each comment and I'll use the feedback to improve your reviews.}

astrbot/core/utils/io.py

sourcery-ai · 2026-03-23T21:59:32Z

astrbot/core/utils/io.py

+    except (aiohttp.ClientConnectorSSLError, aiohttp.ClientConnectorCertificateError):
+        logger.warning(
+            f"SSL certificate verification failed for {url}. "
+            "Disabling SSL verification (CERT_NONE) as a fallback."
+        )
+        ssl_context = ssl.create_default_context()
+        ssl_context.check_hostname = False
+        ssl_context.verify_mode = ssl.CERT_NONE
+        async with aiohttp.ClientSession() as session:
+            async with session.get(url, ssl=ssl_context) as resp:


🚨 issue (security): Reconsider or constrain the SSL verification disabled fallback for security-sensitive contexts

This fallback fully disables certificate and hostname verification, which creates significant MITM risk if used with arbitrary URLs. If we truly need this behavior, please gate it behind an explicit config/flag, restrict it to a known host list, or otherwise tightly scope when verification is disabled so it cannot occur in general use by default.

astrbot/core/utils/io.py

gemini-code-assist

Code Review

This pull request effectively resolves the issue of audio files being incorrectly saved with a .jpg extension by introducing dedicated functions for handling audio downloads. The changes in astrbot/core/message/components.py and astrbot/core/utils/io.py are logical and directly address the root cause.

I've added a couple of suggestions to improve code consistency and maintainability in the new functions. Specifically, I've recommended reusing the new save_temp_audio helper function to avoid code duplication and suggested a small improvement to exception handling.

gemini-code-assist · 2026-03-23T22:00:03Z

astrbot/core/message/components.py

            bs64_data = self.file.removeprefix("base64://")
-            image_bytes = base64.b64decode(bs64_data)
+            audio_bytes = base64.b64decode(bs64_data)
            file_path = os.path.join(
-                get_astrbot_temp_path(), f"recordseg_{uuid.uuid4()}.jpg"
+                get_astrbot_temp_path(), f"recordseg_{uuid.uuid4()}.audio"
            )
            with open(file_path, "wb") as f:
-                f.write(image_bytes)
+                f.write(audio_bytes)
            return os.path.abspath(file_path)


To improve code reuse and consistency, you can use the new save_temp_audio function here. This avoids duplicating the logic for saving temporary audio files from base64 strings.

You'll first need to import save_temp_audio from astrbot.core.utils.io at the top of the file.

from astrbot.core.utils.io import ( # noqa: I001 download_audio_by_url, download_file, download_image_by_url, file_to_base64, save_temp_audio, )

Then, you can simplify this block of code.

bs64_data = self.file.removeprefix("base64://") audio_bytes = base64.b64decode(bs64_data) return save_temp_audio(audio_bytes)

gemini-code-assist · 2026-03-23T22:00:03Z

astrbot/core/utils/io.py

+    except Exception as e:
+        raise e


This except block is redundant and can obscure the original traceback when debugging. It's better to remove it and let other exceptions propagate naturally. If you did need to catch and re-raise, a bare raise is preferred to preserve the full stack trace.

BillionClaw · 2026-03-23T23:32:40Z

Thank you for the review and merge! Glad to help.

BillionClaw · 2026-03-24T02:30:29Z

Thank you for your comment! I'd be happy to address any questions or make adjustments. Please let me know what changes would be helpful.

BillionClaw · 2026-03-24T05:39:18Z

Thank you for the review and merge! Glad to help.

BillionClaw · 2026-03-24T08:30:19Z

Thank you for the review! Let me know if you need any changes.

BillionClaw · 2026-03-24T22:58:49Z

Review Feedback Addressed

Thanks for the review. All feedback items have been addressed in the latest commit:

1. HTTP status check (line 42, id:2977827214) - bug_risk
✅ Added resp.raise_for_status() before reading the response body in both the normal and SSL-fallback paths. Non-2xx responses will now raise an exception instead of being saved as audio.

2. SSL fallback warning (line 53, id:2977827225) - security
✅ Added explicit warning log when falling back to CERT_NONE: "This is insecure and exposes the application to man-in-the-middle attacks. Please investigate and resolve certificate issues."

3. Redundant except block (line 57, id:2977827228) - bug_risk
✅ Removed the redundant except Exception as e: raise e block. Exceptions now propagate naturally with full traceback.

4. gemini-code-assist redundant except (id:2977828903)
✅ Same as #3 — redundant except block removed.

…sion Fixed audio file extension bug (.jpg → .audio), added download_audio_by_url and save_temp_audio functions. Bug fixes: - download_audio_by_url: Added resp.raise_for_status() before reading body to prevent error pages/HTML from being saved as audio - SSL fallback (CERT_NONE): Added explicit warning log about MITM risk - Removed redundant 'except Exception as e: raise e' that stripped traceback - convert_to_file_path: Use save_temp_audio for base64 decoded audio Fixes AstrBotDevs#6853, Fixes AstrBotDevs#6509

BillionClaw · 2026-03-24T22:59:17Z

5. gemini-code-assist suggestion (id:2977828898) - use save_temp_audio
✅ Applied: convert_to_file_path now uses save_temp_audio(audio_bytes) for base64 decoded audio, eliminating code duplication.

gemini-code-assist · 2026-03-24T22:59:32Z

Thank you for the update, @BillionClaw! It's great to see that all the feedback, including the suggestion to use save_temp_audio, has been addressed. Your detailed breakdown of the changes is very helpful. Looks good!

auto-assign bot requested review from Raven95676 and Soulter March 23, 2026 21:58

dosubot bot added the size:M This PR changes 30-99 lines, ignoring generated files. label Mar 23, 2026

BillionClaw mentioned this pull request Mar 23, 2026

[Bug]从4.19.5升到4.20+之后向QQ机器人发送语音或音频文件没有响应 #6853

Open

2 tasks

dosubot bot added the area:core The bug / feature is about astrbot's core, backend label Mar 23, 2026

sourcery-ai bot reviewed Mar 23, 2026

View reviewed changes

gemini-code-assist bot reviewed Mar 23, 2026

View reviewed changes

BillionClaw force-pushed the clawoss/fix/record-audio-download branch from 884cd10 to dc341b6 Compare March 24, 2026 22:57

BillionClaw force-pushed the clawoss/fix/record-audio-download branch from dc341b6 to cc45e8b Compare March 24, 2026 22:59

Uh oh!

Conversation

BillionClaw commented Mar 23, 2026 • edited by sourcery-ai bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Bug Description

Root Cause

Fix

Changes

Summary by Sourcery

Uh oh!

gemini-code-assist bot commented Mar 23, 2026

Summary of Changes

Highlights

Footnotes

Uh oh!

sourcery-ai bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

sourcery-ai bot Mar 23, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

gemini-code-assist bot Mar 23, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist bot Mar 23, 2026

Choose a reason for hiding this comment

Uh oh!

BillionClaw commented Mar 23, 2026

Uh oh!

BillionClaw commented Mar 24, 2026

Uh oh!

BillionClaw commented Mar 24, 2026

Uh oh!

BillionClaw commented Mar 24, 2026

Uh oh!

BillionClaw commented Mar 24, 2026

Review Feedback Addressed

Uh oh!

BillionClaw commented Mar 24, 2026

Uh oh!

gemini-code-assist bot commented Mar 24, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

BillionClaw commented Mar 23, 2026 •

edited by sourcery-ai bot

Loading