fix: support YouTube short URLs and Shorts in YouTubeConverter #1779

Open
Jah-yee wants to merge 6 commits into microsoft:main from Jah-yee:fix/youtube-shorts-v3

@Jah-yee Jah-yee commented Apr 15, 2026

Good day,

I noticed that YouTubeConverter.accepts() only matches standard YouTube URLs starting with https://www.youtube.com/watch?. This causes short URLs like https://youtu.be/dQw4w9WgXcQ and https://www.youtube.com/shorts/<id> to be silently rejected, falling through to a generic HTML converter and producing noisy output.

This fix extends the accepts() method to also recognize:

  • youtu.be/<id> short URLs
  • youtube.com/shorts/<id> URLs
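
For illustration, the three URL shapes above could be matched roughly as follows. This is a minimal sketch, not the code from this PR: the helper name `looks_like_youtube_video` and the set of accepted hosts are assumptions, and the real `accepts()` method also receives stream metadata that is omitted here.

```python
from urllib.parse import parse_qs, urlparse

# Hypothetical host list; the converter itself may accept a different set.
YOUTUBE_HOSTS = {"www.youtube.com", "youtube.com"}


def looks_like_youtube_video(url: str) -> bool:
    """Return True for watch, youtu.be, and Shorts URLs (illustrative only)."""
    parsed = urlparse(url)
    if parsed.scheme not in ("http", "https"):
        return False

    host = (parsed.hostname or "").lower()
    path = parsed.path

    if host == "youtu.be":
        # Short link: the video ID is the path itself, so it must be non-empty.
        return bool(path.strip("/"))

    if host in YOUTUBE_HOSTS:
        if path == "/watch":
            # Standard link: require a non-empty "v" query parameter.
            return "v" in parse_qs(parsed.query)
        if path.startswith("/shorts/"):
            # Shorts link: require an ID after the /shorts/ prefix.
            return bool(path[len("/shorts/"):].strip("/"))

    return False
```

Parsing the URL instead of comparing string prefixes keeps the host and path checks separate, so a URL on another domain that merely contains `youtube.com/shorts/` in its path is not accepted by this sketch.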

Changes:

  • Modified packages/markitdown/src/markitdown/converters/_youtube_converter.py
  • Updated URL pattern matching in accepts() method

Testing:

Verified that the following URL patterns are now accepted:

  • https://www.youtube.com/watch?v=...
  • https://youtu.be/...
  • https://www.youtube.com/shorts/...

Thank you for your work on this project. I hope this small fix is helpful. Please let me know if there's anything to adjust.

Warmly, RoomWithOutRoof

OpenClaw AI and others added 6 commits March 11, 2026 23:40
- Add helper function _format_cell_value() to preserve currency symbols
- Support for USD ($), EUR (€), GBP (£), JPY (¥), and other currencies
- Support for percentage formatting
- Preserve decimal places from number format
- Use openpyxl directly instead of pandas for better format control

Fixes microsoft#53
- Changed [markitdown-mcp] to [markitdown_mcp] to match Python package naming convention

- Changed: markitdown-mcp -> markitdown_mcp in link text
- URL correctly points to packages/markitdown-mcp (directory name)
- Display text uses underscore to match Python package naming convention
Good day,

I noticed this file contained unfinished template placeholders that should be cleaned up before publishing the repo. This small fix removes the TODO heading and inserts helpful instructions for getting help through GitHub Discussions.

Thank you for your work on this project. I hope this small fix is helpful. Please let me know if there's anything to adjust.

Warmly, RoomWithOutRoof
- Accept youtu.be/<id> short URLs such as https://youtu.be/dQw4w9WgXcQ
- Accept youtube.com/shorts/<id> URLs
- These were previously silently rejected, causing noisy generic HTML output