Skip to content

Fall back to sitemap discovery when llms.txt is less than max links#39

Merged
dacharyc merged 1 commit intomainfrom
discovery-source-strategy
Apr 16, 2026
Merged

Fall back to sitemap discovery when llms.txt is less than max links#39
dacharyc merged 1 commit intomainfrom
discovery-source-strategy

Conversation

@dacharyc
Copy link
Copy Markdown
Member

Adds support for the issue described in #27 - when --max-links is greater than the number of links in llms.txt, fall back to the sitemap to discover additional documentation URLs. This accounts for cases where you want to sample a more representative/complete range of URLs than llms.txt alone provides.

This also spawned the investigation ticket #38 . We should consider that as a separate unit of work.

The scope of this PR was relatively small, but it added new test surface area that required adding new mocks across many test files, so the test scope makes it look larger than it actually is.

@dacharyc dacharyc merged commit 0282504 into main Apr 16, 2026
2 checks passed
@dacharyc dacharyc deleted the discovery-source-strategy branch April 16, 2026 13:23
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant