Formulate a framework for responsible LLM usage when coding, plus practical recommendations #21
spwoodcock wants to merge 23 commits into main
Conversation
Force-pushed 2a147ea to 6266350, then 6266350 to cf94368.
dakotabenjamin left a comment:
Thorough and well-researched. Many problems are identified, but the mitigations look weak in some places, and may fail to address the problem. I've noted a couple in my in-line comments.
We could also add some processes around the following to support some of the guidelines:
- measuring the impact of AI usage on maintainers (time spent vs. contributions?)
- for maintainers, checklist or clear factors for when to reject contributions
Thanks for taking the time to review @dakotabenjamin! I really appreciate your input as someone who cares about this a lot & you make some very valid points - I'll try to address them all 😄
Co-authored-by: DK Benjamin <dakota.benjamin@hotosm.org>
Page content to come.
We plan to research and recommend the best open models to use, as alternatives to proprietary services.
This could also be stated generically in the document above, e.g. "Try to prefer open-source models over proprietary ones".
It would be nice to provide some guidance though! It's easy to provide guidelines, but then someone gets stuck on what actual tools / models to use. We can probably provide some decent options after a little research
@spwoodcock, my understanding is this would be a recommendation only? I think just as our internal team is experimenting, other contributors might as well.
Yeah for sure, simply a recommendation. Users are free to use whatever models they like.
If one day there is an org / model that we really disagree with their methods, we could possibly make a 'banned models' list
spwoodcock left a comment:
I made a few fixes / updates 👍
@spwoodcock Do you think we should also mention code licensing? If AI is being used to write code from some other repo or elsewhere, we should try to check for a compatible license. I know licensing with AI agents is a kinda shady topic right now and it's very difficult to know, but I feel we should at least make the author aware! And also about code documentation, maybe we can add something like: try to document *why* changes were made rather than what every line does, which is self-explanatory (AI tends to do that).
Thanks for all the comments @kshitijrajsharma! I addressed your points and updated with suggestions 😃 For the point about licensing, there is a comment about this in "The LLVM Project's AI policy states it clearly: using AI tools to regenerate copyrighted material does not remove the copyright, and contributors remain responsible for ensuring nothing infringing enters their work [4]. The risk includes inadvertently incorporating copyrighted code or text into publicly released outputs." Do you think that is enough to cover it, or we should be more explicit somewhere, perhaps in
Thanks @mjvanderveen! (your input really helped to craft this) Things remaining to do:
Now the question of the hour: how do we identify LLM-generated code? I made a start on this here, but would love some input from anyone on the heuristics they have encountered to help do this, and potentially any tools they have tested to automate it 🙏
Force-pushed 77050c6 to 8e23500.
dakotabenjamin left a comment:
Would like to see more input from members of the team before giving explicit approval, but to me this is a great addition to the documentation. Once merged I'll also add it to HOT AI policy as reference.
I think you did great research Sam and provided a clear framework, and also incorporated most feedback, both from internal users and from our discussions with partners. I do suggest we first run it by our full tech team meeting before merging.
Adding an idea I read here that I really like: Good use for AI / LLMs:
Remaining things to update:
Once complete, we will discuss in our next team meeting, then merge once happy with it 👍
As I understand it, there are two axes of copyright risk with AI contributions. One is the regeneration of copyrighted material, and the other is the copyrightability of the AI portions of the contribution. I link to the US legal side of this concern, though as I understand it, the EU side is similar. In short, disclosure is critical for the author asserting copyright on contributions that include AI-written bits.
That's a really valuable document, thanks for sharing @smathermather ❤️

It's also one of the toughest points to assess, where there are no real remediation strategies possible. Either (1) get swept along with the crowd and hide from it, (2) view it as an acceptable risk and an ethical negative, weighed up against the ethical positives of the work we do, or (3) refuse to engage due to the concerns. Tough call that we need to discuss more and work out a way forward for! Open to any input or suggestions from people!

Also, it's been noted that the remediation strategies on the environmental front are a bit weak, which I agree with. Orgs could possibly do some back-of-the-envelope calcs for how much usage there might be from our own team, approximate the kWh usage, then donate this amount to effective charities and orgs in this space. This is obviously not an acceptable strategy for the whole world to engage in, but let's be real: most orgs don't care, so those that do aren't going to saturate the funding and effectiveness of these charities. I would promote https://www.effectiveenvironmentalism.org/climate-charities

Again, far from perfect, but it would go some way to acknowledging the problem and attempting to solve it in a roundabout way. Note I'm commenting entirely on my own behalf, and don't represent the views of HOT. I haven't sought approval to see if remediative donations are an option.
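To make the back-of-the-envelope idea concrete, here is a rough sketch in Python. Every constant (energy per request, request volume, team size, grid carbon intensity, offset price) is an illustrative assumption, not measured data, and is not the calculation referenced elsewhere in this thread:

```python
# Rough emissions-offset estimate for team LLM usage.
# ALL constants below are illustrative assumptions, not measurements.

WH_PER_REQUEST = 3.0             # assumed energy per LLM request (Wh)
REQUESTS_PER_DEV_PER_DAY = 200   # assumed heavy agentic usage
DEVS = 20                        # assumed team size
WORK_DAYS_PER_YEAR = 230
GRID_KG_CO2_PER_KWH = 0.4        # assumed average grid carbon intensity
OFFSET_USD_PER_TONNE = 25.0      # assumed donation rate per tonne CO2e

# Annual energy: Wh per request * requests * devs * days, converted to kWh
kwh_per_year = (WH_PER_REQUEST * REQUESTS_PER_DEV_PER_DAY
                * DEVS * WORK_DAYS_PER_YEAR) / 1000

# Convert kWh to tonnes of CO2e, then price the offset donation
tonnes_co2 = kwh_per_year * GRID_KG_CO2_PER_KWH / 1000
donation_usd = tonnes_co2 * OFFSET_USD_PER_TONNE

print(f"{kwh_per_year:.0f} kWh/year, {tonnes_co2:.2f} t CO2e, "
      f"${donation_usd:.2f} suggested donation")
```

Swapping in an org's own usage estimates and a current offset price is the whole exercise; the structure of the estimate matters more than these specific constants.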
One excellent, well-tested PR is worth more than ten AI-generated patches that each require maintainer effort to evaluate. Quality over quantity. Always.
### Prefer Existing Libraries
This section is mentioned above too. It could possibly be removed, or perhaps it's worth reiterating an important point.
- [ ] Error handling does not leak sensitive information
- [ ] No unnecessary permissions or access scopes
- [ ] SQL queries are parameterised (no string concatenation)
- [ ] File paths are sanitised against traversal attacks
Probably worth clarifying / linking somewhere for how to best do this
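For the two checklist items above, here is a minimal Python sketch of what parameterised queries and traversal-safe path handling look like in practice. The table, column names, and base directory are hypothetical:

```python
import os
import sqlite3

def find_user(conn: sqlite3.Connection, username: str):
    # Parameterised query: the driver escapes the value, so input like
    # "x' OR '1'='1" cannot alter the SQL. Never build SQL from user
    # input with f-strings or string concatenation.
    cur = conn.execute("SELECT id, name FROM users WHERE name = ?", (username,))
    return cur.fetchall()

def safe_path(base_dir: str, user_supplied: str) -> str:
    # Resolve the requested path, then verify it is still inside base_dir.
    # This rejects traversal attempts such as "../../etc/passwd".
    base = os.path.realpath(base_dir)
    target = os.path.realpath(os.path.join(base, user_supplied))
    if os.path.commonpath([base, target]) != base:
        raise ValueError(f"Path escapes base directory: {user_supplied!r}")
    return target

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE users (id INTEGER PRIMARY KEY, name TEXT)")
conn.execute("INSERT INTO users (name) VALUES ('alice')")

print(find_user(conn, "alice"))         # the matching row
print(find_user(conn, "x' OR '1'='1"))  # no rows: the injection is inert
```

A doc linking to OWASP's injection and path traversal guidance alongside a snippet like this would likely cover the "how to best do this" gap.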
This file is read by AI coding agents (Copilot, Claude Code, Cursor, etc.) when they work on your codebase. It tells the AI what your standards are, what's off-limits, and how to behave. Think of it as onboarding instructions - but for machines.
**What to include:**
Mention:
- MADR, link to section below
- Tech decisions or paths already explored and discounted - do not attempt these approaches
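To illustrate the kind of content such a file might hold (including the MADR and discounted-approaches points suggested above), here is a minimal hypothetical sketch. The commands, paths, and rules are invented placeholders, not the actual policy:

```markdown
# AGENTS.md

## Project standards
- Run the test suite before proposing any change; all tests must pass.
- Follow the existing code style; do not reformat unrelated files.

## Off-limits
- Do not modify database migrations or CI workflow files.
- Do not add new dependencies without an accompanying justification.

## Decisions already made (do not revisit)
- See the MADR records under `docs/decisions/` for approaches already
  explored and discounted - do not attempt these approaches again.
```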
AI tools must not be used to fix issues labelled `good first issue`.
These exist for human learning.
For full policy details, see: https://docs.hotosm.org/ai-assisted-coding
Change to a relative link
- **Code quality**: SonarQube Cloud is free for open source projects to use, assisting code quality and security compliance.
- **Dependency checking**: OWASP [DependencyCheck](https://github.com/dependency-check/DependencyCheck) or [OSV Scanner](https://github.com/google/osv-scanner) can be used to ensure dependencies are updated to avoid the latest security vulnerabilities. It's also recommended to use [Renovate bot](https://github.com/renovatebot/renovate) to regularly update dependencies.
- **Secrets scanning**: [GitLeaks](https://github.com/gitleaks/gitleaks) can be integrated as a pre-commit hook or CI action to prevent accidental commits of org secrets.
- **Licensing and copyright**: [ScanCode Toolkit](https://github.com/aboutcode-org/scancode-toolkit) can be used to scan for copyright breaches in your code and non-compliance with license requirements.
Let's test this one out & see how it performs!
**Key points for reviewers:**
- If a PR is marked AI-assisted, ask "why this approach?" - the answer tells you if the contributor understands the code.
- Watch for: verbose AI-style PR descriptions, generic variable names, unnecessary complexity, dependencies that seem unrelated.
Simply link to the section above instead of listing out the same signs of AI contribution
## Introduction
AI coding tools have moved from novelty to daily workflow in under two years. Andrej Karpathy coined the term "vibe coding" in early 2025 - describing developers who prompt AI, accept all suggestions, and barely read the output. By early 2026, he had already moved on, calling the practice outdated and advocating instead for "agentic engineering": careful, supervised AI-assisted development with full human oversight [1]. While early-2025 AI models were shown in some cases to have a net negative impact on developer productivity [21], models have improved significantly by early 2026, alongside growing efforts within open-source communities to establish appropriate governance and usage policies.
It might be too soon to have an authoritative source or paper on this, but once there is, it should be added!
Evidence is primarily anecdotal for now, speaking with devs in different orgs, observing the need for communities to catch up to the pace of model development & implement policies.
Sure, there is some hype as well, but where there is smoke, there is generally fire too (even if it's just smouldering embers for now...).
Watch this space for some actual hard stats
### 1.4 Labour and Exploitation
The refinement of AI models often relies on low-paid human labour for data labelling and content moderation, frequently in low- and middle-income economies. The training data itself was often collected without consent from its creators. Using these tools means participating in a supply chain with unresolved ethical questions about consent, compensation, and intellectual property [20].
It would be appreciated if someone has the time to research this one a little deeper. It's hard not to be implicated here, so we need to ensure the risks aren't too great.
Despite that, it's not the highest concern on the list for me personally. There are so many industries and practices globally that have a terrible human rights record. I would argue that long hours curating training data is low on the list of moral injustices out there (we need to put this in perspective of the potential good derived from the tools that we work on). But again, this hunch needs to be proven by hard data before it can be substantiated fully.
**Mitigation approaches:**
- Produce open-source software that partners can adopt freely.
- Advocate for and invest in open-source models that can run locally.
Related to the open models guidance page to be completed.
But we should also provide guidance on how to set up and use these tools in an easy way.
If there is an obvious usability gap (ideally identified through discussion with less tech literate community members - those dabbling with code solutions, who didn't work as software devs previously), we should definitely try to fill them!
I considered a wrapper of sorts to simply run Ollama. But honestly Ollama is pretty simple as it is, as attested to by @emi420. As mentioned, we should seek to identify pain points, and help in the best way we can
AI tools are demonstrably helpful when assisting someone who already understands the codebase and the broader technical landscape, but they are far less reliable as a substitute for that understanding.
**Guidance on appropriate AI use:**
Remove this and perhaps defer to the section in doc 2 that has more detail.
Some of this starts to get addressed above, but as I sent this via a side channel and Sam suggested an issue is fine: this is an opus. I appreciate both the thoroughness regarding the problem space, specific challenges, and possible remediations, and also the state of the art for responses across projects that have addressed LLM contributions explicitly in their covenants.

Overall, the biggest challenge I see is the remediation question: specifically the challenge of copyright (legal challenge); labour violations that underpin or are related to those copyright challenges (ethical challenge); jurisdictional challenges associated with concepts of fair use (possible legal challenges outside US legal frameworks); the existence of untainted, truly open models with a known corpus (legal, ethical, and digital sovereignty challenge); and the lack of any clear accounting / signal for decision making on the above with regard to environmental impacts.

These documents serve as a great framing and direction for use of transformer models that are built with consent, documentation of corpus, known licensing and labour practices, as well as resource use. IMO, any substantive ethical use of LLMs in dev work requires a list, and possibly the development, of such models, and constraint to allowed models with known provenance.
Getting there bit by bit! As promised, I did some back-of-the-envelope calcs to determine a reasonable emissions-offset donation for our LLM usage. It uses many assumptions and fudge factors, but overall suggests that energy usage at best could be reasonably low, and at worst (as with large models and regular heavy refactoring) is still manageable for now, although definitely not sustainable into the future.

The overall summary is that I think HOT should donate ~$300 to effective climate policy and advocacy charities (again, this is my opinion and I have not run it by anyone else yet...).

The last and most difficult things to address are the legal / ethical concerns raised by @smathermather about copyright infringement and the lack of truly 'open training data' models. We need to decide if:
I'll leave this question in the open for a bit, probably until next week, to allow anyone to comment - I would really love some additional feedback here, as I'm a single fallible human being who has many blind spots and is certainly prone to errors in judgement 😄

Comments from @LeenDhondt below:

@spwoodcock, option 2 is for me not an option. We need to learn how to navigate this new disruption in our space, not avoid it.
Thanks @LeenDhondt - agree with the need to navigate & not avoid 👍 To put the energy issue in context, I added this: the energy usage for our team is a drop in the ocean compared to travel emissions. I still think donations would be great, considering the distributed team and need to meet, but not as initially thought for LLM usage.
Also, to comment on the copyright issue. To summarise, US courts and the Copyright Office take the position that purely AI-generated outputs are not copyrightable, unless they have an element of human authorship (arrangement, substantive edits, creative transformation, etc). For software we care about:

In Cory Doctorow's blog post (from the FSF), he argues that model training is not infringement. Training involves copying for analysis and extracting statistical patterns, and copyright law has historically permitted analysis of copyrighted works. If we expand copyright to prohibit training, it would mostly benefit large rights-holders rather than individual creators. Cory is mostly discussing artistic work here, but for software it still applies when copyrighted code was used for training.

The first question remains legally unsettled, so this framework can't really comment. For the second point, we can meaningfully address it through the suggested mitigation methods: keep a human in the loop, require contributor disclosure, small PRs, and try our best to detect AI-generated content. All AI-generated content should be treated as untrusted and not committed blindly.

I would recommend reading the linked blog - it has some nice points. Also articulated in this video shared by Shoaib (for those that prefer):
Sharing what I'm experimenting on, in case you want to experiment yourself. Taking into account the recent events of mega-giant AI companies discussing whether their products will be used for autonomous killing systems and mass surveillance or not, but also in line with reducing dependency on paid and closed-source software, I've started to do more experiments with open and local models and tools.

Currently I'm testing OpenCode (which is similar to Claude Code) connected to a local Ollama instance serving the new qwen3.5 model. I'm also testing a new strategy, maybe some of you are already doing this. Instead of "chatting" with the model directly, I do the following:
The results look very promising with Qwen3.5. In theory this model offers better performance than Sonnet 4.5 (from Anthropic), but it's open (open weights) and it runs locally. Also, following this methodology, while it takes some time and in some cases it's easier to just write the code, it provides more control and better quality. I'll share about this in a doc when I have the time.

Note: running qwen3.5:27b on my system (chip: Apple M2 Max, memory: 32 GB) feels quite slow but it works; qwen3:8b works really fast. I have to test the new versions of qwen3.5 that are available in Ollama starting today.
I actually know very little about how this analysis applies outside the US, but it is important to note that it applies specifically to copyright in the US. Doctorow is partially right: under US law, training is (likely) not infringement. But fair use doesn't apply outside the US. I would be interested if anyone has an inkling of how e.g. EU infringement questions are likely to play out.
Yes I almost highlighted the reverse centaur in reference to this portion of the docs, though I think this accountability is important in the context of contributors, as with a FOSS project it's the only safe default. But for an org / corporate body, reverse centaur / responsibility laundering of LLMs is a concern, especially if / when LLM use is required or expected.
Very interesting. I'm looking forward to the open models list being populated.
Fixes #18
The problem
This PR
Please review, comment, and contradict whatever is written here as needed.
This is supposed to be a collaborative learning exercise to work on these difficult challenges together.
Disclaimer: Yes, the documents were initially synthesised from the linked references using Claude Opus 4.6 - text summarisation is where LLMs shine after all... The content has then been reviewed and edited by me, to give us a starting point. I added in a few additional perspectives from our ongoing calls with partner organisations too.
Notes
Based on documents:
https://docs.google.com/document/d/1F9C1aaE2CW9JmEmJlOCkuc9Lr_M-YybXXVyHlqDe_GY
https://docs.google.com/document/d/1M85SirgyyQrS33r4l4ta6JDWJg9OgO0BIVKBSoogVZE
https://docs.google.com/document/d/1uMT9EMd50NUCwRj5CTJg2ShQRWbxcI-U3g5oFFS6oMA