Add an LLM policy for rust-lang/rust#1040
Conversation
r? @jieyouxu

rustbot has assigned @jieyouxu.

@rustbot label T-libs T-compiler T-rustdoc T-bootstrap
## Summary
[summary]: #summary

This document establishes a policy for how LLMs can be used when contributing to `rust-lang/rust`. Subtrees, submodules, and dependencies from crates.io are not in scope. Other repositories in the `rust-lang` organization are not in scope.

This policy is intended to live in [Forge](https://forge.rust-lang.org/) as a living document, not as a dead RFC. It will be linked from `CONTRIBUTING.md` in rust-lang/rust as well as from the rustc- and std-dev-guides.

## Moderation guidelines

This PR is preceded by [an enormous amount of discussion on Zulip](https://rust-lang.zulipchat.com/#narrow/channel/588130-project-llm-policy). Almost every conceivable angle has been discussed to death; there have been upwards of 3000 messages, not even counting discussion on GitHub. We initially doubted whether we could reach consensus at all.

Therefore, we ask to bound the scope of this PR specifically to the policy itself. In particular, we mark several topics as out of scope below. We still consider these topics to be important; we simply do not believe this is the right place to discuss them.

No comment on this PR may mention the following topics:

- Long-term social or economic impact of LLMs
- The environmental impact of LLMs
- Anything to do with the copyright status of LLM output
- Moral judgements about people who use LLMs

We have asked the moderation team to help us enforce these rules.

## Feedback guidelines

We are aware that parts of this policy will make some people very unhappy. As you are reading, we ask you to consider the following.

- Can you think of a *concrete* improvement to the policy that addresses your concern? Consider:
  - Whether your change will make the policy harder to moderate
  - Whether your change will make it harder to come to a consensus
- Does your concern need to be addressed before merging, or can it be addressed in a follow-up?
- Keep in mind the cost of *not* creating a policy.
### If your concern is for yourself or for your team

- What are the *specific* parts of your workflow that will be disrupted?
  - In particular, we are *only* interested in workflows involving `rust-lang/rust`. Other repositories are not affected by this policy and are therefore not in scope.
- Can you live with the disruption? Is it worth blocking the policy over?

---

Previous versions of this document were discussed on Zulip, and we have made edits in response to suggestions there.

## Motivation
[motivation]: #motivation

- Many people find LLM-generated code and writing deeply unpleasant to read or review.
- Many people find LLMs to be a significant aid to learning and discovery.
- `rust-lang/rust` is currently dealing with a deluge of low-effort "slop" PRs primarily authored by LLMs.
- Having *a* policy makes these easier to moderate, without having to take every single instance on a case-by-case basis.

This policy is *not* intended as a debate over whether LLMs are a good or bad idea, nor over the long-term impact of LLMs. It is only intended to set out the future policy of `rust-lang/rust` itself.

## Drawbacks
[drawbacks]: #drawbacks

- This bans some valid usages of LLMs. We intentionally err on the side of banning too much rather than too little in order to make the policy easy to understand and moderate.
- This intentionally does not address the moral, social, and environmental impacts of LLMs. These topics have been extensively discussed on Zulip without reaching consensus, but this policy is relevant regardless of the outcome of those discussions.
- This intentionally does not attempt to set a project-wide policy. We have attempted to come to a consensus for upwards of a month without significant progress. We are cutting our losses so we can have *something* rather than ad hoc moderation decisions.
- This intentionally does not apply to subtrees of rust-lang/rust. We don't have the same moderation issues there, so we don't have the same time pressure to set a policy.

## Rationale and alternatives
[rationale-and-alternatives]: #rationale-and-alternatives

- We could create a project-wide policy, rather than scoping it to `rust-lang/rust`. This has the advantage that everyone knows what the policy is everywhere, and that it's easy to make things part of the mono-repo at a later date. It has the disadvantage that we think it is nigh-impossible to get everyone to agree. There are also reasons for teams to have different policies; for example, the standard for correctness is much higher within the compiler than within Clippy.
- We could have a more strict policy that removes the [threshold of originality](https://fsfe.org/news/2025/news-20250515-01.en.html) condition. This has the advantage that our policy becomes easier to moderate and understand. It has the disadvantage that it becomes easy for people to intend to follow the policy, but be put in a position where their only choices are to discard the PR altogether, rewrite it from scratch, or tell "white lies" about whether an LLM was involved.
- We could have a more strict policy that bans LLMs altogether. It seems unlikely we will be able to agree on this, and we believe attempting it would cause many people to leave the project.

## Prior art
[prior-art]: #prior-art

This prior art section is taken almost entirely from [Jane Lusby's summary of her research](rust-lang/leadership-council#273 (comment)), although we have taken the liberty of moving the Rust project's prior art to the top. We thank her for her help.
### Rust

- [Moderation team's spam policy](https://github.com/rust-lang/moderation-team/blob/main/policies/spam.md/#fully-or-partially-automated-contribs)
- [Compiler team's "burdensome PRs" policy](rust-lang/compiler-team#893)

### Other organizations

These are organized along a spectrum of AI friendliness, from least friendly at the top to most friendly at the bottom.

- Full ban
  - [postmarketOS](https://docs.postmarketos.org/policies-and-processes/development/ai-policy.html)
    - also explicitly bans encouraging others to use AI for solving problems related to postmarketOS
    - multi-point ethics-based rationale, with citations included
  - [zig](https://ziglang.org/code-of-conduct/)
    - philosophical; cites [Profession (novella)](https://en.wikipedia.org/wiki/Profession_(novella))
    - rooted in concerns around the construction and origins of original thought
  - [servo](https://book.servo.org/contributing/getting-started.html#ai-contributions)
    - more pragmatic; directly lists concerns around AI; fairly concise
  - [qemu](https://www.qemu.org/docs/master/devel/code-provenance.html#use-of-ai-content-generators)
    - pragmatic; focuses on copyright and licensing concerns
    - explicitly allows AI for exploring APIs, debugging, and other non-generative assistance; other policies do not explicitly ban this or mention it in any way
- Allowed with supervision; the human is ultimately responsible
  - [scipy](https://github.com/scipy/scipy/pull/24583/changes)
    - strict attribution policy, including the name of the model
  - [llvm](https://llvm.org/docs/AIToolPolicy.html)
  - [blender](https://devtalk.blender.org/t/ai-contributions-policy/44202)
  - [linux kernel](https://kernel.org/doc/html/next/process/coding-assistants.html)
    - quite concise, but otherwise seems the same as many in this category
  - [mesa](https://gitlab.freedesktop.org/mesa/mesa/-/blob/main/docs/submittingpatches.rst)
    - framed as a contribution policy rather than an AI policy; AI is listed as a tool that can be used, but the same requirement that the author must understand the code they contribute is emphasized; seems to leave room for partial understanding from new contributors

      > Understand the code you write at least well enough to be able to explain why your changes are beneficial to the project.
  - [forgejo](https://codeberg.org/forgejo/governance/src/branch/main/AIAgreement.md)
    - bans AI for review; does not explicitly require contributors to understand code generated by AI. One could interpret the "accountability for contribution lies with contributor even if AI is used" line as implying this requirement, though their version seems poorly worded imo.
  - [firefox](https://firefox-source-docs.mozilla.org/contributing/ai-coding.html)
  - [ghostty](https://github.com/ghostty-org/ghostty/blob/main/AI_POLICY.md)
    - pro-AI, but views "bad users" as the source of issues with it and the only reason for what ghostty considers a "strict AI policy"
  - [fedora](https://communityblog.fedoraproject.org/council-policy-proposal-policy-on-ai-assisted-contributions/)
    - clearly inspired, and is cited by, many of the above, but is definitely framed more pro-AI than the derived policies tend to be
  - [curl](https://curl.se/dev/contribute.html#on-ai-use-in-curl)
    - does not explicitly require that humans understand contributions; otherwise similar to the above policies
  - [linux foundation](https://www.linuxfoundation.org/legal/generative-ai)
    - encourages usage; focuses on legal liability; mentions that tooling exists to help automate managing legal liability, without naming specific tools
- In progress
  - NixOS - NixOS/nixpkgs#410741

## Unresolved questions
[unresolved-questions]: #unresolved-questions

See the "Moderation guidelines" and "Drawbacks" sections for a list of topics that are out of scope.
I really like this version, and thanks a ton for working on it. Specifically:
- It doesn't try to dump entire walls of text, which is unfortunately a good way to ensure nobody reads it. Instead, it gives concrete examples and a guiding rule of thumb for uncovered scenarios, and acknowledges upfront that it surely cannot be exhaustive.
- I also like where it points out the nuance and recognizes the uncertainties.
- I like that it covers both "producers" and "consumers" (with nuance that reviewers can also technically use LLMs in ways that are frustrating to the PR authors!)
I left a few suggestions / nits, but even without them this is still a very good start IMO.
(I will not leave an explicit approval until we establish wider consensus, which will likely take the form of a 4-team joint FCP.)
The links to Zulip are project-private, FWIW.

I'm aware. This PR is targeted towards Rust project members more so than the broad community.
| - Using machine-translation from your native language without posting your original message.
| Doing so can introduce new miscommunications that weren't there originally, and prevents someone who speaks the language from providing a better translation.
| - ℹ️ Posting both your original message and the translated version is always ok, but you must still disclose that machine-translation was used.
| - ℹ️ This policy also applies to non-LLM machine translations such as Google Translate.
I am pretty sure Google Translate uses LLMs right now, so rather than giving a specific example, I think this would be better reworded to say that this applies to all machine translation, whether or not you're sure an LLM is being used.
I think going into that much detail will just confuse people.
I think that this parenthetical obscures more than it clarifies. I would just cut it, or maybe go with "Using machine translation (including e.g. Google Translate)"
| ## Appendix

| ### No witch hunts
I don't have a concrete suggestion so I won't block on this, but personally, I've only used the term "witch hunt" colloquially and don't think it's a very good choice for a policy.
The term is loaded enough that people might overzealously avoid classifying what they're doing as a witch hunt because "I'm not being that extreme/rude about it", rather than the focus being that it's both a waste of time and hostile to the project to create an environment where people are constantly being questioned.
If I had to suggest an alternative, "Don't be a cop" is maybe a better way to word this, but again, I don't really have a good alternative and would rather not block on this.
Having read this section, I agree. It's not really about witch hunting. To me, the key distinction is that witch hunts involve an element of "gathering a mob", which is not mentioned at all here.
"It is not your job to play detective" feels like it captures the spirit better here.
| All contributions are your responsibility; you cannot place any blame on an LLM.
| - ℹ️ This includes when asking people to address review comments originally authored by an LLM. See "review bots" under ⚠️ above.

| ### "originally authored"
Personally, I think this would be better to put near the beginning instead of the end, since it is defining a term for use in the document. I don't think this is a pressing concern, just for flow reasons.
I want the beginning of the policy to start with what's banned and allowed, very clearly and plainly. I think "originally authored" is clear enough on its own that it's fine for it to come after the main text.
| Therefore, the guidelines are roughly as follows:

| > It's fine to use LLMs to answer questions, analyze, distill, refine, check, suggest, review. But not to **create**.
| > It's fine to use LLMs to answer questions, analyze, distill, refine, check, suggest, review. But not to **create**.
| > It's fine to use LLMs to answer questions, analyze, distill, refine, check, suggest, review. Do not use them to **create**.
Wording nit; I think that this is a bit clearer and more direct.
| - Writing dev-tools for your own personal use using an LLM, as long as you don't try to merge them into `rust-lang/rust`.
| - Using an LLM to discover bugs, as long as you personally verify the bug, write it up yourself, and disclose that an LLM was used.
| Please refer to [our guidelines for fuzzers](https://rustc-dev-guide.rust-lang.org/fuzzing.html#guidelines).
| - ℹ️ This also includes reviewers who use LLMs to discover bugs in unmerged code.
"Bugs" can be read narrowly. It might be useful to have a parenthetical like "bugs (or other flaws)".
Some of the most useful things to find as a reviewer are not true bugs: they're "questionable code duplication", "dodgy abstraction" or "easily fixed limitation".
| #### ⚠️ Allowed with caveats
| The following are decided on a case-by-case basis.
| Please avoid them where possible.
| In general, existing contributors will be treated more leniently here than new contributors.
This distinction needs a rationale to avoid feeling unfair.
| - Using an LLM as a "review bot" for PRs.
| - ℹ️ Review bots **must** have a separate GitHub account that marks them as an LLM. They **must not** post under a personal account.
| - ℹ️ Review bots that post without being approved by a maintainer will be banned.
| - ℹ️ If a linter already exists for the language you're writing, we strongly suggest using that linter instead of or in addition to the LLM.
| - ℹ️ If a linter already exists for the language you're writing, we strongly suggest using that linter instead of or in addition to the LLM.
| - ℹ️ LLM reviews should not be used for deterministic checks, such as formatting or linting. If such tools exist for your code base, we strongly suggest using and improving them.
"Deterministic checks" is a bit fuzzy, but that's the core idea.
You should not use the expensive, unreliable (and, to some, morally suspect) tool when you can do the job better another way, regardless of your feelings on using LLMs in general.
| - ℹ️ Review bots **must** have a separate GitHub account that marks them as an LLM. They **must not** post under a personal account.
| - ℹ️ Review bots that post without being approved by a maintainer will be banned.
| - ℹ️ If a linter already exists for the language you're writing, we strongly suggest using that linter instead of or in addition to the LLM.
| - ℹ️ Please keep in mind that it's easy for LLM reviews to have false positives or focus on trivialities. We suggest configuring it to the "least chatty" setting you can.
| - ℹ️ Please keep in mind that it's easy for LLM reviews to have false positives or focus on trivialities. We suggest configuring it to the "least chatty" setting you can.
| - ℹ️ Configure LLM review tools to reduce false positives and excessive focus on trivialities, as these are common, exhausting failure modes. Per-PR human guidance to provide context on the work and guide it towards the areas that need the most attention is generally more effective.
First half is just wording/flow changes. Second half is intended as useful advice, although it may not be particularly feasible WRT most "review bots" per se.
| - ✅ Allowed
| - ❌ Banned
| - ⚠️ Allowed with caveats. Must disclose that an LLM was used.
Do you want a bit of guidance about "which model was used"? Capabilities differ meaningfully, and this can be a useful signal for readers.
| If it's clear they've broken the rules, point them to this policy; if it's borderline, report it to the mods and move on.

| Conversely, lying about whether you've used an LLM is an instant [code of conduct](https://rust-lang.org/policies/code-of-conduct/) violation.
| If you are not sure where you fall in this policy, please talk to us.
| If you are not sure where you fall in this policy, please talk to us.
| If you are not sure where something you would like to do falls in this policy, please talk to us.
Behavior-language, not person-language.
| ### No witch hunts
| ["The optimal amount of fraud is not zero"](https://www.bitsaboutmoney.com/archive/optimal-amount-of-fraud/).
| Do not try to be the police for whether someone has used an LLM.
| If it's clear they've broken the rules, point them to this policy; if it's borderline, report it to the mods and move on.
| If it's clear they've broken the rules, point them to this policy; if it's borderline, report it to the mods and move on.
| If there is a problem that needs to be fixed (such as excessive bolding, undue hype or a stiff tone), address that directly, regardless of your private suspicions of its origin.
@alice-i-cecile I don't think calling out the specific examples is useful here, and it would add another point to reach consensus on. Also, this would directly instruct people to address that problem rather than the LLM usage, when it is also a perfectly reasonable option to just not engage, close the PR, or similar.
| ### Responsibility

| All contributions are your responsibility; you cannot place any blame on an LLM.
| All contributions are your responsibility; you cannot place any blame on an LLM.
| Your contributions are your responsibility; you cannot place any blame on LLMs that you have used.
Clarity / wording.
| All contributions are your responsibility; you cannot place any blame on an LLM.
| - ℹ️ This includes when asking people to address review comments originally authored by an LLM. See "review bots" under ⚠️ above.

| ### "originally authored"
| ### "originally authored"
| ### The meaning of "originally authored"
| ### "originally authored"
| ### On "original authorship"
| ### "originally authored"

| This document uses the phrase "originally authored" to mean "text that was generated by an LLM (and then possibly edited by a human)".
| No amount of editing can change authorship; authorship sets the initial style and it is very hard to change once it's set.
| No amount of editing can change authorship; authorship sets the initial style and it is very hard to change once it's set.
| Authorship sets the initial style and direction of a piece of work; later editing to alter that is both inefficient and imperfect.
I think this captures the essence of the argument without making claims that are likely to spark debate. "No amount of editing can..." is a very strong claim (see Ship of Theseus) and a weaker form is still completely sufficient to justify this policy.
| - Usages that use LLMs for creation or show LLM output to another human are likely banned ❌

| This policy is not set in stone.
| We can evolve it as we gain more experience working with LLMs.
| We can evolve it as we gain more experience working with LLMs.
| We can and likely will evolve it as we gain more experience working with LLMs.
| - ℹ️ Posting both your original message and the translated version is always ok, but you must still disclose that machine-translation was used.
| - ℹ️ This policy also applies to non-LLM machine translations such as Google Translate.
| - Using an LLM as a "review bot" for PRs.
| - ℹ️ Review bots **must** have a separate GitHub account that marks them as an LLM. They **must not** post under a personal account.
| - ℹ️ Review bots **must** have a separate GitHub account that marks them as an LLM. They **must not** post under a personal account.
| - ℹ️ Review bots **must** have a separate GitHub account that marks them as an LLM. You **must not** post (or allow a tool to post) LLM reviews verbatim on your personal account unless clearly quoted with your own personal interpretation of the bot's analysis.
Capturing some of the ideas from Zulip discussions on academic norms around quotation.
| please have them on-hand, and be available yourself to answer questions about your process.

| - Using an LLM to generate a solution to an issue, learning from its solution, and then rewriting it from scratch in your own style.
| - Using machine-translation from your native language without posting your original message.
I think that this could be cleaned up by moving "without posting your original message" down into the sub-bullet point.
Cut here, then make that point about how posting your original message can be very useful, especially if nuance is important.
If this change is adopted, I think that this belongs in the "allowed" category. Other items there have similar "here's some helpful advice" notes, and I worry that "using translations is kinda borderline" will be perceived as hostile by readers who do not have native-level English skills.
| Please avoid them where possible.
| In general, existing contributors will be treated more leniently here than new contributors.
| We may ask you for the original prompts or design documents that went into the LLM's output;
| please have them on-hand, and be available yourself to answer questions about your process.
| please have them on-hand, and be available yourself to answer questions about your process.
| please have them on-hand, and be available to answer questions about your process.
| > LLMs work best when used as a tool to write *better*, not *faster*.
| > LLMs work best when used as a tool to write *better*, not *faster*.
| > In `rust-lang/rust`, please do not use LLMs as a tool to write *faster*.
Having this as a high-level summary is offering a judgement on LLMs that feels like it isn't necessary for the policy, and makes consensus more difficult to reach. For anti-LLM folks it's saying that they work best when used to write "better", which is a point in dispute. I would also expect (but don't want to put words in people's mouths) that for pro-LLM folks the point that they don't work well when used to work faster may be in dispute.
I've tried to rephrase this in a fashion that, rather than expressing a general statement on when "LLMs work best", instead expresses what is desired *for rust-lang/rust*, as that's the scope of this policy.
| - ℹ️ This also applies to issue bodies and PR descriptions.
| - ℹ️ See also "machine-translation" in ⚠️ below.
| - Documentation that is originally authored by an LLM.
| - ℹ️ This includes non-trivial source comments, such as doc-comments or multiple paragraphs of non-doc-comments.
| - ℹ️ This includes non-trivial source comments, such as doc-comments or multiple paragraphs of non-doc-comments.
| - ℹ️ This includes *any* doc comments, or non-trivial source comments.
Reordering this to make it clear first and foremost that "Documentation" includes any doc comments, moving "non-trivial source comments" second. This also drops the quantitative "multiple paragraphs"; some multi-paragraph comments may be trivial, and some one-sentence comments may not be.
| - ℹ️ See also "machine-translation" in ⚠️ below.
| - Documentation that is originally authored by an LLM.
| - ℹ️ This includes non-trivial source comments, such as doc-comments or multiple paragraphs of non-doc-comments.
| - ℹ️ This includes compiler diagnostics.
| - ℹ️ This includes compiler diagnostics.
| - ℹ️ This includes compiler diagnostics or similar user-visible output.
| - Code changes that are originally authored by an LLM.
| - This does not include "trivial" changes that do not meet the [threshold of originality](https://fsfe.org/news/2025/news-20250515-01.en.html), which fall under ⚠️ below.
| We understand that while asking an LLM research questions it may, unprompted, suggest small changes where there really isn't another way to write it.
| However, you must still type out the changes yourself; you cannot give the LLM write access to your source code.
| However, you must still type out the changes yourself; you cannot give the LLM write access to your source code.
| However, you must still type out the changes yourself; you cannot give the LLM write access to your source code. This is not because we are trying to turn you into a typist, but because it is *very easy* to err in the direction of increasingly non-trivial changes if the LLM gets to directly write the code.
Trying to address potential reactions of "why are you making me re-type this?!".
| - This does not include "trivial" changes that do not meet the [threshold of originality](https://fsfe.org/news/2025/news-20250515-01.en.html), which fall under ⚠️ below.
| We understand that while asking an LLM research questions it may, unprompted, suggest small changes where there really isn't another way to write it.
| However, you must still type out the changes yourself; you cannot give the LLM write access to your source code.
| - We do not accept PRs made up solely of trivial changes.
| - We do not accept PRs made up solely of trivial changes.
| - We do not in general accept PRs made up *solely* of trivial changes, such as non-user-visible typos.
Trying to make sure people get an idea of what "trivial" means here; for instance, typos in user-visible documentation aren't trivial.
| - We do not accept PRs made up solely of trivial changes.
| See [the compiler team's typo fix policy](https://rustc-dev-guide.rust-lang.org/contributing.html#writing-documentation:~:text=Please%20notice%20that%20we%20don%E2%80%99t%20accept%20typography%2Fspellcheck%20fixes%20to%20internal%20documentation).
| - See also "learning from an LLM's solution" in ⚠️ below.
| - Treating an LLM review as a sufficient condition to merge a change.
| - Treating an LLM review as a sufficient condition to merge a change.
| - Treating an LLM review as a sufficient condition to merge a change, or to reject a change.
Rationale: as noted on the next line, LLM reviews must be advisory-only. Someone contributing should not be forced to care about the LLM review, unless a human who wants to deal with the LLM output evaluates it and posts a human-written review.
| - Treating an LLM review as a sufficient condition to merge a change.
| LLM reviews, if enabled by a team, **must** be advisory-only.
| Teams can have a policy that code can be merged without review, and they can have a policy that code must be reviewed by at least one person,
| but they may not have a policy that an LLM counts as a person.
| but they may not have a policy that an LLM counts as a person.
| but they may not have a policy that an LLM review substitutes for a human review.
| - Using machine-translation from your native language without posting your original message.
| Doing so can introduce new miscommunications that weren't there originally, and prevents someone who speaks the language from providing a better translation.
| - ℹ️ Posting both your original message and the translated version is always ok, but you must still disclose that machine-translation was used.
| - ℹ️ This policy also applies to non-LLM machine translations such as Google Translate.
| - ℹ️ This policy also applies to non-LLM machine translations such as Google Translate.
| - ℹ️ This policy applies to any machine translations, including Google Translate.
(This removes the assumption that no LLMs are involved in Google Translate; it's not clear if that's true today, and it may not be true in the future.)
| - ℹ️ Posting both your original message and the translated version is always ok, but you must still disclose that machine-translation was used.
| - ℹ️ This policy also applies to non-LLM machine translations such as Google Translate.
| - Using an LLM as a "review bot" for PRs.
| - ℹ️ Review bots **must** have a separate GitHub account that marks them as an LLM. They **must not** post under a personal account.
| - ℹ️ Review bots **must** have a separate GitHub account that marks them as an LLM. They **must not** post under a personal account.
| - ℹ️ Review bot accounts must be blockable by individual users via the standard GitHub user-blocking mechanism. (Some GitHub "app" accounts post comments that look like users but cannot be blocked.)
This one might be a controversial suggestion, but I think it's important. Claude and openhands-agent, for instance, are (currently) well-behaved here, and use an account that can be blocked. Copilot and Codex (OpenAI/ChatGPT) and some others are not. People should have the ability to opt out of interactions with such bots just as they can with users.
| ### "originally authored"

| This document uses the phrase "originally authored" to mean "text that was generated by an LLM (and then possibly edited by a human)".
| No amount of editing can change authorship; authorship sets the initial style and it is very hard to change once it's set.
| No amount of editing can change authorship; authorship sets the initial style and it is very hard to change once it's set.
| In the manner the phrase is used in this policy, no amount of editing changes how something was "originally authored"; authorship sets the initial style and it is very hard to change once it's set.
Taking a different approach here, of narrowing the focus to the phrasing in this policy, rather than trying to get people to agree with the fully general statement.
| @@ -0,0 +1,116 @@
| ## Policy
| ## Policy
| ## Interim LLM Usage Policy
Adding a title that mentions LLM usage, and flagging this as interim to foreshadow the section at the end noting that policies may evolve.
I am hopeful that this captures a sentiment shared both by people who want the policy to be stricter and by people who want the policy to be less strict.