AI commit messages are low quality

Commit messages should focus on explaining _why_ a change was made, and _what_ the outcome is, and not really worry about the details of _how_ it was implemented. (Much has [been written](https://cbea.ms/git-commit/#why-not-how) [about this](https://dhwthompson.com/2019/my-favourite-git-commit).) AI generated messages don't do this very well:

- They never seem to explain the _why_ behind the change. (Not a surprise, this information is usually only present in the mind of the committer!)
- At the same time, they often include pointless detail about _how_ the change was made.
- The writing style is often "fluffy" i.e. low signal to noise ratio.
- I also found one case where the message appears to have been generated off the system prompt and not the actual commit contents???? https://github.com/gitbutlerapp/gitbutler/commit/770b0c4e9d6447903c166dbeac310b406f9fe255

To demonstrate this I scrolled through this repo's commit logs looking for seemingly AI generated messages that demonstrate this problem

https://github.com/gitbutlerapp/gitbutler/commit/e0b93f22ecb5757162c5604fab44016b64dd4fd4
> feat: handle null newCommit in commit creation process
> 
> Checks for a null newCommit value and throws an error if the commit 
> creation fails. Updates the commit outcome type to allow null values 
> for newCommit. Ensures the UI state updates only when a valid commit 
> ID is present, improving error handling and user feedback during 
> commit operations.
- missing any information about why this was happening and worth fixing
- yet several sentences of detail on the details of the code change
- "improving error handling and user feedback during commit operations" low SNR. feels like an attempt to explain why, except that information was just not in the context window so the model made something up.

https://github.com/gitbutlerapp/gitbutler/commit/6c0351239f1b8edfb56d360b3481f56b22c6ea94
> feat(state): add mapReduxResult to transform query result data
> 
> Introduce mapReduxResult helper to transform the `data` property of a 
> successful Redux query result using a provided function. This enables 
> safe and reusable data transformations while preserving the original 
> query result structure and status.
- again, no explanation of why
- "This enables safe and reusable data transformations while preserving the original query result structure and status." no useful information here. delete this sentence

https://github.com/gitbutlerapp/gitbutler/commit/c49525cf68f67933f52142eb623c0775a2a6f7a4
> feat: refactor upstream status fetching and stack retrieval
>
> Add fetchStacks method to stackService for direct stack fetching with
> transformation. Refactor upstreamStatuses in upstreamIntegrationService
> to be async and fetch data directly instead of using reactive queries.
> Update IntegrateUpstreamModal to handle async upstream status fetching
> and remove reactive derived state. These changes improve data fetching
> clarity and simplify status handling logic.
- same complaint - too detailed and also not detailed enough
- "These changes improve data fetching clarity and simplify status handling logic." again, hallucinated _why_

---

I would conclude that the idea of fully automatic commit message generation is just not a good one. Fundamentally, a large amount of the information that should go in commit messages (why? what is the outcome? why not some other approach?) is just not in the final diff. Half the point of writing a commit message is to create a record of what the author was thinking, and that's just not possible to automate. 

That said, many of the human written commit messages are also not very good! Reading through the messages in this repo made me realize there's an opportunity to have the LLM and developer work together to create real improvement over just human-written messages. Ask the LLM to check whether the commit message seems to be missing useful information, and if so generate a question for the developer to clarify the context. Then after maybe a few of these, take all the answers and generate a message based on that. (In my own uses of LLMs, I've found their real value to be as motivation, https://xkcd.com/386/ style :P)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

AI commit messages are low quality #8755

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

AI commit messages are low quality #8755

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions