GitHub: General, Ansibullbot & Scaling issues #357

gundalow · 2018-09-20T10:54:02Z

I should be able to make contact with people in the platform/API developer experience team(s) at GitHub. Need to think about what to ask them.

Issues with scaling Ansibullbot
- Was chatting internally about this. One of the things we've been told before is to move to hook/event-based bots. Though that doesn't work for us:
  1. Hooks are not reliable
  2. The collector has to be able to process them quickly enough
  3. Ansibullbot takes a while to pull in all the relevant data to calculate state for a given issue/PR
  4. There isn't a hook sent when (say) a Shippable CI job is restarted - we have to poll
  5. None of our "stale" processes work in a purely hook-based flow either
- Do they have any other advice for dealing with large backlogs?
- Comments on rebasing, resolving conflicts - can't use the UI as it adds $something
- Let them know the issues we have to work around
- What have similarly sized groups done?
- Can we set branch protections and user permissions at the GH Org level, rather than per repo?

This is just a place so we can collect ideas

mscherer · 2018-09-20T15:22:16Z

So, on the hook stuff, I think there is some workload that could be moved there. For example, the one that reminds people we have a SIG: it doesn't change much if we lose it. I also wonder if we can get a hybrid approach, e.g. have a hook-based workflow and then run the bot less often, or something like this?

mattclay · 2018-09-20T16:50:07Z

To make webhook handling more reliable, we could use a Lambda function to receive the webhooks and drop them into an SQS queue. The bot could consume the events from that SQS queue instead of needing to receive them directly.
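A minimal sketch of the receiving side of that idea, assuming the webhook is configured with a shared secret. The function names and the `enqueue` callable are hypothetical; `enqueue` stands in for whatever actually writes to the queue (for example a thin wrapper around an SQS send call). The receiver only validates the signature and forwards the raw event, so it stays fast and leaves the slow work to the bot.

```python
import hashlib
import hmac
import json
from typing import Callable, Mapping


def verify_signature(secret: bytes, body: bytes, signature_header: str) -> bool:
    """Check the X-Hub-Signature-256 header (HMAC-SHA256 of the raw body)."""
    expected = "sha256=" + hmac.new(secret, body, hashlib.sha256).hexdigest()
    # Compare as bytes in constant time to avoid leaking timing information.
    return hmac.compare_digest(expected.encode(), (signature_header or "").encode())


def handle_webhook(
    headers: Mapping[str, str],
    body: bytes,
    secret: bytes,
    enqueue: Callable[[str], None],
) -> int:
    """Validate one delivery and hand it to the queue. Returns an HTTP status."""
    # Header names are case-insensitive, so normalise them before lookup.
    lowered = {key.lower(): value for key, value in headers.items()}
    if not verify_signature(secret, body, lowered.get("x-hub-signature-256", "")):
        return 401
    message = json.dumps(
        {
            "delivery": lowered.get("x-github-delivery"),
            "event": lowered.get("x-github-event"),
            "payload": json.loads(body),
        }
    )
    enqueue(message)  # hypothetical hook: e.g. a wrapper that sends to SQS
    return 202
```

The bot would then read messages from the queue at its own pace, so a slow processing run no longer causes dropped deliveries. This does not help with events for which no webhook is sent at all.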

gundalow · 2018-09-20T19:09:52Z

From Tanner

Rate limiting
Real fields or components - selection box/drop down rather than free field

webknjaz · 2018-09-20T21:43:18Z

Ref https://platform.github.community/c/integrations

webknjaz · 2018-09-20T21:46:08Z

> Though that doesn't work for us

Python org bots fix this by adding delays after receiving hooks and then doing some API queries
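One way to read "delay, then query" is a small debounce table: each hook only records that an issue needs a refresh, and the actual API queries run once the delay has passed. This is an illustrative sketch, not how any particular bot is implemented; the `DelayedRefresh` class and its method names are made up for this example.

```python
import time
from typing import Dict, List, Optional


class DelayedRefresh:
    """Remember which issues/PRs need a refresh and when they become due."""

    def __init__(self, delay_seconds: float) -> None:
        self.delay = delay_seconds
        self._due: Dict[int, float] = {}

    def on_hook(self, issue_number: int, now: Optional[float] = None) -> None:
        """Record a hook; a newer hook for the same issue pushes the due time back."""
        now = time.monotonic() if now is None else now
        self._due[issue_number] = now + self.delay

    def pop_ready(self, now: Optional[float] = None) -> List[int]:
        """Return (and forget) the issue numbers whose delay has elapsed."""
        now = time.monotonic() if now is None else now
        ready = sorted(n for n, due in self._due.items() if due <= now)
        for number in ready:
            del self._due[number]
        return ready
```

The bot loop would call `pop_ready()` periodically and run its usual API queries only for the returned numbers, which also collapses bursts of hooks for the same PR into a single refresh.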

webknjaz · 2018-09-20T21:47:07Z

Also, GitHub Apps integrations have a UI for checking on all the events they've sent to the integration along with a response from the service.

sivel · 2018-09-20T21:53:49Z

> > Though that doesn't work for us
>
> Python org bots fix this by adding delays after receiving hooks and then doing some API queries

IIRC the problems were that even receiving the hook was unreliable, and we couldn't guarantee we would ever receive it

webknjaz · 2018-09-20T22:33:20Z

Looks like an infra problem

webknjaz · 2018-09-21T19:05:36Z

@gundalow tell GH that we're also unhappy with the "GitHub Releases" feature not allowing us to hide links to the autogenerated git-archive sources when we override them with our proper assets.

jctanner · 2019-01-10T14:38:15Z

> Looks like an infra problem

Sometimes. It's also due to not every event triggering a webhook, such as Shippable job re-runs or job completions.

gundalow · 2019-01-10T14:40:31Z

Assign to/request reviews from people that don't have commit access
Notifying over 50 people (see "added aix_nimclient module", ansible#50760 (comment))

I have a meeting with one of the Product Managers at GitHub early next week, so if anyone has other comments please do add them.

pabelanger · 2019-01-10T14:46:19Z

Allow a contributor to set "approved" on their own code review. This is impossible today and forces the creation of a second account to set the approval.

gundalow · 2019-01-10T14:48:16Z

Forms rather than freeform text for issue and PR creation to replace ISSUE TEMPLATES or PR TEMPLATES

GregSutcliffe · 2019-01-10T14:55:31Z

There's some kind of issue with the v3 API when requesting a large amount of PR information - I was doing some indexing of data, and requesting anything larger than "all PRs from May 1st 2018 to now" resulted in errors. Details here, and it seems to affect other large repos (such as Kubernetes) too.

pilou- · 2019-01-10T14:59:56Z

Allow people who don't have write access to the ansible/ansible repository (GitHub contributors/maintainers in BOTMETA) to edit PRs.

Reset all GitHub approvals when a PR is updated.

webknjaz · 2019-01-10T15:05:20Z

@pilou-

Editing PRs is probably too risky and it's ACL control after all. I don't think it's possible.

Approvals reset can be configured on protected branches in GH settings.

webknjaz · 2019-01-10T15:06:11Z

@GregSutcliffe anyway, I think it's better to request chunks rather than the whole DB...
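A minimal sketch of what "requesting chunks" could look like for date-based indexing: split the overall range into fixed-size windows and issue one search per window. The helper names and the 30-day window are assumptions for illustration; `created:YYYY-MM-DD..YYYY-MM-DD` is the date-range qualifier GitHub search accepts. A window should be small enough that each query stays under the search API's cap of 1000 results.

```python
from datetime import date, timedelta
from typing import Iterator, Tuple


def date_windows(start: date, end: date, days: int = 30) -> Iterator[Tuple[date, date]]:
    """Yield inclusive (first_day, last_day) windows covering start..end."""
    if days < 1:
        raise ValueError("days must be >= 1")
    cursor = start
    while cursor <= end:
        last = min(cursor + timedelta(days=days - 1), end)
        yield cursor, last
        cursor = last + timedelta(days=1)


def pr_search_query(repo: str, first: date, last: date) -> str:
    """Build a search query string for PRs created inside one window."""
    return f"repo:{repo} is:pr created:{first.isoformat()}..{last.isoformat()}"
```

For example, `[pr_search_query("ansible/ansible", a, b) for a, b in date_windows(date(2018, 5, 1), date(2019, 1, 10))]` yields one query per 30-day slice, and each slice can be paged through separately.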

dagwieers · 2019-01-10T15:11:32Z

~~How about an API to hide comments so we can make ansibot hide outdated or resolved comments. That would solve for me the biggest annoyance of ansibot (not cleaning up after himself).~~

~~As a result I am hiding irrelevant stuff in issues and PRs just to get a higher signal-to-noise and avoid the wall of text in some of the PRs.~~

This is now live:
You can now hide comments via GitHub's API:

ReportedContentClassifiers = OUTDATED https://developer.github.com/v4/enum/reportedcontentclassifiers/
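As a rough sketch of how a bot could use this, the GraphQL `minimizeComment` mutation takes the comment's node ID and a `ReportedContentClassifiers` value. The helper below only builds the request body; the function name is hypothetical, and sending it (a POST to the GraphQL endpoint with a token that is allowed to minimize the comment) is left out.

```python
import json

# GraphQL mutation that hides (minimizes) a comment with a given classifier.
MINIMIZE_COMMENT = """
mutation($id: ID!, $classifier: ReportedContentClassifiers!) {
  minimizeComment(input: {subjectId: $id, classifier: $classifier}) {
    minimizedComment { isMinimized minimizedReason }
  }
}
"""


def build_minimize_request(comment_node_id: str, classifier: str = "OUTDATED") -> bytes:
    """Return the JSON body for a GraphQL request that minimizes one comment."""
    body = {
        "query": MINIMIZE_COMMENT,
        "variables": {"id": comment_node_id, "classifier": classifier},
    }
    return json.dumps(body).encode("utf-8")
```

The bot would call this for each of its own superseded comments, using the `node_id` it already gets back from the REST API when listing comments.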

webknjaz · 2019-01-10T15:12:59Z

~~+1 I also do that, it's sad that there's no API endpoint for hiding comments~~

This is now live:
You can now hide comments via GitHub's API:

ReportedContentClassifiers = OUTDATED https://developer.github.com/v4/enum/reportedcontentclassifiers/

pilou- · 2019-01-10T15:14:05Z

> Editing PRs is probably too risky and it's ACL control after all. I don't think it's possible.

Then it becomes "add ACL control".

> Approvals reset can be configured on protected branches in GH settings.

It would be useful for unprotected branches too.

dagwieers · 2019-01-10T15:16:48Z

> Reset all GitHub approvals when a PR is updated.

I do not agree with this, there are many situations where an update to a PR should not reset approvals. An important one is a (required) rebase, but also smaller changes should not affect a prior approval. This really depends on the interpretation of an approval as well as the changes being made.

We would be shooting ourselves in the foot if we would reset approvals IMO. And it definitely does not improve Contributor Experience.

dagwieers · 2019-01-10T15:19:17Z

Allow quick GitHub edits by people with commit rights. Currently that means we either edit the branch directly or create a branch in the official project. I want to have the same option as users with read access: create a branch in my personal space.

dagwieers · 2019-01-10T15:22:04Z

Add more GitHub reactions, like one to indicate the commenter should have used GitHub reactions instead of adding a comment to approve a previous comment :-P

Maybe also a GitHub reaction to indicate a commenter should have read the fine documentation?

Or a GitHub reaction to indicate you do not think it is a priority (rather than +1 and -1).

webknjaz · 2019-01-10T15:27:18Z

Regarding the new Checks API: it doesn't affect us much yet, but it will.
Checks listed at the bottom of PRs now add an extra click: they redirect you to the Check Suite details page, whereas people usually care about which of the Check Runs are failing.
This could be solved with a hovercard (like ones when you hover users' avatars) listing all check runs and maybe having a button next to each of them redirecting directly to that third-party service.

OTOH... It might be Travis CI misusing Checks API and dumping all jobs info to one page...

dagwieers · 2019-01-10T15:30:45Z

Automatically link users, issues and PRs in the Wiki (like how they work in comments).
Now I need to create individual links which is a drag.

alikins · 2019-01-10T16:49:20Z

One thing I'd like is a sidebar box with a list of other issues and PRs linked to the current issue/PR. Ideally 'above the fold' on the page. For example, ansible/ansible#31751 is referenced by 3 other issues, and references another issue and a project. But you have to read through all the comments to find that. Those references are easy to miss since they unconsciously get lumped in with the usually noisy issue event entries.

Hopefully that would help cut down on duplicate issues. And make the issue cross references more valuable.

And if pull requests that reference an issue were highlighted more, it might increase the number of folks looking at or trying a pull request, which can help it get resolved quicker.

A bonus step would be a page that summarizes the clumps of issues/prs that reference each other.
Maybe the N largest clumps/subgraphs that likely point to high value systemic flaws to address.

bcoca · 2019-01-10T16:49:34Z

autolock issues older than X: we plan on adding this to the bot since those get ignored anyways, but it would prevent users from uselessly posting and feeling frustrated due to lack of answers on old closed tickets
fine-grained commit access (subdirs): we do this with bot/shipit/botmeta .. but it would be much nicer as a GH feature
real forms (3rd mention, I know, but dropdowns/checkboxes would help soooo much).
a way to move a comment into its own ticket: cause no one ever posts unrelated issues on a ticket ...
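For the autolock item, the selection step a bot would need is small. The sketch below assumes issue dictionaries shaped like the REST API's issue objects (`number`, `state`, `locked`, `closed_at`); the function names and the age threshold are made up for illustration. The lock itself is a `PUT` to the issue's `/lock` endpoint, which is not performed here.

```python
from datetime import datetime, timedelta, timezone
from typing import Any, Iterable, List, Mapping, Optional


def issues_to_lock(
    issues: Iterable[Mapping[str, Any]],
    max_age_days: int,
    now: Optional[datetime] = None,
) -> List[int]:
    """Return numbers of closed, unlocked issues closed more than max_age_days ago."""
    now = now or datetime.now(timezone.utc)
    cutoff = now - timedelta(days=max_age_days)
    stale = []
    for issue in issues:
        closed_at = issue.get("closed_at")
        if issue.get("state") != "closed" or issue.get("locked") or not closed_at:
            continue
        # Timestamps look like 2018-09-20T10:54:02Z and are always UTC.
        closed = datetime.strptime(closed_at, "%Y-%m-%dT%H:%M:%SZ").replace(tzinfo=timezone.utc)
        if closed < cutoff:
            stale.append(issue["number"])
    return stale


def lock_path(repo: str, number: int) -> str:
    """REST path to PUT in order to lock one issue."""
    return f"/repos/{repo}/issues/{number}/lock"
```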

alikins · 2019-01-10T17:17:12Z

I would like to see a version of the 'Commits' log view that includes the full commit message. Both for the main repo view (https://github.com/ansible/ansible/commits/devel for example), and for the commits list in a PR. Even better would be a view with a 'git log --stat' equivalent.

Would make it easier for users to scan the commit history when looking for related changes.

For someone reviewing a PR it would help surface info (the full commit messages) which is tedious to view until you go to squash and merge and see them concatenated together and have to manipulate them into a coherent single commit message.

Related: show more of the first line of the commit message, even if it is longer than 50 characters and has to be line-wrapped, at least up to some reasonable limit (500 chars? 1000?). The current display for the first/summary line often reads like a mystery with the last page ripped out. Anything to encourage better and more detailed commit messages would be useful to project users and developers.

alikins · 2019-01-10T17:37:13Z

Support finding pull requests that affect particular files. Possibly as a pr search field.
A file name oriented list/view of prs (something like https://ansible.sivel.net/pr/byfile.html ) is very
useful for project developers and users. Particularly for something like ansible where a given person
may only be interested in a particular module or plugin.

For pull requests, that data should exist.

It would be much much more complicated to be able to search issues by file though it would be kind of amazing. It probably makes sense for the issue->files info to be determined by something tailored to the project via some integration point.

Ansible does issue->file mapping to some degree with ansibot (ansible/ansible#50509 (comment) for ex).

But it is not easy for users to make use of that data. An example use for ansible is a user who is seeing problems with the 'at' module. The options for finding existing issue reports in that case are very slim. An issue search for 'at module' finds 1209 issues, mostly spurious. A search for 'lib/ansible/modules/system/at.py' or 'at.py' finds two relevant open issues.

dagwieers · 2019-01-10T18:15:16Z

@pilou- We can't use the GitHub review system anyway, because GitHub does not have the same workflows or knowledge as ansibot does. GitHub does not know who a maintainer is, or how many shipits work for a given PR. Obviously being able to do everything we do in ansibot in GitHub could be useful, but singling out a specific feature will not help (instead it could make it worse if it lacks the holistic view).

pilou- · 2019-01-10T18:48:22Z

@dagwieers Being able to use GitHub ~~review system~~ approval status would make Ansibot simpler.

felixfontein · 2019-01-15T20:29:04Z

In the diff view of a PR, longer sections between two diff positions are collapsed. When clicking the expand button, lines from below the expand button are added. If one is interested in the lines directly below the diff hunk that sits above the expand button, one has to click the expand button many times (depending on the distance to the next diff). That's very annoying.

Also, sometimes one wants to get rid of all collapsed positions to see the diff of the complete file, for example to see how a PR interacts with the whole file. At the moment, this involves a lot of manual clicking. An "expand all" button per file (or even per PR) would improve this situation a lot!

webknjaz · 2019-01-17T19:53:07Z

This question needs to be addressed for reliability improvements in bots:

Long story short, they provide an API to query events, but there's no way to match those with events received via webhooks because the two use different kinds of identifiers. I can see the same events in the UI, but there's no way to connect them 😞

I think that the solution is extremely simple on their side: just add another field to the data in the API or to the headers in their webhook events.

dagwieers · 2019-03-28T13:03:27Z

It would be nice if you can easily go from a PR to the author's branch. Currently that branch is listed at the top as:

dagwieers wants to merge 2 commits into ansible:devel from dagwieers:file-locking

But it would be more convenient if you could click on dagwieers:file-locking to go to that branch (i.e. for testing/downloading that branch).

webknjaz · 2019-03-31T13:17:41Z

@dagwieers that's actually what GitHub implemented a few days ago 👍

webknjaz · 2019-04-02T07:23:08Z

@gundalow can we ask GitHub to make buttons in the Checks API available to everyone, so that access control could be done in the click handlers on the GitHub App side?

webknjaz · 2019-04-02T08:03:57Z

@gundalow I've created an FR post on the community forum for this: https://github.community/t5/GitHub-API-Development-and/FR-Make-possible-to-show-action-buttons-posted-via-Checks-API-to/m-p/21423/highlight/true#M1342

webknjaz · 2019-04-12T13:13:47Z

@gundalow Another possible improvement: https://github.community/t5/GitHub-API-Development-and/FR-Enable-deployment-and-check-run-event-propagation-within/m-p/22068/highlight/true#M1450

gundalow added the contributor_experience https://github.com/ansible/community/wiki/Contributor%20Experience label Sep 20, 2018

gundalow assigned gundalow and mkrizek Sep 20, 2018

gundalow added this to Needs prioritizing in Community Sep 21, 2018

gundalow changed the title ~~GitHub Contact Ansibullbot & Scaling issues~~ GitHub: General, Ansibullbot & Scaling issues Sep 21, 2018

gundalow added this to Medium in Contributor Experience Sep 25, 2018

gundalow moved this from Medium to To Triage in Contributor Experience Sep 26, 2018

gundalow mentioned this issue Jan 10, 2019

Contributor Experience Agenda #390

Closed

mkrizek removed their assignment Apr 12, 2019

samccann closed this as completed Nov 10, 2023
