Refactor to improve synchronization and testability #45

bryanburgers · 2019-06-07T15:12:32Z

A start on refactoring Homu to improve synchronization and add more tests. More information about the end goal can be found in the proof-of-concept PR #44.

- Test command parsing logic (Test the command parsing logic #32)
- Test authorization code
- Pull PullReqState into its own file
- Use Timeline events to perform the initial synchronization
- Add tests for timeline event synchronization
- Track state for multiple tries
- Migrate webhook handling code to use same API as initial synchronization

bryanburgers · 2019-06-21T11:55:25Z

Making progress! I now have Homu synchronizing its initial state from GitHub Timeline Events in a pretty accurate way. A few differences remain where I still need to track down what the expected state should be.

Current Homu:

Local Homu with these changes:

Anybody can pull this branch and try it locally. Some notes about that:

- It only does the synchronization. Because it won't be set up for webhooks, it will not keep in sync at this point.
- You can use any access token from any user. I'd suggest not using an access token for a user that has access to rust-lang/rust, just in case there's a bug somewhere that would affect the real functioning of bors.

Pull the code that creates a comment on a pull request out of the authorization check, and add tests around the authorization checks. Because the caller still needs to know the text of the failure comment, change the check from a function that returns True on success and False on failure to one that returns True on success and raises an Exception (carrying the failure comment) on failure.
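A minimal sketch of that shape, not the actual Homu code: `AuthorizationError`, `verify_auth`, `handle_command`, and the message text are all hypothetical names chosen for illustration. The point is that the check itself has no side effects, so it can be tested without a GitHub connection, and the caller decides what to do with the failure text.

```python
class AuthorizationError(Exception):
    """Hypothetical exception that carries the text of the failure comment."""

    def __init__(self, comment):
        super().__init__(comment)
        self.comment = comment


def verify_auth(username, reviewers):
    """Return True if `username` may issue the command, otherwise raise."""
    if username in reviewers:
        return True
    # Illustrative wording only; the real comment text may differ.
    raise AuthorizationError(
        '@{}: insufficient privileges: not a reviewer'.format(username))


def handle_command(username, reviewers, post_comment):
    """The caller owns the side effect of posting the failure comment."""
    try:
        verify_auth(username, reviewers)
    except AuthorizationError as e:
        post_comment(e.comment)
        return False
    return True


# In a test, a plain list stands in for the GitHub comment API.
posted = []
allowed = handle_command('alice', {'alice'}, posted.append)
denied = handle_command('mallory', {'alice'}, posted.append)
```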

Remove the free function `db_query` in favor of a `LockingDatabase` class that wraps a database connection. This is done so that, in order for a class to use the database, it only needs its `db` instance, instead of needing both a `db` instance and a `db_query` function. This allows us to break out classes into separate files.
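Roughly, the wrapper could look like the sketch below. Only the class name `LockingDatabase` comes from this change; the `execute` method, its signature, and the table used in the usage lines are assumptions for illustration, with `sqlite3` standing in for the database.

```python
import sqlite3
import threading


class LockingDatabase:
    """Wrap a DB-API connection so every query runs under a single lock."""

    def __init__(self, conn):
        self._conn = conn
        self._cursor = conn.cursor()
        self._lock = threading.Lock()

    def execute(self, sql, params=()):
        # Hypothetical method name: run one statement, commit, return rows.
        with self._lock:
            self._cursor.execute(sql, params)
            rows = self._cursor.fetchall()
            self._conn.commit()
            return rows


# A class that needs the database now only needs this one object.
db = LockingDatabase(sqlite3.connect(':memory:'))
db.execute('CREATE TABLE pull (num INTEGER, status TEXT)')
db.execute('INSERT INTO pull VALUES (?, ?)', (45, 'pending'))
rows = db.execute('SELECT status FROM pull WHERE num = ?', (45,))
```

If the connection is shared across threads, `sqlite3.connect` would also need `check_same_thread=False`; the lock then serializes access.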

Extract PullReqState into its own file for code readability. Because PullReqState shares some constants with main and server, extract those constants into a `consts.py` file as well.

For some critical comments, Homu adds extra information about its state to the comment in the form of a JSON blob that isn't visible to the user but is visible in the source of the comment. For example, Homu may leave a comment like "⌛ Trying commit abcdef with merge 012345...", where the JSON blob that follows is not visible because it is wrapped in `<!-- -->` markdown/HTML comments. This change parses this extra information out of the comments and makes it available to the initial synchronization algorithm.
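A sketch of the parsing step, under assumptions: the `homu:` prefix inside the HTML comment and the field names in the example blob (`type`, `head_sha`, `merge_sha`) are illustrative, not confirmed by this description, and `parse_hidden_state` is a hypothetical helper.

```python
import json
import re

# Assumed layout: <!-- homu: {...json...} --> somewhere in the comment body.
HIDDEN_BLOB = re.compile(r'<!--\s*homu:\s*(\{.*?\})\s*-->', re.DOTALL)


def parse_hidden_state(comment_body):
    """Return the embedded JSON object as a dict, or None if there is none."""
    match = HIDDEN_BLOB.search(comment_body)
    if match is None:
        return None
    try:
        return json.loads(match.group(1))
    except ValueError:
        # A malformed blob is treated the same as a missing one.
        return None


body = (
    '⌛ Trying commit abcdef with merge 012345...\n'
    '<!-- homu: {"type": "TryBuildStarted", '
    '"head_sha": "abcdef", "merge_sha": "012345"} -->'
)
info = parse_hidden_state(body)
```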

Create the general structure for `process_event` and its testing, and get a long way toward testing approval comments.

Break up the "status" field into multiple orthogonal state fields:

* build state (whether the primary build is running or has succeeded or failed)
* try state (whether the most recent try is running or has succeeded or failed)
* approval state (whether the pull request is approved)

These could mostly be determined before this change (and still can be) by looking at `state.get_status()` and `state.try_`, but storing them separately helps make state changes more explicit.

Also, keep track of the current GitHub synchronization cursor in the pull request state, so that we can use it later.
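One way to picture the split, as a sketch only: the class and field names below (`RunState`, `PullState`, `approved_by`, `cursor`) are hypothetical and are not taken from the change itself.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Optional


class RunState(Enum):
    """State of one kind of run (primary build or try)."""
    NONE = ''
    PENDING = 'pending'
    SUCCESS = 'success'
    FAILURE = 'failure'


@dataclass
class PullState:
    # Each concern lives in its own field instead of one overloaded string.
    build_state: RunState = RunState.NONE
    try_state: RunState = RunState.NONE
    approved_by: Optional[str] = None
    # Where the last synchronization stopped, so the next one can resume.
    cursor: Optional[str] = None

    @property
    def approved(self):
        return self.approved_by is not None


state = PullState()
state.try_state = RunState.SUCCESS   # a try finished
state.approved_by = 'reviewer'       # approval does not touch try_state
```

With separate fields, approving a pull request no longer has to overwrite whatever the last try reported.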

Test that issuing a `@bors retry` command moves the state from 'pending' to '' for pending pull requests. This is frequently used as a way to yield the current build to a different pull request.
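In miniature, such a test could look like the following. The `State` class, the event dictionary layout, and this `process_event` are simplified stand-ins written for the example, not the real implementation.

```python
class State:
    """Simplified stand-in for a pull request's state."""

    def __init__(self, status=''):
        self.status = status


def process_event(state, event):
    """Apply one event to the state; return True if the state changed."""
    is_retry = (event.get('type') == 'comment'
                and '@bors retry' in event.get('body', ''))
    if is_retry and state.status == 'pending':
        state.status = ''
        return True
    return False


def test_retry_resets_pending():
    state = State(status='pending')
    changed = process_event(state, {'type': 'comment', 'body': '@bors retry'})
    assert changed
    assert state.status == ''


def test_retry_ignored_when_not_pending():
    state = State(status='success')
    changed = process_event(state, {'type': 'comment', 'body': '@bors retry'})
    assert not changed
    assert state.status == 'success'
```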

Homu creates a message when a try or build timeout occurs. Handle this to keep the state properly updated.

bryanburgers · 2019-07-10T12:51:56Z

Status update

I've been running a "follower" against the rust-lang/rust repo using this new sync method for a couple of days now. Every minute or so, it grabs all of the PRs from the API and resyncs them from the previous sync point to current.
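The resync step is roughly the loop below. This is a sketch: `resync`, `fetch_events_after`, and the in-memory `timeline` are hypothetical, and a list index stands in for the opaque cursor that the API would return.

```python
def resync(pull, fetch_events_after, process_event):
    """Apply only the events newer than pull['cursor'], then advance it."""
    events, new_cursor = fetch_events_after(pull['cursor'])
    for event in events:
        process_event(pull, event)
    if new_cursor is not None:
        pull['cursor'] = new_cursor
    return len(events)


# Fake timeline standing in for the GitHub API.
timeline = ['approved', 'build started']


def fetch_events_after(cursor):
    start = 0 if cursor is None else int(cursor)
    events = timeline[start:]
    return events, (str(len(timeline)) if events else None)


def record(pull, event):
    pull['seen'].append(event)


pull = {'cursor': None, 'seen': []}
first = resync(pull, fetch_events_after, record)    # full initial sync
second = resync(pull, fetch_events_after, record)   # nothing new
timeline.append('build succeeded')
third = resync(pull, fetch_events_after, record)    # only the new event
```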

https://homu.burgers.io/queue/rust

Takeaways:

- I'm frequently seeing it stay in sync! Which means that this method is definitely working as expected.
- This has actually been a very effective way to find edge cases that I wasn't aware of. Whenever the follower doesn't match the current version (https://buildbot2.rust-lang.org/homu/queue/rust), I figure out why and adjust the synchronization code.
  - Unfortunately, that won't work so well once I need to start reacting to changes and issuing comments, at which point I'll need to drop back down to test repos.
- The current version often doesn't have ALL the open PRs. It appears that on initial sync, it might only synchronize the newest 100 open pull requests (?)
  - This doesn't seem to matter much in practice. Old PRs end up showing up after they get updated anyway.
- I often see differences in the "Mergeable" and "Assignee" columns. It looks like the current version doesn't always stay in sync for those fields.
- Rate limit: GitHub v4 has a 5000 "cost" rate limit. Despite pulling changes every minute, I've only ever used 398 of those (still have 4602 remaining) before the rate limit resets. So this method doesn't look likely to run up against the rate limit.
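For a rough sense of the headroom, here is the arithmetic behind those numbers. It assumes the limit resets hourly and that the follower syncs exactly once a minute, which is only approximately true ("every minute or so").

```python
LIMIT = 5000           # rate limit "cost" budget per window
used = 398             # highest usage observed before a reset
syncs_per_window = 60  # assumption: one sync per minute, one-hour window

remaining = LIMIT - used
cost_per_sync = used / syncs_per_window
percent_used = 100 * used / LIMIT

print(remaining)                # 4602
print(round(cost_per_sync, 2))  # 6.63
print(round(percent_used, 1))   # 8.0
```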

Mark-Simulacrum · 2019-07-10T12:53:30Z

I think GitHub returns at most 100 objects from any query, so we're probably not asking for the next page on some query.

bryanburgers · 2019-07-10T13:08:29Z

@Mark-Simulacrum

`homu/homu/main.py`, line 1495 in abd0083:

`for pull in repo.iter_pulls(state='open'):`

Right. https://github3py.readthedocs.io/en/stable-0.9/repos.html#github3.repos.repo.Repository.iter_pulls suggests that it returns all available pull requests, but it seems likely we're just getting the first page of results.

But at this point, I don't believe it's worth investigating.
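For reference, if it does turn out to be a missing next-page request, the general fix is a loop like this sketch. `iter_all` and `fetch_page` are hypothetical; `fetch_page` stands in for whatever call returns one page of results and is not a github3.py function.

```python
def iter_all(fetch_page, per_page=100):
    """Yield every item by requesting pages until a short page comes back."""
    page = 1
    while True:
        items = fetch_page(page, per_page)
        yield from items
        if len(items) < per_page:
            return
        page += 1


# Fake backend with 250 open pull requests.
open_pulls = list(range(1, 251))


def fetch_page(page, per_page):
    start = (page - 1) * per_page
    return open_pulls[start:start + per_page]


first_page_only = fetch_page(1, 100)
everything = list(iter_all(fetch_page))
```

Here `first_page_only` has 100 entries while `everything` has all 250, which is the kind of gap described above.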

Mark-Simulacrum · 2019-07-10T13:21:14Z

Somewhat agreed, though I think we'd want to investigate before going forward -- missing PRs in the queue are annoying because you can't easily tell if we have all of them by comparing the number, you have to check one by one.

bryanburgers · 2019-07-24T10:59:41Z

Status update: I've been incredibly busy the last couple of weeks with other things and haven't been able to get back to working on this.

From what I see, this change is not something I can introduce in pieces, so it will take time to get it to a production state.

I hope to be able to return to this mid-August.

Add more of the state to the Repository class, and make each PullReqState reference its Repository and get information from there.

Include a history of all of the tries and all of the builds when parsing the history of a pull request.

bryanburgers · 2019-08-16T20:28:14Z

Slowly but surely trying to keep working on this.

I've now integrated a history of tries and builds, based on the GitHub history and bors' past comments, so we can have something like the following image (but with more UI work).

https://homu.burgers.io/results/rust/58281

I'll keep running https://homu.burgers.io/queue/rust while I work on this. From what I can tell checking in every once in a while, it's a pretty accurate representation of the state of the world, and it is almost always more accurate with respect to mergeability and assignees.

Unfortunately, it's getting long and is going to be a huge pain to review, and a huge risk to switch over.

Previously, we set `status`, `try`, `build_state`, and `try_state` separately on each event. With the addition of the run histories, we can now glean all of this information from those histories instead of tracking them independently. The results appear to all be identical on the current Homu queue except for one edge case: a pull request was tried, approved, then unapproved.

- Using the previous method, the status would be '', because approval changes the status from 'success (try)' to '', and unapproval doesn't change the status.
- Using the current method, the status would be 'success (try)', because a successful try has occurred for the relevant commit hash and the PR isn't approved.
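The edge case can be reproduced with a small model of the two approaches. This is a sketch under assumptions: the event names and both functions are hypothetical, and every event is assumed to refer to the same head commit.

```python
def status_from_events(events):
    """Previous approach: mutate `status` as each event arrives."""
    status = ''
    for event in events:
        if event == 'try_succeeded':
            status = 'success (try)'
        elif event == 'approved':
            status = ''
        # 'unapproved' leaves the status untouched.
    return status


def status_from_history(events):
    """Current approach: rebuild the histories, then derive the status."""
    approved = False
    try_history = []
    for event in events:
        if event == 'try_succeeded':
            try_history.append('success')
        elif event == 'approved':
            approved = True
        elif event == 'unapproved':
            approved = False
    if not approved and try_history and try_history[-1] == 'success':
        return 'success (try)'
    return ''


edge_case = ['try_succeeded', 'approved', 'unapproved']
previous = status_from_events(edge_case)
current = status_from_history(edge_case)
```

For sequences without the final unapproval, such as `['try_succeeded']` or `['try_succeeded', 'approved']`, the two functions agree.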

bryanburgers changed the title ~~Refactor synchronization~~ Refactor to improve synchronization and testability Jun 7, 2019

bryanburgers force-pushed the refactor-synchronization branch from bd03bec to 6356b54 Compare June 12, 2019 17:34

bryanburgers force-pushed the refactor-synchronization branch from 6356b54 to d9251e0 Compare June 21, 2019 11:49

bryanburgers added 12 commits July 2, 2019 07:13

Pull PullReqState into its own file

47e0117

Extract PullReqState into its own file for code readability. Because PullReqState shares some constants with main and server, extract those constants into a `consts.py` file as well.

Structure for process_event; test approvals

a5fe2e6

Create the general structure for `process_event` and its testing, and get a long way toward testing approval comments.

More tests written and passing

b693b15

Use process_event in synchronization

7f9443b

Test builds and try builds

2c6d3b3

Pushed commits reset the try_ state

2526ffe

Test that @bors retry resets the state

a9234c7

Test that issuing a `@bors retry` command moves the state from 'pending' to '' for pending pull requests. This is frequently used as a way to yield the current build to a different pull request.

Handle build timeout comments

2b83596

Homu creates a message when a try or build timeout occurs. Handle this to keep the state properly updated.

bryanburgers force-pushed the refactor-synchronization branch from d9251e0 to 2b83596 Compare July 2, 2019 15:28

Add GitHubPullRequestState

3b2562f

bryanburgers mentioned this pull request Jul 24, 2019

Allow try builds after approval #58

Open

bryanburgers added 3 commits July 30, 2019 12:45

Pass around less repository state!

c2b343a

Add more of the state to the Repository class, and make each PullReqState reference its Repository and get information from there.

Switch to github_v4.py

678a195

Include build history when parsing history

68bb030

Include a history of all of the tries and all of the builds when parsing the history of a pull request.
