ARROW-18048: [Dev][Archery][Crossbow] Comment bot waits for a while before generate a report #14412

kou · 2022-10-14T08:07:16Z

If we generate a report immediately after submitting CI tasks, the report doesn't have suitable CI task URLs because the CI tasks haven't started yet. We need to wait for a while before generating the report so that it can collect suitable CI task URLs.

Before:
#14409 (comment)

https://github.com/ursacomputing/crossbow/tree/actions-1e49219a52-github-r-binary-packages is used.

After:
#14409 (comment)

https://github.com/ursacomputing/crossbow/actions/runs/3248297802/jobs/5329340988 is used.
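
To illustrate the difference between the two URLs above, here is a minimal sketch of the fallback the report appears to rely on: a build link is used when the task's status already exposes one, otherwise the link points at the task's branch in the Crossbow repository. StubTask and this standalone task_url function are hypothetical stand-ins, not the Archery implementation; only the build_links attribute and the '{}/tree/{}' URL pattern come from the code quoted in this PR.

```python
from types import SimpleNamespace


class StubTask:
    """Hypothetical stand-in for a Crossbow task (not the Archery class)."""

    def __init__(self, branch, build_links=()):
        self.branch = branch
        self._build_links = list(build_links)

    def status(self):
        # Assumption: the real status object exposes `build_links`.
        return SimpleNamespace(build_links=self._build_links)


def task_url(repo_url, task):
    """Return a build link if the CI run has started, else the branch URL."""
    links = task.status().build_links
    if links:
        return links[0]
    return "{}/tree/{}".format(repo_url, task.branch)


repo = "https://github.com/ursacomputing/crossbow"
branch = "actions-1e49219a52-github-r-binary-packages"

# Report generated immediately: no build link exists yet.
not_started = StubTask(branch)
# Report generated after the CI run was triggered.
started = StubTask(
    branch,
    build_links=[repo + "/actions/runs/3248297802/jobs/5329340988"],
)

print(task_url(repo, not_started))
print(task_url(repo, started))
```

The first call yields the branch URL from the "Before" case and the second yields the job URL from the "After" case, which is why waiting before rendering the report changes the links.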

github-actions · 2022-10-14T08:07:44Z

https://issues.apache.org/jira/browse/ARROW-18048

github-actions · 2022-10-14T08:07:47Z

⚠️ Ticket has not been started in JIRA, please click 'Start Progress'.

kou · 2022-10-14T08:07:52Z

@raulcd What do you think about this (heuristic) approach?

raulcd · 2022-10-14T09:44:47Z

dev/archery/archery/bot.py

@@ -269,6 +272,10 @@ def submit(obj, tasks, groups, params, arrow_version):
        queue.put(job, prefix="actions", increment_job_id=False)
        queue.push()

+        # # wait for tasks of the job are triggered to collect more
+        # # suitable task URLs
+        time.sleep(wait)


I am OK with the heuristic implementation, but I would prefer this to be implemented inside CommentReport and task_url itself, as I can see us needing to wait for the job to trigger in other places. I had something like this patch in mind:

diff --git a/dev/archery/archery/bot.py b/dev/archery/archery/bot.py
index c548e9a..8b02dc7 100644
--- a/dev/archery/archery/bot.py
+++ b/dev/archery/archery/bot.py
@@ -270,7 +270,7 @@ def submit(obj, tasks, groups, params, arrow_version):
         queue.push()
 
         # render the response comment's content
-        report = CommentReport(job, crossbow_repo=crossbow_repo)
+        report = CommentReport(job, crossbow_repo=crossbow_repo, wait_for_task=wait)
 
         # send the response
         pull_request.create_issue_comment(report.show())
diff --git a/dev/archery/archery/crossbow/reports.py b/dev/archery/archery/crossbow/reports.py
index a3958d8..d887574 100644
--- a/dev/archery/archery/crossbow/reports.py
+++ b/dev/archery/archery/crossbow/reports.py
@@ -20,6 +20,7 @@ import csv
 import operator
 import fnmatch
 import functools
+import time
 
 import click
 import requests
@@ -41,7 +42,7 @@ class Report:
         "arrow_commit",
     ]
 
-    def __init__(self, job, task_filters=None):
+    def __init__(self, job, task_filters=None, wait_for_task=None):
         self.job = job
 
         tasks = sorted(job.tasks.items())
@@ -53,6 +54,7 @@ class Report:
             tasks = [(name, task) for name, task in tasks if name in filtered]
 
         self._tasks = dict(tasks)
+        self._wait_for_task = wait_for_task
 
     @property
     def repo_url(self):
@@ -66,6 +68,8 @@ class Report:
         return '{}/tree/{}'.format(self.repo_url, branch)
 
     def task_url(self, task):
+        if self._wait_for_task:
+            time.sleep(self._wait_for_task)
         if task.status().build_links:
             # show link to the actual build, some CI providers implement
             # the statuses API others implement the checks API, retrieve any.

What do you think?

OK. I've switched to this approach.

Hmm... It seems that this approach may sleep N times when we run N tasks. For example, crossbow submit -g wheel will sleep 9 times.

You are correct, sorry about that. We should probably only wait if there are no links to the individual task. In that case I would expect the links to be available for the remaining tasks, so the wait would only happen for the first one:

if not task.status().build_links and self._wait_for_task:
    time.sleep(self._wait_for_task)

Sorry, this won't work because, if present, it would be a string. I might take a look tomorrow if it is still not solved.
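
As a rough illustration of the "wait only when links are missing" idea discussed above, the sketch below sleeps at most once per report and re-reads the task status afterwards. It is a sketch under stated assumptions, not the change that was merged: WaitOnceReport, StubTask, the injectable sleep argument, and the example.invalid URLs are all hypothetical names introduced here, and only build_links and wait_for_task come from the thread.

```python
import time
from types import SimpleNamespace


class StubTask:
    """Hypothetical task whose build link appears once `started` is True."""

    def __init__(self, branch, link):
        self.branch = branch
        self._link = link
        self.started = False

    def status(self):
        links = [self._link] if self.started else []
        return SimpleNamespace(build_links=links)


class WaitOnceReport:
    """Hypothetical report that waits at most once for CI tasks to start."""

    def __init__(self, wait_for_task=None, sleep=time.sleep):
        self._wait_for_task = wait_for_task
        self._sleep = sleep      # injectable so the sketch can run without waiting
        self._waited = False     # remembers whether we already slept

    def task_url(self, task):
        links = task.status().build_links
        if not links and self._wait_for_task and not self._waited:
            self._sleep(self._wait_for_task)
            self._waited = True
            # Re-read the status: links may have appeared during the wait.
            links = task.status().build_links
        if links:
            return links[0]
        # Fallback when no build link is available (placeholder URL).
        return "https://example.invalid/tree/{}".format(task.branch)


tasks = [
    StubTask("branch-{}".format(i), "https://example.invalid/run/{}".format(i))
    for i in range(3)
]
sleep_calls = []


def fake_sleep(seconds):
    # Simulate the CI provider starting every task while we wait.
    sleep_calls.append(seconds)
    for task in tasks:
        task.started = True


report = WaitOnceReport(wait_for_task=60, sleep=fake_sleep)
urls = [report.task_url(task) for task in tasks]
print(sleep_calls, urls)
```

With N tasks this sleeps once instead of N times, at the cost of not waiting again for tasks that are still missing a link after the first wait.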

raulcd

Thanks @kou !

ursabot · 2022-10-16T01:52:14Z

Benchmark runs are scheduled for baseline = fc01a9c and contender = d1a8f4b. d1a8f4b is a master commit associated with this PR. Results will be available as each benchmark for each run completes.
Conbench compare runs links:
[Finished ⬇️0.0% ⬆️0.0%] ec2-t3-xlarge-us-east-2
[Failed ⬇️0.0% ⬆️0.0%] test-mac-arm
[Failed ⬇️0.27% ⬆️0.0%] ursa-i9-9960x
[Finished ⬇️0.04% ⬆️0.0%] ursa-thinkcentre-m75q
Buildkite builds:
[Finished] d1a8f4ba ec2-t3-xlarge-us-east-2
[Failed] d1a8f4ba test-mac-arm
[Failed] d1a8f4ba ursa-i9-9960x
[Finished] d1a8f4ba ursa-thinkcentre-m75q
[Finished] fc01a9c3 ec2-t3-xlarge-us-east-2
[Failed] fc01a9c3 test-mac-arm
[Failed] fc01a9c3 ursa-i9-9960x
[Finished] fc01a9c3 ursa-thinkcentre-m75q
Supported benchmarks:
ec2-t3-xlarge-us-east-2: Supported benchmark langs: Python, R. Runs only benchmarks with cloud = True
test-mac-arm: Supported benchmark langs: C++, Python, R
ursa-i9-9960x: Supported benchmark langs: Python, R, JavaScript
ursa-thinkcentre-m75q: Supported benchmark langs: C++, Java

raulcd reviewed Oct 14, 2022

Sleep in Report

9078710

raulcd approved these changes Oct 14, 2022

kou merged commit d1a8f4b into apache:master Oct 14, 2022

kou deleted the crossbow-bot-wait-before-report branch October 14, 2022 13:08

kou mentioned this pull request Oct 17, 2022

ARROW-18068: [Dev][Archery][Crossbow] Comment bot only waits for task if link is not available #14429

Merged

asfimport mentioned this pull request Oct 17, 2022

[Dev][Archery][Crossbow] Comment bot only waits for task if link is not available #20459

Closed
