Feature/rq dashboard #149
Conversation
    command, shell=True, stdout=rq_dashboard_log, stderr=rq_dashboard_log
)

atexit.register(proc.kill)
Hook to kill the dashboard when the psimulate runner loop ends.
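For reference, a minimal sketch of that cleanup pattern, with an illustrative function name rather than the actual psimulate code:

```python
import atexit
import subprocess
from pathlib import Path


def start_dashboard(command: str, log_path: Path) -> subprocess.Popen:
    # Launch the dashboard as a background process, sending its output to a log file.
    log_file = log_path.open("a")
    proc = subprocess.Popen(command, shell=True, stdout=log_file, stderr=log_file)
    # Register a hook so the dashboard is killed when the interpreter exits,
    # i.e. when the psimulate runner loop ends.
    atexit.register(proc.kill)
    return proc
```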
logger.info("Fetching redis urls and starting RQ-Dashboard")
split_urls = " -u ".join(url for url in redis_urls)
command = "rq-dashboard -u " + split_urls + " --debug"
I think this implementation could be improved depending on what we want. Currently, all redis urls are submitted to the dashboard, but it doesn't have to be that way. Users could also open up another dashboard with just one redis db. There are a lot of options here.
We want all redis urls in the same dashboard; we just want to change the way the dashboard handles the information.
split_urls = " -u ".join(url for url in redis_urls)
command = "rq-dashboard -u " + split_urls + " --debug"

rq_dashboard_log.write(f"Dashboard running at http://{hostname}:9181\n")
This log currently only captures the local url and the url that the user can navigate to for the dashboard. Room for improvement here.
written_results,
unwritten_results,
batch_size,
output_directory, written_results, unwritten_results, batch_size
Did you delete the trailing comma here? That's what induces the whitespace noise.
I've been having issues with black/isort, so I'll add this back in.
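For context, black's "magic trailing comma" is what drives this: with a trailing comma after the last argument it keeps the call exploded, one argument per line, and without it the call gets collapsed onto a single line when it fits. A sketch with a hypothetical call site (only the argument names come from this diff):

```python
# With a trailing comma, black keeps one argument per line:
write_results(
    output_directory,
    written_results,
    unwritten_results,
    batch_size,
)

# Without the trailing comma, black collapses the call when it fits on one line:
write_results(output_directory, written_results, unwritten_results, batch_size)
```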
@@ -107,6 +102,26 @@ def try_run_vipin(log_path: Path) -> None:
        logger.warning(f"Performance reporting failed with: {e}")


def run_rq_dashboard(redis_urls: list, output_directory: Path) -> None:
I'd make a `monitoring` module and put this function in it. It'll eventually become a subpackage with a bunch of other stuff in it.
Also, call your `output_directory` something more descriptive like `logging_root`.
Ok I will do this.
hostname = url.split(":")[0]

# Set up log file
rq_dashboard_log = (output_directory / "rq.log").open("a")
`rq.log` is a bad name for this. Use `rq_dashboard.log`.
split_urls = " -u ".join(url for url in redis_urls)
command = "rq-dashboard -u " + split_urls + " --debug"

rq_dashboard_log.write(f"Dashboard running at http://{hostname}:9181\n")
This is a lie. You haven't launched the dashboard at the time you're writing to this log file.
command = "rq-dashboard -u " + split_urls + " --debug"

rq_dashboard_log.write(f"Dashboard running at http://{hostname}:9181\n")
logger.info(f"Dashboard running at http://{hostname}:9181")
As is this.
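A minimal sketch of the reordering being suggested, reusing the names from the snippet under review (the poll check is an extra assumption, not something in the PR):

```python
proc = subprocess.Popen(
    command, shell=True, stdout=rq_dashboard_log, stderr=rq_dashboard_log
)
# Only report the URL once the dashboard process has actually been launched.
if proc.poll() is None:  # process didn't exit immediately
    rq_dashboard_log.write(f"Dashboard running at http://{hostname}:9181\n")
    logger.info(f"Dashboard running at http://{hostname}:9181")
```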
# Set up log file
rq_dashboard_log = (output_directory / "rq.log").open("a")
logger.info("Fetching redis urls and starting RQ-Dashboard")
split_urls = " -u ".join(url for url in redis_urls)
Two things: `" -u ".join(url for url in redis_urls)` is the same as `" -u ".join(redis_urls)`, and `url_flags = " ".join([f'-u {url}' for url in redis_urls])` means you can write `command = "rq-dashboard " + url_flags + " --debug"` without the fencepost `-u` in the command.
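Put concretely, a sketch of the two approaches (assuming `redis_urls` is a list of url strings); both build the same command string:

```python
redis_urls = ["redis://host1:6379", "redis://host2:6380"]

# Current approach: the first -u has to be prepended by hand (the fencepost).
split_urls = " -u ".join(redis_urls)  # identical to joining a generator over redis_urls
command = "rq-dashboard -u " + split_urls + " --debug"

# Suggested approach: build one "-u <url>" flag per url, no special-cased first element.
url_flags = " ".join(f"-u {url}" for url in redis_urls)
command = "rq-dashboard " + url_flags + " --debug"
```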
# Grab redis urls for dashboard and send them to function for popen
rq_urls = [
    f"redis://{hostname}.cluster.ihme.washington.edu:{port}"
The workers are okay with this change?
I'm not sure I understand your question, but this isn't doing anything to the workers; it's just formatting the redis urls the way rq-dashboard wants them. It isn't touching the workers themselves, to my knowledge.
So it's NOT changing the workers by giving them the FQDN with `.cluster.ihme.washington.edu`. I would presume that so long as the cluster's FQDN doesn't change, we're ok.
@mattkappel Are you saying that `f"redis://{hostname}:{port}"` resolves to `"redis://{hostname}.cluster.ihme.washington.edu:{port}"`? If so, then there's no need for `rq_urls` and you can just pass in `redis_urls`?
Oh sorry, I misread! `rq_urls` != `redis_urls`. But I would think what you think I said is true.
…and for future use
One important note: I had to make a few changes to the cli.py file in the rq-dashboard package. The package itself has not been updated in a few years, but click has, so to get this working I had to make my own branch that fixes some default settings for the click options. Those changes are not reflected in this PR, and the RQ-Dashboard as implemented here will not work without making them in the rq-dashboard package.
command = "rq-dashboard " + url_flags + " --debug"

proc = subprocess.Popen(
    command, shell=True, stdout=rq_dashboard_log, stderr=rq_dashboard_log
Do you want stdout and stderr going to the same log?
Yes.
I mean, it's sometimes useful to have a `log.o` and a `log.e`, but generally I don't think that's useful for our logging purposes in `psimulate`.
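For what it's worth, splitting the streams would be a small change if it's ever wanted; a sketch reusing the names from the PR snippet, with illustrative log file names:

```python
# Combined (what the PR does): stdout and stderr interleave in one file.
log = (output_directory / "rq_dashboard.log").open("a")
proc = subprocess.Popen(command, shell=True, stdout=log, stderr=log)

# Split: separate files for stdout and stderr.
out_log = (output_directory / "rq_dashboard.o.log").open("a")
err_log = (output_directory / "rq_dashboard.e.log").open("a")
proc = subprocess.Popen(command, shell=True, stdout=out_log, stderr=err_log)
```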
    command, shell=True, stdout=rq_dashboard_log, stderr=rq_dashboard_log
)
rq_dashboard_log.write(f"Dashboard running at http://{hostname}:9181\n")
logger.info(f"Dashboard running at http://{hostname}:9181")
Where did these static ports come from?
In the vivarium_dashboard package, the cli.py that has all the click options sets 9181 as the default.
We ought to have psimulate pick a random open port like @collijk suggested and pass that in as an option.
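One common way to do that is to bind a socket to port 0 and let the OS pick; a sketch, assuming the chosen port can be passed to the dashboard with a `--port` flag:

```python
import socket


def find_open_port() -> int:
    # Binding to port 0 asks the OS for any free port; read it back before closing.
    # (There is a small window where another process could grab the port afterwards.)
    with socket.socket(socket.AF_INET, socket.SOCK_STREAM) as sock:
        sock.bind(("", 0))
        return sock.getsockname()[1]


port = find_open_port()
command = f"rq-dashboard {url_flags} --port {port} --debug"
```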
written_results,
unwritten_results,
batch_size=batch_size,
output_directory, written_results, unwritten_results, batch_size=batch_size
Similar to above, I think black will add in a trailing comma and the line breaks.
Feature/RQ-Dashboard
Description
Added functionality to run an RQ-dashboard when a user launches a parallel simulation.
- Added a run_rq_dashboard function that launches the dashboard during the main psimulate runner loop.
- The dashboard captures all jobs submitted on that host for all redis databases that were provided when the dashboard was initialized (the current implementation uses all redis databases, but this could be changed with little work).
- Added a hook to kill the subprocess running the dashboard when the parallel simulation ends.
- Added rq.log to log the subprocess and provide the user with the URL to navigate to for the dashboard. The URL is also printed to the terminal.
Testing
Ran multiple parallel simulations and successfully saw a working implementation of:
- Creation and usage of the new rq.log file.
- The dashboard launching at the start of the simulation and the process ending when the simulation finishes.
- All redis databases used for the parallel simulation being attached to the dashboard, however many were used for that simulation.