Skip to content

Document procedure to run Data Browser locally (#7643)#7836

Merged
hannes-ucsc merged 1 commit intodevelopfrom
issues/dsotirho-ucsc/7643-data-browser-documentation
Mar 18, 2026
Merged

Document procedure to run Data Browser locally (#7643)#7836
hannes-ucsc merged 1 commit intodevelopfrom
issues/dsotirho-ucsc/7643-data-browser-documentation

Conversation

@dsotirho-ucsc
Copy link
Copy Markdown
Contributor

@dsotirho-ucsc dsotirho-ucsc commented Mar 6, 2026

Linked issues: #7643

Checklist

Author

  • PR is assigned to the author
  • Status of PR is In progress
  • PR is a draft
  • Target branch is develop
  • Name of PR branch matches issues/<GitHub handle of author>/<issue#>-<slug>
  • PR is linked to all issues it (partially) resolves
  • Status of linked issues is In progress
  • PR description links to linked issues
  • PR title matches1 that of a linked issue or comment in PR explains why they're different
  • PR title references all linked issues
  • For each linked issue, there is at least one commit whose title references that issue

1 when the issue title describes a problem, the corresponding PR
title is Fix: followed by the issue title

Author (partiality)

  • Added p tag to titles of partial commits
  • This PR is labeled partial or completely resolves all linked issues
  • This PR partially resolves each of the linked issues or does not have the partial label

Author (reindex)

  • Added r tag to commit title or the changes introduced by this PR will not require reindexing of any deployment
  • This PR is labeled reindex:dev or the changes introduced by it will not require reindexing of dev
  • This PR is labeled reindex:anvildev or the changes introduced by it will not require reindexing of anvildev
  • This PR is labeled reindex:anvilprod or the changes introduced by it will not require reindexing of anvilprod
  • This PR is labeled reindex:prod or the changes introduced by it will not require reindexing of prod
  • This PR is labeled reindex:partial and its description documents the specific reindexing procedure for dev, anvildev, anvilprod and prod or requires a full reindex or carries none of the labels reindex:dev, reindex:anvildev, reindex:anvilprod and reindex:prod

Author (mirror)

  • This PR is labeled mirror:dev or the changes introduced by it will not require mirroring of dev
  • This PR is labeled mirror:anvildev or the changes introduced by it will not require mirroring of anvildev
  • This PR is labeled mirror:anvilprod or the changes introduced by it will not require mirroring of anvilprod
  • This PR is labeled mirror:prod or the changes introduced by it will not require mirroring of prod
  • This PR is labeled mirror:partial and its description documents the specific mirroring procedure for dev, anvildev, anvilprod and prod or requires a full mirroring or carries none of the labels mirror:dev, mirror:anvildev, mirror:anvilprod and mirror:prod

Author (API changes)

  • This PR and its linked issues are labeled API or this PR does not modify a REST API
  • Added a (A) tag to commit title for backwards (in)compatible changes or this PR does not modify a REST API
  • Updated REST API version number in app.py or this PR does not modify a REST API

Author (upgrading deployments)

  • Ran make docker_images.json and committed the resulting changes or this PR does not modify azul_docker_images, or any other variables referenced in the definition of that variable
  • Documented upgrading of deployments in UPGRADING.rst or this PR does not require upgrading deployments
  • Added u tag to commit title or this PR does not require upgrading deployments
  • This PR is labeled upgrade or does not require upgrading deployments
  • This PR is labeled deploy:shared or does not modify docker_images.json, and does not require deploying the shared component for any other reason
  • This PR is labeled deploy:gitlab or does not require deploying the gitlab component
  • This PR is labeled deploy:runner or does not require deploying the runner image

Author (hotfixes)

  • Added F tag to main commit title or this PR does not include permanent fix for a temporary hotfix
  • Reverted the temporary hotfixes for any linked issues or the none of the stable branches (anvilprod and prod) have temporary hotfixes for any of the issues linked to this PR

Author (before every review)

  • Rebased PR branch on develop, squashed fixups from prior reviews
  • Ran make requirements_update or this PR does not modify Dockerfile, environment, requirements*.txt, common.mk, Makefile or environment.boot
  • Added R tag to commit title or this PR does not modify requirements*.txt
  • This PR is labeled reqs or does not modify requirements*.txt
  • make integration_test passes in personal deployment or this PR does not modify functionality that could affect the IT outcome
  • PR is awaiting requested review from a peer
  • Status of PR is Review requested
  • PR is assigned to only the peer and the author

Peer reviewer (after approval)

Note that after requesting changes, the PR must be assigned to only the author.

  • Actually approved the PR
  • PR is not a draft
  • PR is awaiting requested review from system administrator
  • Status of PR is Review requested
  • PR is assigned to only the system administrator and the author

System administrator (after approval)

  • Actually approved the PR
  • Labeled linked issues as demo or no demo
  • Commented on linked issues about demo expectations or all linked issues are labeled no demo
  • Decided if PR can be labeled no sandbox
  • A comment to this PR details the completed security design review
  • PR title is appropriate as title of merge commit
  • N reviews label is accurate
  • Status of PR is Approved
  • PR is assigned to only the operator and the author

Operator

  • Checked reindex:… labels and r commit title tag
  • Checked mirror:… labels
  • Checked that demo expectations are clear or all linked issues are labeled no demo
  • Squashed PR branch and rebased onto develop
  • Sanity-checked history
  • Pushed PR branch to GitHub

Operator (deploy .shared and .gitlab components)

  • Ran _select dev.shared && CI_COMMIT_REF_NAME=develop make -C terraform/shared apply_keep_unused or this PR is not labeled deploy:shared
  • Ran _select dev.gitlab && CI_COMMIT_REF_NAME=develop make -C terraform/gitlab apply or this PR is not labeled deploy:gitlab
  • Ran _select anvildev.shared && CI_COMMIT_REF_NAME=develop make -C terraform/shared apply_keep_unused or this PR is not labeled deploy:shared
  • Ran _select anvildev.gitlab && CI_COMMIT_REF_NAME=develop make -C terraform/gitlab apply or this PR is not labeled deploy:gitlab
  • Checked the items in the next section or this PR is labeled deploy:gitlab
  • PR is assigned to only the system administrator and the author or this PR is not labeled deploy:gitlab

System administrator (post-deploy of .gitlab component)

  • Background migrations for dev.gitlab are complete or this PR is not labeled deploy:gitlab
  • Background migrations for anvildev.gitlab are complete or this PR is not labeled deploy:gitlab
  • PR is assigned to only the operator and the author

Operator (deploy runner image)

  • Ran _select dev.gitlab && make -C terraform/gitlab/runner or this PR is not labeled deploy:runner
  • Ran _select anvildev.gitlab && make -C terraform/gitlab/runner or this PR is not labeled deploy:runner

Operator (sandbox build)

  • Added sandbox label or PR is labeled no sandbox
  • Pushed PR branch to GitLab dev or PR is labeled no sandbox
  • Pushed PR branch to GitLab anvildev or PR is labeled no sandbox
  • Build passes in sandbox deployment or PR is labeled no sandbox
  • Build passes in anvilbox deployment or PR is labeled no sandbox
  • Reviewed build logs for anomalies in sandbox deployment or PR is labeled no sandbox
  • Reviewed build logs for anomalies in anvilbox deployment or PR is labeled no sandbox
  • Deleted unreferenced indices in sandbox or this PR does not remove catalogs or otherwise causes unreferenced indices in sandbox
  • Deleted unreferenced indices in anvilbox or this PR does not remove catalogs or otherwise causes unreferenced indices in anvilbox
  • Started reindex in sandbox or this PR is not labeled reindex:dev
  • Started reindex in anvilbox or this PR is not labeled reindex:anvildev
  • Checked for failures in sandbox or this PR is not labeled reindex:dev
  • Checked for failures in anvilbox or this PR is not labeled reindex:anvildev
  • Started mirroring in sandbox or this PR is not labeled mirror:dev
  • Started mirroring in anvilbox or this PR is not labeled mirror:anvildev
  • Checked for failures in sandbox or this PR is not labeled mirror:dev
  • Checked for failures in anvilbox or this PR is not labeled mirror:anvildev

Operator (merge the branch)

  • All status checks passed and the PR is mergeable
  • The title of the merge commit starts with the title of this PR
  • Added PR # reference to merge commit title
  • Collected commit title tags in merge commit title but only included p if the PR is also labeled partial
  • Pushed merge commit to GitHub
  • Status of PR is Merged lower
  • Status of blocked issues is Triage or no issues are blocked on the linked issues

Operator (main build)

  • Pushed merge commit to GitLab dev
  • Pushed merge commit to GitLab anvildev
  • Build passes on GitLab dev
  • Reviewed build logs for anomalies on GitLab dev
  • Build passes on GitLab anvildev
  • Reviewed build logs for anomalies on GitLab anvildev
  • Ran _select dev.shared && make -C terraform/shared apply or this PR is not labeled deploy:shared
  • Ran _select anvildev.shared && make -C terraform/shared apply or this PR is not labeled deploy:shared
  • Deleted PR branch from GitHub
  • PR is assigned to only the operator
  • Deleted PR branch from GitLab dev
  • Deleted PR branch from GitLab anvildev
  • Status of linked issues is Lower, or Triage, if PR is partial

Operator (reindex)

  • Deindexed all unreferenced catalogs in dev or this PR is neither labeled reindex:partial nor reindex:dev
  • Deindexed all unreferenced catalogs in anvildev or this PR is neither labeled reindex:partial nor reindex:anvildev
  • Deindexed specific sources in dev or this PR is neither labeled reindex:partial nor reindex:dev
  • Deindexed specific sources in anvildev or this PR is neither labeled reindex:partial nor reindex:anvildev
  • Indexed specific sources in dev or this PR is neither labeled reindex:partial nor reindex:dev
  • Indexed specific sources in anvildev or this PR is neither labeled reindex:partial nor reindex:anvildev
  • Started reindex in dev or this PR does not require reindexing dev
  • Started reindex in anvildev or this PR does not require reindexing anvildev
  • Checked for, triaged and possibly requeued messages in both fail queues in dev or this PR does not require reindexing dev
  • Checked for, triaged and possibly requeued messages in both fail queues in anvildev or this PR does not require reindexing anvildev
  • Emptied fail queues in dev or this PR does not require reindexing dev
  • Emptied fail queues in anvildev or this PR does not require reindexing anvildev
  • Restarted the Data Browser pipeline for the ucsc/hca/dev branch on GitLab in dev or this PR does not require reindexing dev
  • Restarted the Data Browser pipeline for the ucsc/lungmap/dev branch on GitLab in dev or this PR does not require reindexing dev
  • Restarted deploy_browser job in the GitLab pipeline for this PR in dev or this PR does not require reindexing dev
  • Restarted the Data Browser pipeline for the ucsc/anvil/anvildev branch on GitLab in anvildev or this PR does not require reindexing anvildev
  • Restarted deploy_browser job in the GitLab pipeline for this PR in anvildev or this PR does not require reindexing anvildev

Operator (mirroring)

  • Started mirroring in dev or this PR is not labelled mirror:dev
  • Started mirroring in anvildev or this PR is not labelled mirror:anvildev
  • Checked for, triaged and possibly requeued messages in mirror fail queue in dev or this PR is not labelled mirror:dev
  • Checked for, triaged and possibly requeued messages in mirror fail queue in anvildev or this PR is not labelled mirror:anvildev
  • Emptied mirror fail queue in dev or this PR is not labelled mirror:dev
  • Emptied mirror fail queue in anvildev or this PR is not labelled mirror:anvildev

Operator

  • Propagated the deploy:shared, deploy:gitlab, deploy:runner, API, reindex:partial, reindex:anvilprod, reindex:prod, mirror:partial, mirror:anvilprod and mirror:prod labels to the next promotion PRs or this PR carries none of these labels
  • Propagated any specific instructions related to the deploy:shared, deploy:gitlab, deploy:runner, API, reindex:partial, reindex:anvilprod, reindex:prod, mirror:partial, mirror:anvilprod and mirror:prod labels, from the description of this PR to that of the next promotion PRs or this PR carries none of these labels
  • PR is assigned to no one

Shorthand for review comments

  • L line is too long
  • W line wrapping is wrong
  • Q bad quotes
  • F other formatting problem

@dsotirho-ucsc dsotirho-ucsc self-assigned this Mar 6, 2026
@dsotirho-ucsc dsotirho-ucsc linked an issue Mar 6, 2026 that may be closed by this pull request
@codecov
Copy link
Copy Markdown

codecov Bot commented Mar 6, 2026

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 84.91%. Comparing base (3aecff8) to head (93856b1).
⚠️ Report is 2 commits behind head on develop.

Additional details and impacted files
@@           Coverage Diff            @@
##           develop    #7836   +/-   ##
========================================
  Coverage    84.91%   84.91%           
========================================
  Files          161      161           
  Lines        23104    23104           
========================================
  Hits         19619    19619           
  Misses        3485     3485           

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:
  • ❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.
  • 📦 JS Bundle Analysis: Save yourself from yourself by tracking and limiting bundle sizes in JS merges.

@coveralls
Copy link
Copy Markdown

coveralls commented Mar 6, 2026

Coverage Status

coverage: 85.059%. remained the same
when pulling 93856b1 on issues/dsotirho-ucsc/7643-data-browser-documentation
into 3aecff8 on develop.

@dsotirho-ucsc dsotirho-ucsc force-pushed the issues/dsotirho-ucsc/7643-data-browser-documentation branch from 9584ac5 to b09e10a Compare March 6, 2026 19:08
Comment thread README.md Outdated
Comment on lines +907 to +908
and launch a local version of the Data Browser server. On macOS or Linux, a
specific Node.js version can be installed using the Node.js version management
Copy link
Copy Markdown
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
and launch a local version of the Data Browser server. On macOS or Linux, a
specific Node.js version can be installed using the Node.js version management
and launch a local version of the Data Browser's server. On macOS or Linux, a
specific Node.js version can also be installed using the Node.js version management

Comment thread README.md
Comment thread README.md Outdated
Comment on lines +922 to +931
For example, to locally run a Data Browser for the `dev` deployment and `hca`
atlas, open `.gitlab/sites/dev/hca/base.yaml` and note the variables
`data_browser_build_script` and `data_browser_build_env`. The values of these
variables, `build-ma-dev:hca-dcp` and `ma-dev` respectively, indicate that the
site config used is `.site-config/hca-dcp/ma-dev/config.ts`, and additionally
that the argument to specify in the `npm run` command is the one found in
`package.json` that references `./scripts/dev.sh hca-dcp ma-dev`. Note, it may
be possible that no such argument can be found, in which case you can
temporarily create a new argument by adding a line to `package.json` using one
of the existing lines that mentions `scripts/dev.sh` as a guide.
Copy link
Copy Markdown
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This could benefit from being broken down into enumerated, concise steps.

Comment thread README.md Outdated
Use the steps above to first locate the `base.yaml` file for a GitLab colocated
with your personal deployment, and then open the related `config.ts` site config
file. After setting `CATALOG` and `DATA_URL` to values appropriate for your
personal deployment, open `packages.json` to find (or add) the correct argument
Copy link
Copy Markdown
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
personal deployment, open `packages.json` to find (or add) the correct argument
personal deployment, open `package.json` to find (or add) the correct argument

Comment thread README.md Outdated
personal deployment, open `packages.json` to find (or add) the correct argument
to use, and then specify that argument in the `npm run` command.

Note that when run locally, the Data Browser might make duplicate requests to
Copy link
Copy Markdown
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
Note that when run locally, the Data Browser might make duplicate requests to
Note that when run locally, the Data Browser will make duplicate requests to

If it doesn't is because StrictMode isn't enabled, no?

@achave11-ucsc achave11-ucsc removed their assignment Mar 9, 2026
@dsotirho-ucsc dsotirho-ucsc force-pushed the issues/dsotirho-ucsc/7643-data-browser-documentation branch from b09e10a to 33e7518 Compare March 11, 2026 16:56
@dsotirho-ucsc
Copy link
Copy Markdown
Contributor Author

7836_IT_2026-03-11.txt

Comment thread README.md
Comment thread README.md Outdated
3. If no such line can be found in `/package.json`, you can temporarily add one
using one of the existing lines that mention `dev.sh` as a guide. Be sure to
give the new line a unique key.
4. Run `npm run KEY`, where `KEY` is the key of the line you found, or added.
Copy link
Copy Markdown
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Naming convention of variables should be consistent.

Suggested change
4. Run `npm run KEY`, where `KEY` is the key of the line you found, or added.
4. Run `npm run {key}`, where `{key}` is the key of the line you found, or added.

Comment thread README.md Outdated
Comment on lines +943 to +945
2. Locate and open `/.site-config/FOO/BAR/config.ts`, where `FOO` is the second
part of the `data_browser_build_script` value, and `BAR` is the
`data_browser_build_env` value.
Copy link
Copy Markdown
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Consistency

Suggested change
2. Locate and open `/.site-config/FOO/BAR/config.ts`, where `FOO` is the second
part of the `data_browser_build_script` value, and `BAR` is the
`data_browser_build_env` value.
2. Locate and open `/.site-config/{foo}/{bar}/config.ts`, where `{foo}` is the second
part of the `data_browser_build_script` value, and `{bar}` is the
`data_browser_build_env` value.

Comment thread README.md Outdated
4. Open `/package.json`, and under the `scripts` section, find (or add) a line
that mentions `dev.sh` and the values obtained in step 1 (e.g. `hca-dcp` and
`ma-dev`).
5. Run `npm run KEY`, where `KEY` is the key of the line you found, or added.
Copy link
Copy Markdown
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Same

Suggested change
5. Run `npm run KEY`, where `KEY` is the key of the line you found, or added.
5. Run `npm run {key}`, where `{key}` is the key of the line you found, or added.

@achave11-ucsc achave11-ucsc removed their assignment Mar 12, 2026
@dsotirho-ucsc dsotirho-ucsc force-pushed the issues/dsotirho-ucsc/7643-data-browser-documentation branch from 33e7518 to 52685d8 Compare March 12, 2026 17:59
@dsotirho-ucsc
Copy link
Copy Markdown
Contributor Author

7836_IT_2026-03-12.txt

Copy link
Copy Markdown
Member

@achave11-ucsc achave11-ucsc left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Otherwise looks good!

Also, you may want to drop the quotes on "[n]", makes it sound like the name of the tool is [n].

Comment thread README.md Outdated
(`hca-dcp`) that is needed for our purposes.
3. Open `/package.json`, and under the `scripts` section, find the key of a
value that mentions `dev.sh hca-dcp ma-dev`, which is the script used when
running the Data Browser locally, and the two values obtained in step 1.
Copy link
Copy Markdown
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
running the Data Browser locally, and the two values obtained in step 1.
running the Data Browser locally, and the two values obtained in step 2.

@achave11-ucsc achave11-ucsc removed their assignment Mar 13, 2026
@dsotirho-ucsc dsotirho-ucsc force-pushed the issues/dsotirho-ucsc/7643-data-browser-documentation branch from 52685d8 to c7ce9c4 Compare March 16, 2026 22:06
@dsotirho-ucsc
Copy link
Copy Markdown
Contributor Author

Also, you may want to drop the quotes on "[n]", makes it sound like the name of the tool is [n].

The square brackets are for the markdown. When rendered it looks like:
Screenshot 2026-03-16 at 3 09 02 PM

@dsotirho-ucsc
Copy link
Copy Markdown
Contributor Author

7836_IT_2026-03-16.txt

achave11-ucsc
achave11-ucsc previously approved these changes Mar 16, 2026
Copy link
Copy Markdown
Member

@achave11-ucsc achave11-ucsc left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Approved ✅

@achave11-ucsc achave11-ucsc marked this pull request as ready for review March 16, 2026 23:38
@achave11-ucsc achave11-ucsc removed their assignment Mar 16, 2026
Comment thread README.md
@hannes-ucsc hannes-ucsc added the 1 review [process] Lead requested changes once label Mar 16, 2026
@hannes-ucsc hannes-ucsc removed their assignment Mar 16, 2026
@dsotirho-ucsc dsotirho-ucsc force-pushed the issues/dsotirho-ucsc/7643-data-browser-documentation branch from c7ce9c4 to c01b929 Compare March 17, 2026 00:41
@dsotirho-ucsc
Copy link
Copy Markdown
Contributor Author

7836_IT_2026-03-16.txt

@hannes-ucsc hannes-ucsc added the no sandbox [process] PR will not be tested in the sandbox label Mar 18, 2026
@hannes-ucsc
Copy link
Copy Markdown
Member

Security design review

  • Security design review completed; this PR does not
    • … affect authentication; for example:
      • OAuth 2.0 with the application (API or Swagger UI)
      • Authentication of developers with Google Cloud APIs
      • Authentication of developers with AWS APIs
      • Authentication with a GitLab instance in the system
      • Password and 2FA authentication with GitHub
      • API access token authentication with GitHub
      • Authentication with Terra
    • … affect the permissions of internal users like access to
      • Cloud resources on AWS and GCP
      • GitLab repositories, projects and groups, administration
      • an EC2 instance via SSH
      • GitHub issues, pull requests, commits, commit statuses, wikis, repositories, organizations
    • … affect the permissions of external users like access to
      • TDR snapshots
    • … affect permissions of service or bot accounts
      • Cloud resources on AWS and GCP
    • … affect audit logging in the system, like
      • adding, removing or changing a log message that represents an auditable event
      • changing the routing of log messages through the system
    • … affect monitoring of the system
    • … introduce a new software dependency like
      • Python packages on PYPI
      • Command-line utilities
      • Docker images
      • Terraform providers
    • … add an interface that exposes sensitive or confidential data at the security boundary
    • … affect the encryption of data at rest
    • … require persistence of sensitive or confidential data that might require encryption at rest
    • … require unencrypted transmission of data within the security boundary
    • … affect the network security layer; for example by
      • modifying, adding or removing firewall rules
      • modifying, adding or removing security groups
      • changing or adding a port a service, proxy or load balancer listens on
  • Documentation on any unchecked boxes is provided in comments below

@hannes-ucsc hannes-ucsc removed their assignment Mar 18, 2026
@dsotirho-ucsc dsotirho-ucsc force-pushed the issues/dsotirho-ucsc/7643-data-browser-documentation branch from c01b929 to 93856b1 Compare March 18, 2026 17:26
@hannes-ucsc hannes-ucsc merged commit bdd3f67 into develop Mar 18, 2026
8 checks passed
@dsotirho-ucsc dsotirho-ucsc deleted the issues/dsotirho-ucsc/7643-data-browser-documentation branch March 18, 2026 21:10
@dsotirho-ucsc dsotirho-ucsc removed their assignment Mar 18, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

1 review [process] Lead requested changes once no sandbox [process] PR will not be tested in the sandbox

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Document procedure to run Data Browser locally

4 participants