Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Added changes needed for the Docker in Docker execution #32

Merged
merged 3 commits into from Jun 17, 2021

Conversation

SergeyPod
Copy link
Contributor

No description provided.

Copy link
Collaborator

@MichelleGower MichelleGower left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Some questions/comments about the yaml file if it is supposed to be used by others. Changes approved for merging.

project: dev
campaign: quick
campaign: "G3W06"
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I'm not sure what G3W06 means. Guessing it would be a Gen3 run using weekly stack w_2021_06 (which I don't think is the case in the rest of the yaml). Since example yaml, it doesn't have to be real, but I would try to not make it confusing.

submitPath: ${PWD}/submit/{outCollection}
container_obs_panda_edge_node_dir: "/opt/lsst/software/stack/stack/current/Linux64/ctrl_bps/21.0.0-6-g55737e1+e68d6e10c0/python/lsst/ctrl/bps/wms/panda/edgenode"
container_obs_panda_edge_node_dir: /opt/lsst/software/stack/stack/miniconda3-py38_4.9.2-0.6.0/Linux64/ctrl_bps/21.0.0-18-gf2cd492+6c749b2ca5/python/lsst/ctrl/bps/wms/panda/edgenode
container_CTRL_MPEXEC_DIR: /opt/lsst/software/stack/stack/miniconda3-py38_4.9.2-0.6.0/Linux64/ctrl_mpexec/21.0.0-30-g82f2559+c64cb64c6b/
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Note: When the lazy command line changes are made in bps, the need for this should go away.

butlerConfig: s3://butler-us-central1-panda-dev/hsc/butler.yaml
inCollection: HSC/defaults
output: "RC2/w_2021_13/FULLRC2"
outCollection: "shared/pipecheck/{timestamp}"
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Should the example be for a non-production run and let the production folks change it correctly for production runs? If so the outCollection needs to follow the pattern in https://dmtn-167.lsst.io/ (pattern: u//xyz)

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Also I'm not sure what pipetask does if the run collection (outCollection) isn't an "extension" of the chained output collection (output). In the main bps example yaml file (the run collection is just the output collection with a run timestamp):
output: "u/${USER}/pipelines_check"
outCollection: "{output}/{timestamp}"

wmsServiceClass: lsst.ctrl.bps.wms.panda.panda_service.PanDAService
clusterAlgorithm: lsst.ctrl.bps.quantum_clustering_funcs.single_quantum_clustering
createQuantumGraph: '${CTRL_MPEXEC_DIR}/bin/pipetask qgraph -d "{dataQuery}" -b {butlerConfig} -i {inCollection} -p {pipelineYaml} -q {qgraphFile}'
runQuantumCommand: 'pipetask run -b {butlerConfig} --input {inCollection} --output-run {outCollection} --extend-run --skip-init-writes --clobber-partial-outputs --qgraph {bucket}/{payload_folder}/{uniqProcName}/${{IN/L}}'
createQuantumGraph: '${CTRL_MPEXEC_DIR}/bin/pipetask qgraph -d "{dataQuery}" -b {butlerConfig} -i {inCollection} -p {pipelineYaml} -q {qgraphFile} --qgraph-dot {qgraphFile}.dot --instrument lsst.obs.subaru.HyperSuprimeCam'
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I am surprised to see instrument on the command line. That's normally defined in the pipeline yaml. Not that it is wrong, but if the bps yaml doesn't need it, then it's one less place to change it.

outCollection: "shared/pipecheck/{timestamp}"
dataQuery: "skymap='discrete/ci_hsc' AND tract=0 AND patch=69"
#Very small (~150 jobs) workflow
dataQuery: "tract = 9615 and patch=30 and detector IN (10..20) and instrument='HSC' and skymap='hsc_rings_v1' and band in ('r', 'i')"
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Is instrument needed here since the given inCollection limits it to that instrument?

@SergeyPod SergeyPod merged commit d2d8cab into master Jun 17, 2021
@SergeyPod SergeyPod deleted the tickets/DM-27260 branch June 17, 2021 19:41
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

3 participants