Skip to content

Conversation

@eweitz
Copy link
Member

@eweitz eweitz commented Jun 8, 2020

This copies code from single_cell_portal to Ingest Pipeline. Code for generating species and genome reference data has been moved from /scripts/genomes there to a new directory here, /ingest/genomes. And make_toy_data.py has been copied to /ingest here.

The genome pipeline in /ingest/genomes here will need to be updated to fix an igv.js issue. That pipeline isn't intended for technical end-users, so it doesn't fit in single_cell_portal, and having that and make_toy_data.py here simplifies dependency management for make_toy_data.py. More background is available in broadinstitute/single_cell_portal#99.

To complete the move, I've also opened two PRs (broadinstitute/single_cell_portal_core#621 and broadinstitute/single_cell_portal#101) to remove the slightly different(!) copies of this code in those places. That will resolve the biggest tech debt in the genome pipeline and satisfy SCP-2470. Other pre-existing tech debt is documented and scoped in SCP-2491.

This PR and the two noted above satisfy SCP-2470.

@codecov
Copy link

codecov bot commented Jun 8, 2020

Codecov Report

Merging #102 into master will decrease coverage by 24.70%.
The diff coverage is 0.00%.

Impacted file tree graph

@@             Coverage Diff             @@
##           master     #102       +/-   ##
===========================================
- Coverage   75.27%   50.56%   -24.71%     
===========================================
  Files          13       20        +7     
  Lines        1662     2474      +812     
===========================================
  Hits         1251     1251               
- Misses        411     1223      +812     
Impacted Files Coverage Δ
ingest/genomes/fetch_upstream_assemblies.py 0.00% <0.00%> (ø)
ingest/genomes/parse_genome_annotations.py 0.00% <0.00%> (ø)
ingest/genomes/parse_genome_assemblies.py 0.00% <0.00%> (ø)
ingest/genomes/persist_annotation_metadata.py 0.00% <0.00%> (ø)
ingest/genomes/process_genome_references.py 0.00% <0.00%> (ø)
ingest/genomes/utils.py 0.00% <0.00%> (ø)
ingest/make_toy_data.py 0.00% <0.00%> (ø)
... and 4 more

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update b0e37c6...11ecf0a. Read the comment docs.

Copy link
Contributor

@bistline bistline left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Just a question about the HSTlib dependencies - will you add something about that to the README?

"""
# Example:
# $ sort -k1,1 -k4,4n gencode.vM17.annotation.gtf > gencode.vM17.annotation.possorted.gtf
# $ bgzip gencode.vM17.annotation.possorted.gtf
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I have the same comment here that I had in https://github.com/broadinstitute/single_cell_portal/pull/99/files#r434173376. You mentioned a README update in this repo to address it?

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Done in 11ecf0a. I had scoped that update among other pre-existing tech debt in SCP-2491, but this was a trivial change.

@eweitz eweitz merged commit d71c9b0 into master Jun 9, 2020
@bistline bistline deleted the ew-move-genomes branch May 21, 2021 15:28
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

5 participants