-
Notifications
You must be signed in to change notification settings - Fork 433
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
hide fusion gene table in the summary view if no fusion data present … #6731
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Where the logic to determine the study has fusion sample list?
@zhx828 , in SampleServiceImpl processSample method. |
@alisman , here is the backend PR. |
Hi @sheridancbio , would you be able to review this PR. It will fix the problem reported by Tali. Thank you! |
@kalletlak do frontned/backend need to be deployed together? I.E. will either break if they deployed separately? |
@khzhu let me rephrase, where the logic(script) to populate the sample list if the study has the fusion data in the database? |
@alisman Yes |
Hi @zhx828 , I did not change any existing logic to populate the sample list other than added a new case list cases_fusion to list all samples that have fusion data.
Hope it answers your question. |
@khzhu if this is the case, @sheridancbio could you point Kelsey to the pipeline code where the logic determines whether the fusion list should be populated? @khzhu could you help to make a patch to that code too? |
@zhx828 , will do. |
@zhx828 @khzhu I'm reading up on this now. From a first read-through it looks like you introduce a new standard sample list to hold all samples which underwent fusion profiling. Then the presence or absence of list elements determines whether this study has any fusion data to display or not. Two very quick thoughts:
Just to be clear ... it is possible in your minds that there can be samples on the _fusion case list for which no fusions were detected. (in other words .. profiled but nothing found) ? cc: @n1zea144 |
hi @sheridancbio , thank for your inputs.
yes, that is right.
it will not be a required file.
we will not add cases_fusion.txt file for both. cases_fusion.txt will be eventually deprecated once we started to use structural variants. |
My two cents - |
one minor comment: should we use cases_fusion or cases_sv? (assume we will use the same case list after switch to the sv implementation) |
REPLACE INTO sample_list(STABLE_ID, CATEGORY, CANCER_STUDY_ID, NAME, DESCRIPTION) | ||
SELECT DISTINCT CONCAT(CANCER_STUDY_IDENTIFIER, '_fusion'), 'all_cases_with_mutation_data', `cancer_study`.CANCER_STUDY_ID, 'Samples with fusion data','Samples with fusion data' | ||
FROM `mutation_event` | ||
JOIN `mutation` ON `mutation`.MUTATION_EVENT_ID = `mutation_event`.MUTATION_EVENT_ID | ||
JOIN `genetic_profile` ON `genetic_profile`.GENETIC_PROFILE_ID = `mutation`.GENETIC_PROFILE_ID | ||
JOIN `cancer_study` ON `cancer_study`.CANCER_STUDY_ID = `genetic_profile`.CANCER_STUDY_ID | ||
WHERE MUTATION_TYPE = 'fusion'; | ||
REPLACE INTO sample_list_list(list_id, sample_id) | ||
SELECT DISTINCT list_id, SAMPLE_ID | ||
FROM `mutation_event` | ||
JOIN `mutation` ON `mutation`.MUTATION_EVENT_ID = `mutation_event`.MUTATION_EVENT_ID | ||
JOIN `genetic_profile` ON `genetic_profile`.GENETIC_PROFILE_ID = `mutation`.GENETIC_PROFILE_ID | ||
JOIN `cancer_study` ON `cancer_study`.CANCER_STUDY_ID = `genetic_profile`.CANCER_STUDY_ID | ||
JOIN `sample_list` ON `sample_list`.CANCER_STUDY_ID = `cancer_study`.CANCER_STUDY_ID AND `sample_list`.STABLE_ID LIKE '%fusion%' | ||
WHERE MUTATION_TYPE = 'fusion'; |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
My understanding of this code is that a default _fusion sample list will be created which is the list of samples that have fusion data. There is a caveat: some profiled samples may be fusion negative. For example, many MSK-IMPACT samples do not have fusions but they were all profiled.
This might be fine as the default one for migration, but we should generate proper _fusion case list for our studies.
Alternatively, I am wondering if we should just copy _sequenced to _fusion when there are fusion data?
@n1zea144 @sheridancbio thoughts?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@n1zea144 @SRodenburg , any inputs on this? thanks!
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Hi @jjgao Rob and I were talking about this too. Using MSK-IMPACT/MSK-ARCHER as an example, I'm its not entirely accurate to assume that if a sample was sequenced for mutation calling, that it was also analyzed for fusions (although it is true that with IMPACT, there are some probes looking for canonical fusions). However, it seems excessive to have a separate list for cases that were checked for SVs, and then another case list for samples in which SVs were found, so carrying over the sequenced sample list to fusions may be a good compromise.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
thanks, @n1zea144 !
but how do we know if there are fusions or not at the importing stage to decide if we should carry the sequenced sample list over to fusions?
I tried JJ's alternative approach and tested on msk_impact_2017 dataset. And, it worked well.
|
@khzhu I am not sure if I understand correctly, but I think we should:
This will ensure that our current database will work after migration, and our current data files will work. Also once we put _fusion case lists, they will also work. BTW, should we call it _sv case list? |
0f16cb3
to
bcff7cb
Compare
will update migration sql statement
sounds good.
go with _sv |
bcff7cb
to
0ccf56a
Compare
Hi @jjgao @n1zea144 @sheridancbio ,
may I please have your final review on the changes. Thanks! |
044f1c1
to
beb2d74
Compare
hi @jjgao @n1zea144 @sheridancbio @zhx828 |
beb2d74
to
0c6b702
Compare
0c6b702
to
cdd9169
Compare
…and fix fusion gene count
cdd9169
to
289a918
Compare
…s are stored in the mutation_event table and a new filter for fusion data and a new case list for sv data
289a918
to
22d6a70
Compare
Hi @jjgao , all work has been done and I am waiting for @n1zea144 or @sheridancbio to complete their reviews. Thanks! |
I created a new PR rebased to release-3.2.0: #6945. Please do further review there |
hide the fusion gene table in the summary view if no fusion data present and fix the number of samples for fusion.
Fix #6709
Describe changes proposed in this pull request:
Checks
Any screenshots or GIFs?
If this is a new visual feature please add a before/after screenshot or gif
here with e.g. GifGrabber.
Notify reviewers
Read our Pull request merging
policy. It can help to figure out who worked on the
file before you. Please use
git blame <filename>
to determine thatand notify them either through slack or by assigning them as a reviewer on the PR