Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

RFC84: Data Entries Removal #46

Merged
merged 1 commit into from
Jul 17, 2024
Merged

RFC84: Data Entries Removal #46

merged 1 commit into from
Jul 17, 2024

Conversation

forus
Copy link
Contributor

@forus forus commented Jun 5, 2024

In the current implementation of cBioPortal, the functionality for data management is limited to the deletion of entire studies, as documented here. This approach lacks granularity and does not provide users with the ability to selectively remove individual entries associated with a study, such as patients, samples, or specific molecular data profiles. This limitation restricts the flexibility needed for detailed data management and maintenance, leading to potential inefficiencies and challenges in study data handling.

To address this limitation, this PR extends the capabilities of the cbioportalImporter.py script and includes additional commands to the java tool that allow for the selective removal of patient or sample.

See more details in the RFC84 Data Entries Removal document.

Reminder: Do not forget to enable the following PR cBioPortal/cbioportal#10820

@forus forus requested review from JREastonMarks, pieterlukasse and haynescd and removed request for JREastonMarks June 5, 2024 09:58
@forus forus changed the base branch from inc-seg to rfc79-feedback June 11, 2024 19:15
Base automatically changed from rfc79-feedback to rfc79 June 19, 2024 14:23
Copy link
Member

@pieterlukasse pieterlukasse left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

thanks @forus ! I added some comments. Please take a look

pieterlukasse
pieterlukasse previously approved these changes Jun 28, 2024
Add command to remove sample to python wrapper

Implment sample removal command

Remove sample from the tab delimited tables as well

Do samples removal in a transaction

Move removing sample everywhere in study to respective DAOs

To be able to reuse this functionality

Refactor sample removal test by using stream of

Test and fix edge cases of sample removal command

Add command to remove patient to python wrapper

Add java command to remove patient in a study

Increase test independence

Adding or removing profile/sample should break only one test

Fix patient id to sample ids mapping bug

Throw exception when sample with GSVA is removed

Throw exception when generic profile samples list is empty

Refactor patient and sample removal code

Remove unused logger and imports

Remove unused code from cna discrete long data tests
@forus forus merged commit 2929643 into main Jul 17, 2024
4 checks passed
@forus forus deleted the rfc84 branch July 17, 2024 07:45
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

2 participants