@enryH enryH commented Mar 9, 2023

Pride data release preparation.

  • pride submission scripts -> 00_0_*.ipynb

  • update erda (FTP server) notebooks, build dumps for pride of 7,444 selected files -> erda_*.ipynb

  • add python version of notebooks for better diffs in the future

Henry added 25 commits March 9, 2023 10:35
- save some intermediate files
- visualize some stats
- prepare upload to pride (with unified and anonymized identifiers)
- data curation
- allow selection of raw data
- only load intensities
- transpose and create mask view in separate document
- dump counts for samples and features
- create machine specific subfolders for pride
- instrument_name added for subfolders
- don't support long data for now
- skip categorical data checking

keep old code as comments for now (as a reminder)
- was at some point used to investigate which data to use
- notebook is for exploration of single MaxQuant folder
- erda notebooks create dumps
  which are then processed in "hela" notebooks
- rename and describe
- create folder and put commands for raw files
- use -f for using commands read from a file with lftp
- start uploading
- sanity checks and upload missing or incomplete files
- rename sample names in MQ output
technology type -> indicates that it is not RNA (MAGE-TAB format)
- all 7444 selected files for upload are used to create unified dumps
- small plotting improvements and minor other changes
- check all files are in list of files (queried from server)
- create some dummy files (placeholders) locally for pride submission tool
- manually annotate the submission.px text file from the
  submission tool (basically: add files)
- 🐛 SDRF file had ontology issues
  (and cellline template was not enough)
- plots based on metadata
- metadata is provided on pride ("pride_metadata.csv")
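
For readers who have not seen the notebooks, the upload-related commits above (commands read by lftp with `-f`, the check against the server file list, and the local placeholder files) could be sketched roughly as follows. This is an illustration only, not the code in this PR: the function names, the instrument-keyed dictionary, and the file names are all hypothetical, and the `open` command that connects lftp to the server is left out.

```python
from pathlib import Path


def find_missing(selected, on_server):
    """Return the selected file names that are absent from the server listing."""
    return sorted(set(selected) - set(on_server))


def write_lftp_commands(files_by_instrument, command_file):
    """Write one mkdir per instrument subfolder and one put per file.

    The resulting text file is meant to be run with `lftp -f <command_file>`.
    Hypothetical helper: the real notebooks may build the commands differently.
    """
    lines = []
    for instrument, files in sorted(files_by_instrument.items()):
        # create the instrument-specific subfolder on the server
        lines.append(f'mkdir -p "{instrument}"')
        for local_path in files:
            # -O sets the remote base directory the file is placed in
            lines.append(f'put -O "{instrument}" "{local_path}"')
    Path(command_file).write_text("\n".join(lines) + "\n")
    return lines


def create_placeholders(file_names, folder):
    """Create empty local files so a submission tool can list them."""
    folder = Path(folder)
    folder.mkdir(parents=True, exist_ok=True)
    created = []
    for name in file_names:
        path = folder / name
        path.touch(exist_ok=True)  # zero-byte placeholder, existing files are kept
        created.append(path)
    return created
```

Running `find_missing` on the selected files and the listing queried from the server gives the files that still need a `put` line, so a re-run only uploads what is missing.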

Split metadata creation from analysis
- relevant information of mq_summaries.csv also provided
  in metadata_pride.csv
@enryH enryH changed the base branch from dev to extend_comparison May 16, 2023 16:39
@enryH enryH changed the base branch from extend_comparison to dev May 16, 2023 16:40
@enryH enryH changed the title from "Pride prep" to "Pride submission prep" May 16, 2023
@enryH enryH merged commit 85c06f6 into dev May 17, 2023
@enryH enryH deleted the pride_prep branch May 18, 2023 12:40