Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Import SCC public RNA-Seq & ChipSeq Datasets #1706

Closed
5 tasks done
scottx611x opened this issue May 1, 2017 · 0 comments
Closed
5 tasks done

Import SCC public RNA-Seq & ChipSeq Datasets #1706

scottx611x opened this issue May 1, 2017 · 0 comments

Comments

@scottx611x
Copy link
Member

scottx611x commented May 1, 2017

  • Get list of datasets to import from @sjhosui
  • Figure out which failed to import
  • Figure out which user should be importing said Datasets
  • Bring up test stack w/ latest beta snapshots and attempt the importing there
  • If the above is successful, then beta EC2 will need to be resized to m4.large, have the new datasets imported, and resized back down to an m3.medium

Total of 135 unique public ISA IDs from this list:
https://www.dropbox.com/s/663mcw93citqyo7/public_isa_locations.txt?dl=0

1 Duplicated ISA ID:
15771

120 had ISATabs :
https://www.dropbox.com/sh/5m203bg3hh77cup/AABwvbdqQgZqw682

15 have missing ISATabs:

7657 - ???
8336 - no data
9260 - 404
9787 - no data
11370 - no data
11380 - no data
11627 - no data
11654 - no data
11678 - no data
11687 - no data
11744 - no data
11807 - no data
12952 - no raw data
13363 - no data
15983 - DHS-Seq
1 failed to import into Refinery:

ISATab:
https://www.dropbox.com/s/qnlstjalvgnif5y/isa_10126_799887.zip?dl=0

Traceback: https://docs.google.com/document/d/1p5lokqfmyYpzKMQT2SqXqE2H_bfwKzeWWYFssid6Kz0/edit?usp=sharing

General notes:

  • isa_11445_847064.zip is 49kb and takes a VERY long time to import (sometimes has to be done separately outside of the batch imports)
  • Bumped VM memory to 8G to successfully batch import
  • Import of the 119 Datasets took roughly: 2hrs
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

3 participants