Skip to content

Download data from GDC to AWS EC2

Pan Deng edited this page May 23, 2018 · 2 revisions

1. Data source:

GDC repository open access files

For data downloading, apply filters and download:

  • JSON file: contatins case id and other meta information about all the datasets filtered
  • manifest file: contains md5 for downloading datasets

Then, download the GDC data transfer tool

Reference genome:

Description and Download

2. Information transfer

Setup connection with AWS control machine via FileZilla (Tutorial)

Transfer gdc-client to control machine

Transfer manifest and JSON files to control machine

3. Dataset download

path/to/gdc-client download -m path/to/manifest/files .