Skip to content

Schedule the processing of ENA sequences and upload to S3 public dataset

Notifications You must be signed in to change notification settings

ga4gh/refget-loader

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

42 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Refget Loader

Load reference sequences to Public Cloud Storage from a variety of data sources

Overview

Supported Cloud Environments

  • AWS S3

Supported Data Sources

  • ENA Assemblies

Getting Started

Prerequisites

  • Install the AWS Command Line Interface, and configure the CLI to run with an IAM user/profile that has write access to the S3 bucket of interest
  • Install the ena-refget-processor using the instructions provided, the scheduler will make use of its load_expanded_con.pl script

Installation

Clone repo and install locally

git clone https://github.com/ga4gh/ena-refget-scheduler.git
cd ena-refget-scheduler
python setup.py install

Confirm the scheduler has been installed by issuing:

ena-refget-scheduler

Usage

View / Modify Settings

View / Modify Upload Checkpoint

Schedule Upload Jobs

About

Schedule the processing of ENA sequences and upload to S3 public dataset

Topics

Resources

Stars

Watchers

Forks

Packages

No packages published

Languages