Utility to perform exploration, archiving and curation of local NGS data.
The initial aim is to automate each stage so that it is easy to return to a partially processed directory and be able to review and pick up again with minimal effort.
A secondary aim is to automate the process of examining data to uncover relationships between data files, understand and annotate the contents, and safely remove files that are no longer required.
Depends on the genomics-bcftbx
and auto_process_ngs
modules.
To create test data use the make_mock_solid_dir.py` utility from the
``genomics-bcftbx
repo.
Example protocol:
Create a working copy:
arqvist stage solid0123_20111014_FRAG_BC staging
Examine primary data:
arqvist primary_data staging/solid0123_20111014_FRAG_BC
Compress primary data files using bzip2
:
arqvist compress staging/solid0123_20111014_FRAG_BC
See the documentation under docs
for more information.