Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Error running BCFTOOLS #109

Closed
DBS-Max opened this issue Feb 13, 2020 · 3 comments
Closed

Error running BCFTOOLS #109

DBS-Max opened this issue Feb 13, 2020 · 3 comments

Comments

@DBS-Max
Copy link

DBS-Max commented Feb 13, 2020

I am getting the following error:

2020-02-13 22:21:21,214 WARNING  No reference file found at default locations. You can set the environment variable 'HGREF' or 'HG19' to point to a suitable Fasta file.
[W] overlapping records at chr1:20626004 for sample 0
[W] Variants that overlap on the reference allele: 109
[I] Total VCF records:         6616564
[I] Non-reference VCF records: 4691787
2020-02-13 22:22:54,382 ERROR    Command line view /tmp/tmpZpFEtyvcf.gz -R /data/reference_genome/panel_regions/all.bed.gz -o /tmp/truth.ppgUpq8m.vcf.gz -O z got return code 127.
STDOUT: 
STDERR: /bin/sh: 1: view: not found

2020-02-13 22:22:54,383 ERROR    Traceback (most recent call last):
2020-02-13 22:22:54,383 ERROR      File "/home/maxh/hap.py/bin/hap.py", line 529, in <module>
2020-02-13 22:22:54,384 ERROR        main()
2020-02-13 22:22:54,384 ERROR      File "/home/maxh/hap.py/bin/hap.py", line 309, in main
2020-02-13 22:22:54,384 ERROR        convert_gvcf_to_vcf=args.convert_gvcf_truth)
2020-02-13 22:22:54,385 ERROR      File "/home/maxh/hap.py/bin/pre.py", line 195, in preprocess
2020-02-13 22:22:54,385 ERROR        num_threads=threads)
2020-02-13 22:22:54,385 ERROR      File "/home/maxh/hap.py/lib/python27/Tools/bcftools.py", line 266, in preprocessVCF
2020-02-13 22:22:54,386 ERROR        raise ex
2020-02-13 22:22:54,386 ERROR    Exception: Command line view /tmp/tmpZpFEtyvcf.gz -R /data/reference_genome/panel_regions/all.bed.gz -o /tmp/truth.ppgUpq8m.vcf.gz -O z got return code 127.STDOUT: STDERR: /bin/sh: 1: view: not found
Hap.py v0.3.12
Error running BCFTOOLS; please check your file for compatibility issues issues using vcfcheck

I made my own docker container since the pkrusche one is 3 years old and I need hap.py to work with Octopus output. I had the smae issue with the provided dockerfiles though.

This is my dockerfile code:

FROM cannabinoid/base_conda_py2:0.1

RUN sudo apt-get update \
    && sudo apt-get install -y \
        build-essential \
        cmake \
        default-jdk \
        pkg-config \
        autoconf \
        bzip2 \
        libbz2-dev \
        libncurses5-dev \
        software-properties-common \
        zlib1g-dev

RUN conda update conda \
    && conda install \
        cython \
        scipy \
        numpy \
        pandas

RUN pip install pybedtools pysam bx-python

RUN sudo wget http://archive.apache.org/dist/ant/binaries/apache-ant-1.9.7-bin.tar.gz \
    && sudo tar xzf apache-ant-1.9.7-bin.tar.gz \
    && sudo rm apache-ant-1.9.7-bin.tar.gz
ENV PATH $PATH:/home/$USERNAME/apache-ant-1.9.7/bin

RUN sudo git clone https://github.com/Illumina/hap.py.git

RUN cd hap.py \
    && sudo python install.py /home/$USERNAME/hap.py --with-rtgtools --no-tests
ENV PATH $PATH:/home/$USERNAME/hap.py/bin
@DBS-Max
Copy link
Author

DBS-Max commented Feb 13, 2020

I tested with the pkrusche container with the same inputs and that container analyses fine.

The command I use is:

nohup docker run -v /media/maxh/slow_storage/human_data:/data pkrusche/hap.py \
    /opt/hap.py/bin/hap.py \
        -r /data/reference_genome/GRCh38_full_analysis_set_plus_decoy_hla.fa \
        --engine=vcfeval \
        --engine-vcfeval-template /data/reference_genome/GRCh38_full_analysis_set_plus_decoy_hla_sdf \
        --threads 5 \
        -R /data/reference_genome/panel_regions/all.bed.gz \
        -o /data/bwa_strelka2/eval/panel_regions_only_69_model_9_B82064 \
        /data/bwa_deepvariant/bcgc_30x/B82064_sorted_md.vcf.gz \
        /data/bwa_strelka2/panel_regions_only_69_model_9/results/variants/B82064.vcf.gz &

@dskola
Copy link
Contributor

dskola commented Feb 13, 2020

Hi @DBS-Max . This looks like a simple fix for now although all this command line manipulation code is in bad need of a refactoring. Can you try branch https://github.com/Illumina/hap.py/tree/issue_109 and see if it is fixed?

@DBS-Max
Copy link
Author

DBS-Max commented Feb 13, 2020

That fixed it! Thanks for the quick reply :).

@DBS-Max DBS-Max closed this as completed Feb 13, 2020
nicholas-owen pushed a commit to nicholas-owen/hap.py that referenced this issue Mar 17, 2022
nicholas-owen pushed a commit to nicholas-owen/hap.py that referenced this issue Mar 17, 2022
* Update RELEASES.md (Illumina#87)

* Updated RELASES.md and normalization.md (Illumina#109)
nicholas-owen pushed a commit to nicholas-owen/hap.py that referenced this issue Mar 17, 2022
* Update RELEASES.md (Illumina#87)

* Updated RELASES.md and normalization.md (Illumina#109)

* Update more docs (Illumina#111)

* Update happy.md

* Cleaned up normalisation.md

* Update happy.md

* Created a new section for working with genome VCFs
  * Added description of --filter-nonref
  * Updated description of --convert-gvcf-xxxx options
  * Added note toward beginning of document calling attention to genome VCF section
* Fixed broken links in TOC
* Fixed inconsistent case in headings
nicholas-owen pushed a commit to nicholas-owen/hap.py that referenced this issue Mar 17, 2022
* Documentation updates for 0.3.12 release (Illumina#110)

* Update RELEASES.md (Illumina#87)

* Updated RELASES.md and normalization.md (Illumina#109)

* Merge doc changes from dev (Illumina#112)

* Update RELEASES.md (Illumina#87)

* Updated RELASES.md and normalization.md (Illumina#109)

* Update more docs (Illumina#111)

* Update happy.md

* Cleaned up normalisation.md

* Update happy.md

* Created a new section for working with genome VCFs
  * Added description of --filter-nonref
  * Updated description of --convert-gvcf-xxxx options
  * Added note toward beginning of document calling attention to genome VCF section
* Fixed broken links in TOC
* Fixed inconsistent case in headings
nicholas-owen pushed a commit to nicholas-owen/hap.py that referenced this issue Mar 17, 2022
* fix bold GFM rendering

Without a space between the *** and the : , Github won't render *** as bold.

* typos?

* Clarify that INFO.VQSLOD is required for --roc

* Update read me with GCC/G++ 4.9.2+ requirement

See Illumina#66

* Remove regex dependency + fix test error with gcc 7.3

* Fix Docker build

* Update rtg-tools to work with new Java

* Documentation updates for 0.3.12 release (Illumina#110)

* Update RELEASES.md (Illumina#87)

* Updated RELASES.md and normalization.md (Illumina#109)

* Merge doc changes from dev (Illumina#112)

* Update RELEASES.md (Illumina#87)

* Updated RELASES.md and normalization.md (Illumina#109)

* Update more docs (Illumina#111)

* Update happy.md

* Cleaned up normalisation.md

* Update happy.md

* Created a new section for working with genome VCFs
  * Added description of --filter-nonref
  * Updated description of --convert-gvcf-xxxx options
  * Added note toward beginning of document calling attention to genome VCF section
* Fixed broken links in TOC
* Fixed inconsistent case in headings

* Resolved merge conflict

* Remove seemingly-unused ENV command

* Restore ENV DEBIAN_FRONTEND=noninteractive
nicholas-owen pushed a commit to nicholas-owen/hap.py that referenced this issue Mar 17, 2022
* fix bold GFM rendering

Without a space between the *** and the : , Github won't render *** as bold.

* typos?

* Clarify that INFO.VQSLOD is required for --roc

* Update read me with GCC/G++ 4.9.2+ requirement

See Illumina#66

* Remove regex dependency + fix test error with gcc 7.3

* Fix Docker build

* Update rtg-tools to work with new Java

* Updated RELASES.md and normalization.md (Illumina#109)

* Documentation updates for 0.3.12 release (Illumina#110)

* Update RELEASES.md (Illumina#87)

* Updated RELASES.md and normalization.md (Illumina#109)

* Update more docs (Illumina#111)

* Update happy.md

* Cleaned up normalisation.md

* Update happy.md

* Created a new section for working with genome VCFs
  * Added description of --filter-nonref
  * Updated description of --convert-gvcf-xxxx options
  * Added note toward beginning of document calling attention to genome VCF section
* Fixed broken links in TOC
* Fixed inconsistent case in headings

* Merge doc changes from dev (Illumina#112)

* Update RELEASES.md (Illumina#87)

* Updated RELASES.md and normalization.md (Illumina#109)

* Update more docs (Illumina#111)

* Update happy.md

* Cleaned up normalisation.md

* Update happy.md

* Created a new section for working with genome VCFs
  * Added description of --filter-nonref
  * Updated description of --convert-gvcf-xxxx options
  * Added note toward beginning of document calling attention to genome VCF section
* Fixed broken links in TOC
* Fixed inconsistent case in headings

* Remove tag (Illumina#113)

* Documentation updates for 0.3.12 release (Illumina#110)

* Update RELEASES.md (Illumina#87)

* Updated RELASES.md and normalization.md (Illumina#109)

* Merge doc changes from dev (Illumina#112)

* Update RELEASES.md (Illumina#87)

* Updated RELASES.md and normalization.md (Illumina#109)

* Update more docs (Illumina#111)

* Update happy.md

* Cleaned up normalisation.md

* Update happy.md

* Created a new section for working with genome VCFs
  * Added description of --filter-nonref
  * Updated description of --convert-gvcf-xxxx options
  * Added note toward beginning of document calling attention to genome VCF section
* Fixed broken links in TOC
* Fixed inconsistent case in headings

* Resolved merge conflict

* Remove seemingly-unused ENV command

* Restore ENV DEBIAN_FRONTEND=noninteractive
nicholas-owen pushed a commit to nicholas-owen/hap.py that referenced this issue Mar 17, 2022
* Updated RELASES.md and normalization.md (Illumina#109)

* Update more docs (Illumina#111)

* Update happy.md

* Cleaned up normalisation.md

* Update happy.md

* Created a new section for working with genome VCFs
  * Added description of --filter-nonref
  * Updated description of --convert-gvcf-xxxx options
  * Added note toward beginning of document calling attention to genome VCF section
* Fixed broken links in TOC
* Fixed inconsistent case in headings

* Remove tag (Illumina#113)

* Documentation updates for 0.3.12 release (Illumina#110)

* Update RELEASES.md (Illumina#87)

* Updated RELASES.md and normalization.md (Illumina#109)

* Merge doc changes from dev (Illumina#112)

* Update RELEASES.md (Illumina#87)

* Updated RELASES.md and normalization.md (Illumina#109)

* Update more docs (Illumina#111)

* Update happy.md

* Cleaned up normalisation.md

* Update happy.md

* Created a new section for working with genome VCFs
  * Added description of --filter-nonref
  * Updated description of --convert-gvcf-xxxx options
  * Added note toward beginning of document calling attention to genome VCF section
* Fixed broken links in TOC
* Fixed inconsistent case in headings

* Clean merge (Illumina#115)

* fix bold GFM rendering

Without a space between the *** and the : , Github won't render *** as bold.

* typos?

* Clarify that INFO.VQSLOD is required for --roc

* Update read me with GCC/G++ 4.9.2+ requirement

See Illumina#66

* Remove regex dependency + fix test error with gcc 7.3

* Fix Docker build

* Update rtg-tools to work with new Java

* Documentation updates for 0.3.12 release (Illumina#110)

* Update RELEASES.md (Illumina#87)

* Updated RELASES.md and normalization.md (Illumina#109)

* Merge doc changes from dev (Illumina#112)

* Update RELEASES.md (Illumina#87)

* Updated RELASES.md and normalization.md (Illumina#109)

* Update more docs (Illumina#111)

* Update happy.md

* Cleaned up normalisation.md

* Update happy.md

* Created a new section for working with genome VCFs
  * Added description of --filter-nonref
  * Updated description of --convert-gvcf-xxxx options
  * Added note toward beginning of document calling attention to genome VCF section
* Fixed broken links in TOC
* Fixed inconsistent case in headings

* Resolved merge conflict

* Remove seemingly-unused ENV command

* Restore ENV DEBIAN_FRONTEND=noninteractive
nicholas-owen pushed a commit to nicholas-owen/hap.py that referenced this issue Mar 17, 2022
* Documentation updates for 0.3.12 release (Illumina#110)

* Update RELEASES.md (Illumina#87)

* Updated RELASES.md and normalization.md (Illumina#109)

* Merge doc changes from dev (Illumina#112)

* Update RELEASES.md (Illumina#87)

* Updated RELASES.md and normalization.md (Illumina#109)

* Update more docs (Illumina#111)

* Update happy.md

* Cleaned up normalisation.md

* Update happy.md

* Created a new section for working with genome VCFs
  * Added description of --filter-nonref
  * Updated description of --convert-gvcf-xxxx options
  * Added note toward beginning of document calling attention to genome VCF section
* Fixed broken links in TOC
* Fixed inconsistent case in headings

* Fix unmergable typo directly in master (Illumina#119)

This little space has caused no end of problems in getting these branches synced up. I'm hopeful that attacking it directly at the source may relieve the issues.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants