I've run into an issue with running CSP data (TotalSeq C, adjacent to Chromium 10X protocol for GEX).
The data is FASTQ files with reads of 26 base pairs (and a couple with some 27 bp reads).
They were sequenced as 26x10x10x[25-27] bp configuration CSP libraries.
I've been looking into running kb count and use the H5AD files with the GEX H5AD outputs.
My version of kbtools runs fine on GEX data but always produces the same error when trying to run CSP.
I've tried changing the files, the parameters, reinstalling, etc.
[2024-02-22 16:25:38,978] DEBUG [main] Printing verbose output
[2024-02-22 16:25:41,189] DEBUG [main] kallisto binary located at /home/kli/.conda/envs/kallisto/lib/python3.11/site-packages/kb_python/bins/linux/kallisto/kallisto
[2024-02-22 16:25:41,195] DEBUG [main] bustools binary located at /home/kli/.conda/envs/kallisto/lib/python3.11/site-packages/kb_python/bins/linux/bustools/bustools
[2024-02-22 16:25:41,196] DEBUG [main] Creating `/home/kb_out2/2023_03_15-samples_CSP/1S1_S1/tmp` directory
[2024-02-22 16:25:41,201] DEBUG [main] Namespace(list=False, command='count', tmp=None, keep_tmp=False, verbose=True, i='/home/ref/kb_custom_index', g='/home/ref/kb_custom_t2g', x='10XV2', o='/home/kb_out2/2023_03_15-samples_CSP/1S1_S1', num=False, w=None, r=None, t=8, m='48G', strand='forward', inleaved=False, genomebam=False, aa=False, gtf=None, chromosomes=None, workflow='standard', em=False, mm=True, tcc=False, filter='bustools', filter_threshold=None, c1=None, c2=None, overwrite=False, dry_run=False, batch_barcodes=False, loom=False, h5ad=True, loom_names='barcode,target_name', sum='none', cellranger=False, gene_names=False, N=None, report=False, no_inspect=False, kallisto='/home/kli/.conda/envs/kallisto/lib/python3.11/site-packages/kb_python/bins/linux/kallisto/kallisto', bustools='/home/kli/.conda/envs/kallisto/lib/python3.11/site-packages/kb_python/bins/linux/bustools/bustools', no_validate=False, no_fragment=False, parity=None, fragment_l=None, fragment_s=None, bootstraps=None, matrix_to_files=False, matrix_to_directories=False, fastqs=['1S1_S1_L001_QC_001.1.trimmed.fastq', '1S1_S1_L001_QC_001.2.trimmed.fastq'])
[2024-02-22 16:25:45,302] INFO [count] Using index /home/ref/kb_custom_index to generate BUS file to /home/kb_out2/2023_03_15-samples_CSP/1S1_S1 from
[2024-02-22 16:25:45,305] INFO [count] 1S1_S1_L001_QC_001.1.trimmed.fastq
[2024-02-22 16:25:45,307] INFO [count] 1S1_S1_L001_QC_001.2.trimmed.fastq
[2024-02-22 16:25:45,307] DEBUG [count] kallisto bus -i /home/ref/kb_custom_index -o /home/kb_out2/2023_03_15-samples_CSP/1S1_S1 -x 10XV2 -t 8 --fr-stranded 1S1_S1_L001_QC_001.1.trimmed.fastq 1S1_S1_L001_QC_001.2.trimmed.fastq
[2024-02-22 16:25:45,410] DEBUG [count]
[2024-02-22 16:25:46,611] ERROR [count]
[2024-02-22 16:25:46,611] ERROR [main] An exception occurred
Traceback (most recent call last):
File "/home/kli/.conda/envs/kallisto/lib/python3.11/site-packages/kb_python/main.py", line 1618, in main
COMMAND_TO_FUNCTION[args.command](parser, args, temp_dir=temp_dir)
File "/home/kli/.conda/envs/kallisto/lib/python3.11/site-packages/kb_python/main.py", line 703, in parse_count
count(
File "/home/kli/.conda/envs/kallisto/lib/python3.11/site-packages/ngs_tools/logging.py", line 62, in inner
return func(*args, **kwargs)
^^^^^^^^^^^^^^^^^^^^^
File "/home/kli/.conda/envs/kallisto/lib/python3.11/site-packages/kb_python/count.py", line 1277, in count
bus_result = kallisto_bus(
^^^^^^^^^^^^^
File "/home/kli/.conda/envs/kallisto/lib/python3.11/site-packages/kb_python/count.py", line 203, in kallisto_bus
run_executable(command)
File "/home/kli/.conda/envs/kallisto/lib/python3.11/site-packages/kb_python/dry/__init__.py", line 25, in inner
return func(*args, **kwargs)
^^^^^^^^^^^^^^^^^^^^^
File "/home/kli/.conda/envs/kallisto/lib/python3.11/site-packages/kb_python/utils.py", line 203, in run_executable
raise sp.CalledProcessError(p.returncode, ' '.join(command))
subprocess.CalledProcessError: Command '/home/kli/.conda/envs/kallisto/lib/python3.11/site-packages/kb_python/bins/linux/kallisto/kallisto bus -i /homeref/kb_custom_index -o /home/kb_out2/2023_03_15-samples_CSP/1S1_S1 -x 10XV2 -t 8 --fr-stranded 1S1_S1_L001_QC_001.1.trimmed.fastq 1S1_S1_L001_QC_001.2.trimmed.fastq' died with <Signals.SIGILL: 4>.
[2024-02-22 16:25:46,615] DEBUG [main] Removing `/home/kb_out2/2023_03_15-samples_CSP/1S1_S1/tmp` directory
#SBATCH --mem=64G
#SBATCH --partition=tb
#SBATCH --cpus-per-task=32
kb count \
-o /home/kb_out2/2023_03_15-samples_CSP/1S1_S1 \
-i /home/ref/kb_custom_index \
-g /home/ref/kb_custom_t2g \
-x 10XV2 \
-m 48G \
-t 8 \
--workflow standard \
--mm \
--h5ad \
--strand forward \
--filter bustools \
--verbose \
1S1_S1_L001_QC_001.1.trimmed.fastq 1S1_S1_L001_QC_001.2.trimmed.fastq
kb_python 0.28.0
kallisto: 0.50.1 (/home/kli/.conda/envs/kallisto/lib/python3.11/site-packages/kb_python/bins/linux/kallisto/kallisto)
bustools: 0.43.1 (/home/kli/.conda/envs/kallisto/lib/python3.11/site-packages/kb_python/bins/linux/bustools/bustools)
Cross-posting from kallistobustools repo because I realized this repo has more activity:
I've run into an issue with running CSP data (TotalSeq C, adjacent to Chromium 10X protocol for GEX).
The data is FASTQ files with reads of 26 base pairs (and a couple with some 27 bp reads).
They were sequenced as 26x10x10x[25-27] bp configuration CSP libraries.
I've been looking into running kb count and use the H5AD files with the GEX H5AD outputs.
My version of kbtools runs fine on GEX data but always produces the same error when trying to run CSP.
I've tried changing the files, the parameters, reinstalling, etc.
Snippet from standard error
Snippet from srun file
version information: