Provide memory mapping option? #43

ChillarAnand · 2024-07-07T09:52:19Z

Several classification tools provide "memory-mapping" option.

When running a huge number of samples, instead of loading db into memory everytime, "memory-mapping" option will allow to preload the db into ram once and run classification across all the samples which improves run time by a huge margin.

muellan · 2024-07-12T14:31:27Z

There's the "interactive query mode".
If you run metacache query <database_name> without any read input files the database will be loaded into memory and you can then run as many queries as you like.
This is easiest done by piping query strings into metacache in a script like in the example below:

#!/bin/bash
database="mydatabasename"
queries=""
# add query
queries="${queries} myreads.fq -out myoutfile.txt\n"
# add query
queries="${queries} reads1.fa reads2.fa -pairfiles -out myoutfile.txt\n"
# ... add more queries ....
# finally: load database and run all queries
echo -e ${queries} | ./metacache query ${database}

ChillarAnand mentioned this issue Jul 7, 2024

Guidance for working with large reference data sets #37

Open

muellan added the question label Jul 12, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Provide memory mapping option? #43

Provide memory mapping option? #43

ChillarAnand commented Jul 7, 2024

muellan commented Jul 12, 2024

Provide memory mapping option? #43

Provide memory mapping option? #43

Comments

ChillarAnand commented Jul 7, 2024

muellan commented Jul 12, 2024