You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
This file should be a tab-separated list of paired chromosome/sequence identifiers (i.e. one pair per line), where each pair specifies two chromosome sequences, from different genome builds, which are homologous. However, this mechanism is now deprecated for calculating structures from hybrid data. Instead it is better to use the latest "master" branch version (not the first release version), which is more advanced.
In the more recent version there should be a chromosome naming file for each genome build (specified with -cn and -cn2 flags); two columns, space/tab separated. These map from the chromosome sequence identifier in the first column (e.g. as appears in the FASTA sequence file) to a simple name, like "chr1", in the second column. For example, for mouse build mm10:
The simple name should match another chromosome in the other naming file to define a homologous pair. The "nuc_sequence_names" program is provided to automatically create a naming file; seeking simple chromosome names from the sequence accession codes found in a FASTA file of a genome build. Note that if the sequence names for the genome build are already "chr1", "chr2", "chr3" etc. Then you can use a naming file of the form:
chr1 chr1
chr2 chr2
chr3 chr3
...
And naturally the second column should match the names for the hybrid's other genome build.
Can you please provide an example file of the homologous_chromos HOM_CHROMO_TSV_FILE? Or explain it a little bit.
Thanks
Gang
The text was updated successfully, but these errors were encountered: