-
Notifications
You must be signed in to change notification settings - Fork 3
Edits to InterMine Checkout
Before building RatMine there is some customization of the stock InterMine checkout needed. Most of these editions are to source configuration files to ensure that data is integrated correctly.
Uniprot
Both the Uniprot configuration and keys files need to edits.
MINE/bio/sources/uniprot/main/resources/uniprot_config.properies
10116.primaryIdentifier = RGDMINE/bio/sources/uniprot/resources/uniprot_keys.properties
Protein.key_primaryaccession = primaryAccessionKEGG Pathway
Both the KEGG Pathway configuration and keys files need edits. KEGG Pathway uses the obsolete key style definitions, they can be updated to the current style, but it not needed.
MINE/bio/sources/kegg-pathway/main/resources/kegg_config.properies
# rat
rno.taxonId = 10116
rno.identifier = ncbiGeneNumberMINE/bio/sources/kegg-pathway/resources/kegg-pathway_keys.properties
Gene.key_ncbigenenumber=ncbiGeneNumberFASTA
The FASTA source is missing a keys file for RatMine. Created the properties files and add the key definition to it.
MINE/bio/sources/fasta/resources/rat-chromosome-fasta_keys.properties
Chromosome.key_primaryidentifier=primaryIdentifierBIOGRID
The BioGrid sources configuration file need to be set to use RGD identifiers
MINE/bio/sources/biogrid/main/resources/biogrid_config.properties
# rat
10116.xref.primaryIdentifier = rgdPSI IntAct
The PSI sources configuration file need to be set to use RGD identifiers
MINE/bio/sources/psi/main/resources/psi-intact_config.properties
# Rattus norvegicus
10116.identifier = primaryIdentifier
10116.datasource = rgdYou will also need to copy the psi-intact_keys.properties file to psi_keys.properties, or set the name of the source in your ratmine properties file to psi-intact.
GFF3
RatMine has its own GFF3 parser, however to get RatMine to load duplicate IDs the parent class needs one line commented out. Add the following line to approximately line #125 of InterMine’s GFF3Converter.java file.
MINE/bio/core/main/src/org/intermine/bio/dataconversion/GFF3Converter.java
duplicates = false;