Please load the pretained model "model_29_1000_4l_308_512_noiseandTL.pt" for all the downstream tasks.
- Run ./data/genomic_benchmark.py to get train and test data for 8 tasks.
- Run genomic_classification.py for fine tuning.
- Download dataset from https://github.com/MAGICS-LAB/DNABERT_2
- Run gue_classification.py for fine tuning.
- Run ./data/genome_process.py to create train and test dataset.
- Run cdna_classification.py for fine tuning.