Welcome to the official repository for the paper "How do LLMs Understand Genes and Cells".
This project explores the capabilities of large language models (LLMs) in the realm of cellular biology. By leveraging the LLaMA_Factory framework, we fine-tune LLMs for gene-related challenges, paving new paths in bioinformatics research.
To replicate the experiments described in this paper, you first need to correctly install the LLaMA_Factory framework. Then, place the JSON data files from the above link into the LLaMA-Factory/data folder.
To start the fine-tuning process, run the provided training script by replacing {task_name} with the name of your specific task:
sh example_{task_name}.sh