LogicLLM

This is the official repository for the paper: "Exploring Self-supervised Logic-enhanced Training for Large Language Models".

Requirements

Python: 3.9
CUDA: 11.7/11.8

Other python packages can be installed using the following command:

pip install -r requirements.txt

This project relies on Hydra to manage configurations. The configuration files are located in conf/.

Typical usage:

python trainer_base_ds_mul.py -cp <config_path> -cn <config_file_name>  # torch launcher
deepspeed --include localhost:0,1,2,3 trainer_base_ds_mul.py -cp <config_path> -cn <config_file_name>  # deepspeed launcher

Datasets

You can download all datasets for self-supervised training from the huggingface repo, which also contains the processed datasets containing already constructed logically consistent pairs.

Preprocessing

If you want to preprocess the datasets by yourself, you can simply run the following command:

python trainer_base_ds_mul.py -cp <config_path> -cn <config_file_name> do_preprocess=True

This will stops the program after the datasets have been prepared. You can also remove do_preprocess=True so that the program will start training immediately. However, this is not encouraged, as the preprocessing step is time-consuming, and usually the training requires distributed training, which means that the other processes will be waiting for the data to be ready.

Training

All configs for training different models are listed as follows:

LLaMA-7B
- Config: conf/llama/wiki/llama_7b_merit_v1_pv91_v91_v5_0.yaml
- Weights: Huggingface Hub
LLaMA-13B
- Config: conf/llama/wiki/llama_13b_merit_v1_pv91_v91_v5_0.yaml
- Weights: Huggingface Hub
LLaMA-33B (QLoRA)
- Normal data : Counterfactual data = 1:3
  - Config: conf/llama/wiki/llama_30b_merit_v1_pv91_v91_v5_0.yaml
  - Weights: Huggingface Hub
- Normal data : Counterfactual data = 1:0
  - Config: conf/llama/wiki/llama_30b_merit_v1_pv91_v91_v5_0_no_aug.yaml
  - Weights: Huggingface Hub
- Normal data : Counterfactual data = 1:1
  - Config: conf/llama/wiki/llama_30b_merit_v1_pv91_v91_v5_0_1aug.yaml
  - Weights: Huggingface Hub
LLaMA-65B (QLoRA)
- Config: conf/llama/wiki/llama_65b_merit_v1_pv91_v91_v5_0.yaml
LLaMA-65B (Full parameter training w. Pipeline Parallel)
- Config: conf/llama/wiki/llama_65b_merit_v1_pv91_v91_v5_0_full_mp.yaml
- Note: For pipeline parallel training, please launch the program using trainer_base_ds_mp.py. Also, please first convert the Huggingface weights to DeepSpeed's format via convert2ckpt.py.
Falcon-40B
- Config: conf/rw/falcon_40b_merit_v1_pv91_v91_v5_0.yaml
- Weights: Huggingface Hub

Evaluation

Since there are too many configs for evaluation in this repo, we only list one example here:

python trainer_base_fsdp_v4.py -cp conf/llama/wiki/mc_eval/ -cn llama_30b_merit_v5_qlora_logiqav2_eval_mc_v1_0_test  # This is for LogiQA-v2 multiple choice evaluation.

Citation

If you find the repository and the paper helpful, please kindly cite our papers:

@inproceedings{logicllm2023jiao,
  author       = {Fangkai Jiao and
                  Zhiyang Teng and
                  Bosheng Ding and
                  Zhengyuan Liu and
                  Nancy F. Chen and
                  Shafiq R. Joty},
  title        = {Exploring Self-supervised Logic-enhanced Training for Large
                  Language Models},
  booktitle    = {{NAACL}},
  publisher    = {Association for Computational Linguistics},
  year         = {2024},
}

@inproceedings{merit2022jiao,
  author       = {Fangkai Jiao and
                  Yangyang Guo and
                  Xuemeng Song and
                  Liqiang Nie},
  title        = {MERIt: Meta-Path Guided Contrastive Learning for Logical Reasoning},
  booktitle    = {Findings of {ACL}},
  pages        = {3496--3509},
  publisher    = {Association for Computational Linguistics},
  year         = {2022},
}

Name		Name	Last commit message	Last commit date
Latest commit History 203 Commits
conf		conf
data		data
general_util		general_util
lomo		lomo
meta_llama		meta_llama
models		models
modules		modules
open_ai_callers		open_ai_callers
post_processors		post_processors
preprocess		preprocess
remax_engine/utils		remax_engine/utils
scripts		scripts
test		test
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
Untitled.ipynb		Untitled.ipynb
VEM.txt		VEM.txt
check_gpu_status.py		check_gpu_status.py
convert2ckpt.py		convert2ckpt.py
convert2ckpt_double_head.py		convert2ckpt_double_head.py
convert2hf.py		convert2hf.py
convert_llama_weights_to_hf.py		convert_llama_weights_to_hf.py
cot_combine_v1_0.py		cot_combine_v1_0.py
ds_inference.py		ds_inference.py
ds_inference_zero.py		ds_inference_zero.py
extend_ckpt_weight_size.py		extend_ckpt_weight_size.py
filter_merit_data_binary.py		filter_merit_data_binary.py
llama_example.py		llama_example.py
lomo_trainer.py		lomo_trainer.py
lora_merge.py		lora_merge.py
merit_eval.py		merit_eval.py
merit_eval_v2.py		merit_eval_v2.py
multi_run.sh		multi_run.sh
multi_run_dist.sh		multi_run_dist.sh
multi_run_dist_seed.sh		multi_run_dist_seed.sh
openai_api_caller_v1.py		openai_api_caller_v1.py
pbs.submit.sh		pbs.submit.sh
predictor.py		predictor.py
predictor_gen.py		predictor_gen.py
prepare_cot_feedback.py		prepare_cot_feedback.py
prompt_llm_service_via_curl.py		prompt_llm_service_via_curl.py
requirements.txt		requirements.txt
run.sh		run.sh
run_flask_server.py		run_flask_server.py
s5cmd		s5cmd
seed_multi_run.sh		seed_multi_run.sh
seed_multi_run_ds.sh		seed_multi_run_ds.sh
seed_multi_run_v4.sh		seed_multi_run_v4.sh
service_api_caller_v1.py		service_api_caller_v1.py
slurm.submit.sh		slurm.submit.sh
slurm.submit.w2.sh		slurm.submit.w2.sh
speculative_decode_serve.py		speculative_decode_serve.py
start.py		start.py
train_lomo.py		train_lomo.py
trainer_base_ds_mp.py		trainer_base_ds_mp.py
trainer_base_ds_mp_aws.py		trainer_base_ds_mp_aws.py
trainer_base_ds_mul.py		trainer_base_ds_mul.py
trainer_base_ds_mul_aws.py		trainer_base_ds_mul_aws.py
trainer_base_ds_v1.py		trainer_base_ds_v1.py
trainer_base_fsdp_v1.py		trainer_base_fsdp_v1.py
trainer_base_fsdp_v2.py		trainer_base_fsdp_v2.py
trainer_base_fsdp_v3.py		trainer_base_fsdp_v3.py
trainer_base_fsdp_v3_mul.py		trainer_base_fsdp_v3_mul.py
trainer_base_fsdp_v4.py		trainer_base_fsdp_v4.py
trainer_base_v3.py		trainer_base_v3.py
trainer_base_v3_aws.py		trainer_base_v3_aws.py
trainer_ds_mp_unify_aws.py		trainer_ds_mp_unify_aws.py
trainer_slurm_fsdp_v1.py		trainer_slurm_fsdp_v1.py
trainer_torch_fsdp_v1.py		trainer_torch_fsdp_v1.py
trainer_torch_fsdp_v2.py		trainer_torch_fsdp_v2.py
wiki_rel_disc_eval.py		wiki_rel_disc_eval.py
wiki_rel_disc_eval_base.py		wiki_rel_disc_eval_base.py
zero_to_fp32.py		zero_to_fp32.py

License

SparkJiao/LogicLLM

Folders and files

Latest commit

History