StructLM

This is the repository for our COLM-2024 paper StructLM: Towards Building Generalist Models for Structured Knowledge Grounding.

You can use this repository to evaluate the models. To reproduce the models, fine-tune on SKGInstruct in your preferred fine-tuning framework. The checkpoints are being released on Hugging Face.

The processed test data is already provided; the prompts used for training and testing can be found in /prompts.

Links

arXiv: https://arxiv.org/abs/2402.16671
Website: https://tiger-ai-lab.github.io/StructLM/

Training

The models can be reproduced with LLaMA-Factory.

Follow the data preparation steps in the LLaMA-Factory repository to add one of the StructLM datasets from Hugging Face.
Use the parameters in StructLM_finetuning.yaml as a reference, replacing the parameters in square brackets [] with your own paths. Then start training with: llamafactory-cli train StructLM_finetuning.yaml
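The placeholder substitution described above can be scripted rather than done by hand. The following is a minimal sketch, assuming each placeholder is a bare name in square brackets such as [MODEL_PATH]. The placeholder names and the two template lines are hypothetical and are not taken from StructLM_finetuning.yaml, so adjust them to match the real file.

```python
import re

# Matches a placeholder such as [MODEL_PATH]. The bracketed-name form is an
# assumption about how the config marks the values you need to supply.
PLACEHOLDER = re.compile(r"\[([A-Za-z_][A-Za-z0-9_]*)\]")


def fill_placeholders(template: str, values: dict[str, str]) -> str:
    """Replace every [NAME] in template with values[NAME].

    Raises KeyError if the template contains a name with no supplied value,
    so a forgotten path is caught before training starts.
    """
    missing = sorted(
        {name for name in PLACEHOLDER.findall(template) if name not in values}
    )
    if missing:
        raise KeyError(f"no value supplied for: {', '.join(missing)}")
    return PLACEHOLDER.sub(lambda match: values[match.group(1)], template)


# Hypothetical template lines; the real keys live in StructLM_finetuning.yaml.
template = "model_name_or_path: [MODEL_PATH]\noutput_dir: [OUTPUT_DIR]\n"
filled = fill_placeholders(
    template,
    {"MODEL_PATH": "/data/base-model", "OUTPUT_DIR": "/data/structlm-run"},
)
print(filled)
```

Note that a one-element YAML list written as [name] would also match this pattern, so review the filled file before passing it to llamafactory-cli train.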

Evaluate StructLM-7B

Install Requirements

Requirements:

Python 3.10
Linux
support for CUDA 11.8

pip install -r requirements.txt
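Before installing, it can help to confirm that the interpreter and operating system match the requirements listed above. This is a minimal sketch using only the standard library. The function name environment_problems is illustrative and not part of this repository, and CUDA support is not checked here because that needs a GPU library to be installed first.

```python
import platform
import sys


def environment_problems(version=None, system=None):
    """Return a list of mismatches against the stated requirements.

    version: (major, minor) tuple; defaults to the running interpreter.
    system:  OS name as reported by platform.system(); defaults to this host.
    """
    version = tuple(sys.version_info[:2]) if version is None else tuple(version[:2])
    system = platform.system() if system is None else system

    problems = []
    if version != (3, 10):
        problems.append(f"Python 3.10 expected, found {version[0]}.{version[1]}")
    if system != "Linux":
        problems.append(f"Linux expected, found {system}")
    return problems


# Report any mismatch on the current machine without aborting.
for problem in environment_problems():
    print("warning:", problem)
```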

Download files

./download.sh

This will download:

The raw data required for executing evaluation
The processed test data splits ready for evaluation

Run evaluation

For StructLM-7B/13B/34B

You can download these models separately with:

huggingface-cli download --repo-type=model --local-dir=models/ckpts/StructLM-7B TIGER-Lab/StructLM-7B
huggingface-cli download --repo-type=model --local-dir=models/ckpts/StructLM-13B TIGER-Lab/StructLM-13B
huggingface-cli download --repo-type=model --local-dir=models/ckpts/StructLM-34B TIGER-Lab/StructLM-34B

Then run inference on the downloaded checkpoints:

./run_test_eval.sh StructLM-7B
./run_test_eval.sh StructLM-13B
./run_test_eval.sh StructLM-34B

For StructLM-7B-Mistral

You can download the model with:

huggingface-cli download --repo-type=model --local-dir=models/ckpts/StructLM-7B-Mistral TIGER-Lab/StructLM-7B-Mistral

Then run inference on the downloaded checkpoint:

python mistral-fix-data.py
./run_test_eval.sh StructLM-7B-Mistral

These evaluations write their results to outputs/StructLM-*/.
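The download and evaluation commands for each model follow one pattern, with an extra data-fix step for the Mistral variant. The sketch below composes that sequence per model. It only builds and prints the commands; the helper name evaluation_steps is illustrative and is not part of this repository, and it assumes the commands are run from the repository root.

```python
import shlex


def evaluation_steps(model: str) -> list[list[str]]:
    """Return the commands, in order, to download and evaluate one model."""
    steps = [
        [
            "huggingface-cli",
            "download",
            "--repo-type=model",
            f"--local-dir=models/ckpts/{model}",
            f"TIGER-Lab/{model}",
        ]
    ]
    # The Mistral checkpoint needs the data-fix script before evaluation.
    if model.endswith("-Mistral"):
        steps.append(["python", "mistral-fix-data.py"])
    steps.append(["./run_test_eval.sh", model])
    return steps


# Print the plan for two of the released models.
for model in ["StructLM-7B", "StructLM-7B-Mistral"]:
    for step in evaluation_steps(model):
        print(shlex.join(step))
```

To execute a plan instead of printing it, each step can be passed to subprocess.run(step, check=True), which stops at the first failing command.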

Acknowledgements

The evaluation metrics in this repository were adapted from the evaluation files in https://github.com/HKUNLP/UnifiedSKG.

Cite

@misc{zhuang2024structlm,
    title={StructLM: Towards Building Generalist Models for Structured Knowledge Grounding},
    author={Alex Zhuang and Ge Zhang and Tianyu Zheng and Xinrun Du and Junjie Wang and Weiming Ren and Stephen W. Huang and Jie Fu and Xiang Yue and Wenhu Chen},
    year={2024},
    eprint={2402.16671},
    archivePrefix={arXiv},
    primaryClass={cs.CL}
}
