Skip to content

xiaojingyu92/Criteria2SQL

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 
 
 

Repository files navigation

Criteria2SQL

Dataset and source code for our work Dataset and Enhanced Model for Eligibility Criteria-to-SQL Semantic Parsing.

Source code (src)

  • The implementation based on SQLova.

Requirements

  • python3.6 or higher.
  • PyTorch 0.4.0 or higher.
  • CUDA 9.0
  • Python libraries: babel, matplotlib, defusedxml, tqdm
  • Example
    • Install minicoda
    • conda install pytorch torchvision -c pytorch
    • conda install -c conda-forge records
    • conda install babel
    • conda install matplotlib
    • conda install defusedxml
    • conda install tqdm
  • The code has been tested on GTX 1080 Ti running on Ubuntu 16.04.4 LTS.

Training and Testing

  • To train the model by running: python train.py --seed 1 --bS 2 --accumulate_gradients 8 --bert_type_abb uS --fine_tune --lr 0.001 --lr_bert 0.00001 --max_seq_leng 512 on terminal.

  • To test on pre-trained model by running: python test.py --seed 1 --bS 2 --accumulate_gradients 8 --bert_type_abb uS --max_seq_leng 512 on terminal.

  • Pre-trained models can be download from here.

Dataset (data)

Our dataset follows same format as WikiSQL, while includes new types of SQL queries for order-sensitive eligibility criteria, counting-based eligibility criteria, boolean-type eligibility criteria.

Citation

If you use Criteria2SQL, please cite the following work:

@InProceedings{yu-EtAl:2020:LREC,
  author    = {Yu, Xiaojing  and  Chen, Tianlong  and  Yu, Zhengjie  and  Li, Huiyu  and  Yang, Yang  and  Jiang, Xiaoqian  and  Jiang, Anxiao},
  title     = {Dataset and Enhanced Model for Eligibility Criteria-to-SQL Semantic Parsing},
  booktitle      = {Proceedings of The 12th Language Resources and Evaluation Conference},
  month          = {May},
  year           = {2020},
  address        = {Marseille, France},
  publisher      = {European Language Resources Association},
  pages     = {5831--5839
  }