Skip to content

ZihanWang314/RUC-recruitment

Repository files navigation

2021 RUC RECRUITMENT QUESTION-ANSWERING SYSTEM

2021 RUC-RecQA. Program is halted, this respiratory is for backup only.

The file PRETRAINING shows the process of fine-tuning the model from "bert-base-chinese", including: (1)using the corpus from https://gaokao.chsi.com.cn/ which is national official recruitment website, with Masked-Language-Model to fine-tune the model's word embeddings. (2) using the query-answer pairs from query_answer_pair.xlsx and data_augmentation.xlsx by sentence-transformer to fine-tune the model's ability of matching relative sentences. (3) using the dataset of CLUE and CMRC2018, to improve the model's ability of extracting information from documents. (4) using the dataset of https://rdzs.ruc.edu.cn/cms/ which is RUC official recruitment website, to amplify the query set of the model.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published