GitHub - NUS-IDS/coling22_QSTS

This is the repository for the COLING 2022 paper: "QSTS: A Question-Sensitive Text Similarity Measure for Question Generation".

The question-class prediction model was trained using T5-large (https://huggingface.co/docs/transformers/model_doc/t5) on the TREC dataset (https://cogcomp.seas.upenn.edu/page/resource_view/49).

See code/predict_T5.py for details on how to use our trained classifier based on T5-large.

We used Stanza (version 1.2) to obtain dependency parses and NER information (https://stanfordnlp.github.io/stanza/).
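As a rough illustration of the kind of Stanza output referred to above, the following sketch extracts dependency triples and named entities from a sentence. It is not taken from this repository: the function names `dependency_triples` and `parse_and_tag` are hypothetical, the processor list is an assumption, and the English models are assumed to have been downloaded beforehand with `stanza.download("en")`.

```python
def dependency_triples(words):
    """Turn one sentence's words into (word, relation, head word) triples.

    `words` is any sequence of objects with `text`, `head` and `deprel`
    attributes, such as `sentence.words` in Stanza. `head` is 1-based and
    0 marks the root of the sentence.
    """
    triples = []
    for word in words:
        head_text = words[word.head - 1].text if word.head > 0 else "ROOT"
        triples.append((word.text, word.deprel, head_text))
    return triples


def parse_and_tag(text, lang="en"):
    """Return (dependency triples, named entities) for `text` (hypothetical helper)."""
    import stanza  # imported lazily so dependency_triples works without Stanza

    # Assumption: this processor list is enough for dependency parses and NER.
    nlp = stanza.Pipeline(lang=lang, processors="tokenize,pos,lemma,depparse,ner")
    doc = nlp(text)

    deps = []
    for sentence in doc.sentences:
        deps.extend(dependency_triples(sentence.words))
    entities = [(ent.text, ent.type) for ent in doc.ents]
    return deps, entities


if __name__ == "__main__":
    deps, entities = parse_and_tag("Who founded Microsoft in 1975?")
    print(deps)
    print(entities)
```

Building the pipeline is slow, so in practice it would be created once and reused across all questions and passages.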

The pkl dump of the GloVe embeddings, subsetted to a word list from Wiki, can be created as follows:

Use the make_embeddings function in code/glove_similarity.py with two inputs:
the original GloVe embeddings from https://nlp.stanford.edu/data/glove.840B.300d.zip
and the terms list provided in code/wikiterms_stopwords.txt.
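For readers who want to see what such a subsetting step involves, here is a minimal standalone sketch. It is not the repository's make_embeddings function, whose signature is not documented here. The helper names (`load_terms`, `subset_glove`, `dump_subset`) are hypothetical, and the sketch assumes that the GloVe archive has been unzipped to a plain-text file, that the terms file holds one term per line, and that a dict of word to vector is an acceptable pickle layout.

```python
import pickle


def load_terms(path):
    """Read a terms file, assumed to contain one term per line."""
    with open(path, encoding="utf-8") as f:
        return {line.strip() for line in f if line.strip()}


def subset_glove(glove_txt, terms, dim=300):
    """Keep only the vectors whose token appears in `terms`.

    Each line of the GloVe text file is `token v1 v2 ... v_dim`. A few
    tokens in the 840B release contain spaces, so the vector is taken
    from the last `dim` fields and the token from whatever precedes them.
    """
    kept = {}
    with open(glove_txt, encoding="utf-8") as f:
        for line in f:
            parts = line.rstrip("\n").split(" ")
            if len(parts) <= dim:
                continue  # skip malformed or empty lines
            word = " ".join(parts[:-dim])
            if word in terms:
                kept[word] = [float(x) for x in parts[-dim:]]
    return kept


def dump_subset(glove_txt, terms_path, out_pkl, dim=300):
    """Write the subsetted embeddings to `out_pkl`; return how many were kept."""
    vectors = subset_glove(glove_txt, load_terms(terms_path), dim)
    with open(out_pkl, "wb") as f:
        pickle.dump(vectors, f)
    return len(vectors)


if __name__ == "__main__":
    # File names below are assumptions about the local layout.
    n = dump_subset("glove.840B.300d.txt",
                    "code/wikiterms_stopwords.txt",
                    "glove_subset.pkl")
    print(f"kept {n} vectors")
```

Streaming the file line by line avoids loading the full set of embeddings into memory, which matters because the unzipped file is several gigabytes.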
