SSDM_MRC

The source code of paper: Learning Disentangled Semantic Representations for Zero-Shot Cross-Lingual Transfer in Multilingual Machine Reading Comprehension, accepted to ACL 2022.

Introduction

The multilingual MRC model is based on multilingual Pre-trained Language Models (PLMs) equipped with a Siamese Semantic Disentanglement Model (SSDM) to explicitly transfer only semantic knowledge to the target language.

This repository contains two directories src and data, the SSDM and MRC models code in src, and all the train and test datasets in data.

Track the latest work, we are still optimizing and adjusting, thanks to the following code source:

disentangle-semantics-syntax

multilingual-mrc-isdg

HuggingFace

Environment

GPU Quadro RTX 6000 24G
python 3.7.9
torch 1.7.1
cuda 11.0

Usage

1、Set the configurations of SSDM in config.py，mainly to set output file, choice the type of PLMs and syntax loss (POS or STL).

2、Adjust number of epochs, learning rate, etc. in mrc_experiments.conf

3、Run the training code and it will print the test results in three MRC datasets (XQuAD, MLQA and TyDiQA-GoldP).

python main.py

Moreover, SSDM and MRC can be trained separately, depending on the user's choice.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

SSDM_MRC

Introduction

Environment

Usage

About

Releases

Packages

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
data		data
src		src
README.md		README.md
config.py		config.py
main.py		main.py
mrc_experiments.conf		mrc_experiments.conf

wulinjuan/SSDM_MRC

Folders and files

Latest commit

History

Repository files navigation

SSDM_MRC

Introduction

Environment

Usage

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Packages