Paraphrase identification

About

This project aims to build a neural network model that predicts whether two questions are paraphrases of each other, using deep learning.

Installation

To use this project, run the following commands:

git clone https://github.com/luciegaba/paraphrase-identification
cd paraphrase-identification

If you run the BERT fine-tuning code in Colab, install the dependencies with pip instead:

pip install -r requirements.txt

If you use a conda virtual environment:

conda env create -f environment.yml
conda activate paraphrase-identification

Results

In this project, we mainly focused on developing a model from scratch to challenge ourselves, and we built a Siamese LSTM model for this purpose. However, its performance was not very good, likely because of low-quality data and a potentially poorly calibrated model. We also built a "challenger" model based on Transformers, called "ParaBERT". The fine-tuned BERT model can be found here. See our report for more details about the project.
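To illustrate the Siamese idea mentioned above, here is a minimal sketch, not the project's actual model. It assumes a toy bag-of-words encoder standing in for the LSTM, and a similarity of the form exp(-L1 distance), which is one common choice for Siamese text models. The names `encode`, `manhattan_similarity`, `is_paraphrase` and the `threshold` value are hypothetical.

```python
import math
from collections import Counter


def encode(question: str) -> Counter:
    """Toy shared encoder: lowercase word counts (a stand-in for the LSTM, by assumption)."""
    return Counter(question.lower().split())


def manhattan_similarity(a: Counter, b: Counter) -> float:
    """Return exp(-L1 distance): 1.0 for identical encodings, near 0 as they diverge."""
    keys = set(a) | set(b)
    # Counter returns 0 for missing keys, so absent words count as zero.
    distance = sum(abs(a[k] - b[k]) for k in keys)
    return math.exp(-distance)


def is_paraphrase(q1: str, q2: str, threshold: float = 0.5) -> bool:
    """Score a question pair; the threshold is an illustrative value, not a tuned one."""
    # Both questions pass through the SAME encoder: this sharing is the "Siamese" part.
    return manhattan_similarity(encode(q1), encode(q2)) >= threshold


if __name__ == "__main__":
    print(is_paraphrase("How old are you", "how old are you"))
    print(is_paraphrase("How old are you", "What is your age"))
```

In a trained Siamese LSTM, the encoder would be a learned recurrent network producing a fixed-size vector per question, so that pairs such as the second one above could score as similar despite sharing no words.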
