Skip to content

sylee96/KU-NLP-kt-dev-challenge-2022

 
 

Repository files navigation

kt-dev-challenge-2022

This is a project that won the first prize at kt-dev-challenge held in September 2022.
Overall, we used the T5 model provided by KT, but we would like to inform you that we cannot disclose some models and datasets due to confidentiality.

T5-model architecture

image image

Configuration of Implementation


How to start

Clone project and install modules

git clone https://github.com/kkjsw17/KU-NLP-kt-dev-challenge-2022.git
cd KU-NLP-kt-dev-challenge-2022
pip install -r ./assets/requirements.txt

How to preprocess

preprocessor.py

  • To use T5 Encoder only model, the sentence must be changed to a tensor.
  • This is used to transform from sentence to tensor.
python preprocessor.py

How to train

train.py

  • This is used to train T5 encoder-decoder model.
  • Cannot disclose train code for encoder only models due to confidentiality.
sh train.sh

How to find best model

search_hyperparams.py

  • This is used to search hyperparameters to optimize model.
  • Hyperparameter search borrowed the idea of the grid search method. search_ensemble_f1.py
  • This is used to search optimal combination.
  • The method searching best combination used the idea of the hard-voting.
python search_hyperparams.py
python search_ensemble_f1.py

How to infer

infer.py and infer_encoder.py

  • This is used to infer results using dataset of test.
  • infer.py is used to infer results using T5 encoder-decoder model
  • infer_encoder.py is used to infer results using T5 encoder only model. But due to confidentiality, this code is not perfect code.
sh infer.sh
sh infer_encoder.sh

Results

We won the first prize by this project, and T5 Encoder only model has better result than T5 Encoder-Decoder model

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 97.8%
  • Shell 2.2%