MultiTask Learning in Intent Detection and Slot Prediction for Tamil Conversational Dialogues using Multilingual Pretrained Models
The aim of this project is to develop an intent detction and slot prediction system for Tamil language. An open source dataset from the paper "TamilATIS: Dataset for Task-Oriented Dialog in Tamil (S et al., DravidianLangTech 2022)" was used.Both the Single Task learning based approach and Multi-task learning approaches are experimented. In MultiTask Learning, we use a Random Loss Weighting for intent-detection to account for class-imbalance.
The TamilATIS dataset was created by translating the ATIS dataset to English and then the slots are annotated manually. To get the dataset, email the authors of the paper "TamilATIS: Dataset for Task-Oriented Dialog in Tamil (S et al., DravidianLangTech 2022)".
The demo for this project is available here
As a first step, clone this repo.
To run the MultiTask Learning experiements, Edit all the configurations in the configs folder.This project uses Hydra as a configuration management tool. If you are new to hydra, I would recommend this tutorial.
After setting up all the configs,
python3 main.py
The single task learning experiments are inside, the stl
folder.
To experiment with this dataset, two multilingual pretrained models are used XLM-Roberta-Base and XLM-Align-Base. Both the Single task learning and Multi-task learning approach is used.
Model | Accuracy | Macro F1 | Weighted F1 |
---|---|---|---|
XLM-ROBERTA-BASE | 0.9252 | 0.4080 | 0.9166 |
XLM-ALIGN-BASE | 0.9013 | 0.2506 | 0.8839 |
MURIL-BASE | 0.8146 | 0.0498 | 0.7314 |
MURIL-LARGE | 0.8968 | 0.2290 | 0.8715 |
Model | Accuracy | Macro F1 | Weighted F1 |
---|---|---|---|
XLM-ROBERTA-BASE | 0.9507 | 0.6298 | 0.9205 |
XLM-ALIGN-BASE | 0.9428 | 0.57 | 0.89 |
MURIL-BASE | 0.6437 | 0.0635 | 0.5456 |
MURIL-LARGE | 0.96 | 0.7611 | 0.9548 |
Model | Accuracy | Macro F1 | Weighted F1 |
---|---|---|---|
XLM-ROBERTA-BASE | 0.9476 | 0.6458 | 0.9434 |
XLM-ALIGN-BASE | 0.9566 | 0.5828 | 0.9521 |
! MURIL-BASE | 0.9581 | 0.5959 | 0.9530 |
! MURIL-LARGE | 0.9596 | 0.6202 | 0.9552 |
Model | Accuracy | Macro F1 | Weighted F1 |
---|---|---|---|
XLM-ROBERTA-BASE | 0.9600 | 0.7508 | 0.9432 |
XLM-ALIGN-BASE | 0.9243 | 0.7863 | 0.9395 |
MURIL-BASE | 0.9407 | 0.6434 | 0.9301 |
MURIL-LARGE | 0.95 | 0.7577 | 0.9467 |
- The RLW didn't work when used with span loss, The run can be found be found here, so the results were not included here.
- XLM large model didn't provide decent results
Citation
Ramaneswaran S, Sanchit Vijay, and Kathiravan Srinivasan. 2022. TamilATIS: Dataset for Task-Oriented Dialog in Tamil. In Proceedings of the Second Workshop on Speech and Language Technologies for Dravidian Languages, pages 25–32, Dublin, Ireland. Association for Computational Linguistics.
Bibtex
@inproceedings{s-etal-2022-tamilatis,
title = "{T}amil{ATIS}: Dataset for Task-Oriented Dialog in {T}amil",
author = "S, Ramaneswaran and
Vijay, Sanchit and
Srinivasan, Kathiravan",
booktitle = "Proceedings of the Second Workshop on Speech and Language Technologies for Dravidian Languages",
month = may,
year = "2022",
address = "Dublin, Ireland",
publisher = "Association for Computational Linguistics",
url = "https://aclanthology.org/2022.dravidianlangtech-1.4",
doi = "10.18653/v1/2022.dravidianlangtech-1.4",
pages = "25--32"
}