The shared task for the Workshop on Simulation of Conversational Intelligence in Chat (SCI-CHAT) serves as a place to test and compare new and established research ideas in the field of open-domain dialogue and natural language processing.

hkmirza/EACL2024-SCI-CHAT-SharedTask

EACL2024-SCI-CHAT-SharedTask

Data pre-processing

  • After downloading the data into the podcast folder, run:
python preprocess.py --input_folder path/to/the/podcast/folder
  • The command generates JSON output with two keys, input and reply, which is used to fine-tune the DialoGPT-medium model.
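Before fine-tuning, it can help to check that the preprocessed data has the expected shape. The sketch below is illustrative only: it assumes the output is a single JSON file holding a list of objects, each with an input and a reply key, which the README does not confirm. The function name load_pairs is hypothetical and not part of preprocess.py.

```python
import json
from pathlib import Path


def load_pairs(json_path):
    """Load (input, reply) pairs from a preprocessed JSON file.

    Assumption: the file holds a list of objects, each with
    "input" and "reply" keys. Adjust if the real layout differs.
    """
    records = json.loads(Path(json_path).read_text(encoding="utf-8"))
    pairs = []
    for i, record in enumerate(records):
        # Fail early if a record lacks one of the two expected keys.
        missing = {"input", "reply"} - set(record)
        if missing:
            raise ValueError(f"record {i} is missing keys: {sorted(missing)}")
        pairs.append((record["input"], record["reply"]))
    return pairs
```

Calling load_pairs on the generated file should return a list of tuples, or raise a ValueError that names the first malformed record.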

Model Fine-tuning

A walkthrough of how to fine-tune DialoGPT can be found in the Hugging Face notebooks or here. A step-by-step guide to fine-tuning DialoGPT-medium on the podcast data and pushing the model to the Hugging Face Hub is available as fine_tune.ipynb.
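As a rough illustration of how an input/reply pair can be turned into a training string, the sketch below joins the two turns with the GPT-2 end-of-text token, which DialoGPT uses to mark turn boundaries. This is an assumption about the data format, not a description of what fine_tune.ipynb does, and build_training_text is a hypothetical helper name.

```python
# End-of-text token used by GPT-2 and DialoGPT tokenizers.
EOS_TOKEN = "<|endoftext|>"


def build_training_text(context, reply, eos_token=EOS_TOKEN):
    """Join one dialogue context and its reply into a single training string.

    Each turn is terminated by the end-of-text token so the model can
    learn where one speaker stops and the next begins.
    """
    return f"{context.strip()}{eos_token}{reply.strip()}{eos_token}"


example = build_training_text("How was the show?", "It was great.")
```

When a tokenizer is loaded, its eos_token attribute can be passed as eos_token instead of the hard-coded constant.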

Hosting API

Once you have a working model, you can host the API on the Hugging Face Hub or another service of your choice. A guide to uploading your model to the Hugging Face Hub is available here. Make sure the API is publicly accessible.

The sample format of the API requests will be available soon.

Model Details

Other Possible Pre-trained Models

Other possible datasets
