Dialogue & Narrative Coursework

Subtask 1

The goal is to predict the knowledge grounding, in the form of a document span, for the next agent response, given the dialogue history and the associated document.

How to load our datasets

We mostly used the reading_comprehension (rc) dataset for our work. While this dataset contains the whole context required to ground the user utterances, it doesn't divide that context into spans the way the 'document_domain' dataset does. We merged all this information into a single dataset. Specifically, we added the spans corresponding to the context (see column 'spans') and the spans corresponding to the grounding (see column 'answers', then key 'spans') to the train and validation doc2dial_rc datasets. To load our datasets, use the following code:

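import ast
import pandas as pd

# The 'answers' and 'spans' columns are stored as stringified dicts; parse them back into dicts on load.
to_dict = ast.literal_eval
# Replace <train/val> with either 'train' or 'val'.
df = pd.read_csv('./data/doc2dial_rc_<train/val>.csv.zip', converters={'answers': to_dict, 'spans': to_dict})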
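To see why the converters are needed, here is a minimal, self-contained sketch that round-trips a toy DataFrame through CSV. The column contents and inner keys (such as 'text_sp') are invented for illustration and are not taken from the real files. Only the column names 'spans' and 'answers', and the 'spans' key inside 'answers', follow the description above.

```python
import ast
import io

import pandas as pd

# Toy row, invented for illustration; the real files have more columns
# and the inner keys of each dict may differ.
toy = pd.DataFrame(
    {
        "question": ["How do I renew?"],
        "spans": [{"1": {"text_sp": "Renew online."}}],
        "answers": [{"text": ["Renew online."], "spans": ["1"]}],
    }
)

# Writing to CSV turns each dict into its string representation.
buffer = io.StringIO()
toy.to_csv(buffer, index=False)

# Without converters, the dict columns come back as plain strings.
buffer.seek(0)
raw = pd.read_csv(buffer)

# With converters, ast.literal_eval rebuilds the dicts without running arbitrary code.
buffer.seek(0)
parsed = pd.read_csv(
    buffer,
    converters={"answers": ast.literal_eval, "spans": ast.literal_eval},
)

raw_type = type(raw.loc[0, "spans"]).__name__
parsed_type = type(parsed.loc[0, "spans"]).__name__

# Grounding span ids live under the 'spans' key of the 'answers' column.
grounding_ids = parsed.loc[0, "answers"]["spans"]
```

Using ast.literal_eval rather than eval restricts parsing to Python literals, which is the safer choice when reading data files.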

Our work

Our work is split across a variety of notebooks and Python scripts. More importantly, our report highlights our experiments, results, and evaluation.

data
img
notebooks
test
.gitignore
README.md
convert_rc_data.py
convert_strings_to_dicts.py
coursework_assignment_DN_2021.pdf
data_investigation.py
doc2dial_rc_val.csv
load_data.py
predictions_subtask1_NB.json
predictions_subtask1_cosine_simple.json
sharedtask_utils.py
task1_naive_bayes.py
utils.py
word_counts.py