Skip to content

Code for DSTC 10: SIMMC 2.0 track: Multimodal Coreference Resolution subtask.

License

Notifications You must be signed in to change notification settings

i-need-sleep/MMCoref

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

12 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Cleaned code and technical report here.

MMCoref

Submission for DSTC 10 simmc2 subtask 2: coreference resolution.

Links for checkpoints, etc are in the placeholder.txt files.

For each object, visual features are extracted using CLIP and/or BUTD. Non-visual prefab features are encoded using BERT or sentence BERT. These features are conbined with object index embeddings, positions, etc in a linear layer. The comined features for each object and the flattened dialogue history are input into a pretrained UNITER model with a binary classification head for each object.

We train 5 models with different inputs (differenct visual backbone, etc) and ensemble them to produce the final result.

About

Code for DSTC 10: SIMMC 2.0 track: Multimodal Coreference Resolution subtask.

Resources

License

Stars

Watchers

Forks

Packages

No packages published