[Recipe PR] MELD: Multimodal EmotionLines Dataset #4771

Merged (22 commits into espnet:master on Nov 28, 2022)

Conversation

@realzza (Contributor) commented Nov 17, 2022

About

Added a baseline recipe for MELD: Multimodal EmotionLines Dataset.

Checklist

  • (meld) Be careful about the name of the recipe; it is recommended to follow the naming conventions of the other recipes.
  • Common/shared files are linked with soft links (see Section 1.3.3).
  • Modified or new Python scripts should be passed through the latest black formatting (using the Python package black). The command to execute could be black espnet espnet2 test utils setup.py egs*/*/*/local egs2/TEMPLATE/*/pyscripts tools/*.py ci/*.py
  • Modified or new Python scripts should be passed through the latest isort formatting (using the Python package isort). The command to execute could be isort espnet espnet2 test utils setup.py egs*/*/*/local egs2/TEMPLATE/*/pyscripts tools/*.py ci/*.py
  • Cluster settings should be left at their defaults (e.g., cmd.sh, conf/slurm.conf, conf/queue.conf, conf/pbs.conf).
  • Update egs/README.md or egs2/README.md with the corresponding recipe.
  • Add a corresponding entry in egs2/TEMPLATE/db.sh for the new corpus.
  • Try to simplify the model configurations; we recommend having only the best configuration at the start of a recipe. Please also follow the default rule defined in Section 1.3.3.
  • Large meta-information for a corpus should be maintained elsewhere, not in the recipe itself.
  • It is recommended to also include results and a pre-trained model with the recipe:
    • results
    • (TODO) model

Todo

  • submit the model through Hugging Face (a rough upload sketch follows below)
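
For the Hugging Face item above, here is a minimal upload sketch using the huggingface_hub package. The repository id and experiment directory are placeholders (ESPnet recipes also provide their own packing/upload stages), so treat this as an illustration rather than the exact procedure used for this PR.

```python
# Hypothetical sketch of the "submit model through huggingface" Todo item.
# Assumes you are already authenticated (e.g. via `huggingface-cli login`).
from huggingface_hub import HfApi

api = HfApi()
repo_id = "your-username/meld_asr_slu"  # placeholder repository name

# Create the model repository on the Hub if it does not exist yet.
api.create_repo(repo_id=repo_id, repo_type="model", exist_ok=True)

# Upload the packed experiment directory (config, checkpoint, BPE model, ...).
api.upload_folder(
    folder_path="exp/asr_train_meld",  # placeholder path to the packed model
    repo_id=repo_id,
    repo_type="model",
)
```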

@realzza realzza marked this pull request as ready for review November 17, 2022 08:31
@sw005320 sw005320 added this to the v.202211 milestone Nov 17, 2022
@sw005320 (Contributor) commented:

@siddhu001, it would be great if you also review this PR.

@sw005320 sw005320 added the Recipe and SLU (Spoken language understanding) labels Nov 17, 2022
@ftshijt (Collaborator) left a comment:

Just to confirm: it seems this is only a unimodal model. Is that intentional, or will you keep updating it?

I also recommend checking out the SLU task, which could be a better place to host your recipe (we can keep discussing this).

Review comments (all resolved) were left on:
  • egs2/meld/asr1/local/data_prep.py (2 threads, outdated)
  • egs2/meld/asr1/local/path.sh
  • egs2/meld/asr1/run.sh (6 threads, outdated)
  • egs2/meld/asr1/README.md
@siddhu001 (Collaborator) commented:

I would suggest incorporating all of Jiatong's feedback. I added a comment and a clarification; the rest looks good to me.

@realzza (Contributor, Author) commented Nov 18, 2022

> Just to confirm: it seems this is only a unimodal model. Is that intentional, or will you keep updating it?

Yes, this recipe intentionally uses a single modality to do multitask learning of ASR and SLU. We may not update MELD to involve the other modalities.

> I also recommend checking out the SLU task, which could be a better place to host your recipe (we can keep discussing this).

Thanks for mentioning this; I have categorized the recipe as SLU in egs2/README.md.
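
To make the multitask setup above concrete, below is a minimal, hypothetical data-preparation sketch (not the recipe's actual egs2/meld/asr1/local/data_prep.py): the emotion label is written in front of the transcript in the Kaldi-style `text` file, so a single model is trained to predict both the label (SLU) and the words (ASR). The CSV column names and file layout are assumptions.

```python
# Hypothetical sketch only; the real recipe's data_prep.py may differ.
import csv
import os


def prepare_split(csv_path: str, wav_dir: str, out_dir: str) -> None:
    """Write Kaldi-style text / wav.scp / utt2spk with emotion-prefixed transcripts."""
    os.makedirs(out_dir, exist_ok=True)
    text, wav_scp, utt2spk = [], [], []
    with open(csv_path, newline="", encoding="utf-8") as f:
        for row in csv.DictReader(f):
            # Column names are assumptions about the MELD annotation CSV.
            utt_id = "dia{}_utt{}".format(row["Dialogue_ID"], row["Utterance_ID"])
            emotion = row["Emotion"].lower()
            words = " ".join(row["Utterance"].split())
            text.append("{} {} {}\n".format(utt_id, emotion, words))
            wav_scp.append("{} {}\n".format(utt_id, os.path.join(wav_dir, utt_id + ".wav")))
            utt2spk.append("{} {}\n".format(utt_id, utt_id))
    for name, lines in (("text", text), ("wav.scp", wav_scp), ("utt2spk", utt2spk)):
        with open(os.path.join(out_dir, name), "w", encoding="utf-8") as out:
            out.writelines(sorted(lines))
```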

@sw005320 (Contributor) commented:

@ftshijt, can you review it again?
I think this PR is almost ready.

@ftshijt (Collaborator) commented Nov 21, 2022

Looks cool! After it passes the CI tests, I will merge it.

@realzza (Contributor, Author) commented Nov 21, 2022

It seems to fail the CI test, as well as the centos7 and debian9 tests, due to over-length lines in data_prep.py. Let me fix this formatting issue shortly!

Update: should be fixed now!

@realzza (Contributor, Author) commented Nov 22, 2022

Hi @ftshijt, thank you for your review! I fixed the flake8 error W605 (invalid escape sequence '\-'), so we can rerun the CI tests.
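
For reference, flake8's W605 flags escape sequences that Python itself does not define, such as "\-" inside a plain string used as a regex pattern. The usual fix is to switch to a raw string; the snippet below only illustrates that pattern and is not the exact change made in data_prep.py.

```python
import re

# Before: re.sub("[\-\.]", " ", s) triggers W605, because "\-" and "\." are not
# valid Python string escapes (they only mean something to the regex engine).

# After: a raw string passes the backslash through unchanged, so flake8 is happy.
def normalize(s: str) -> str:
    """Replace hyphens and periods with spaces (illustrative transformation)."""
    return re.sub(r"[\-.]", " ", s)


print(normalize("multi-modal"))  # -> "multi modal"
```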

@codecov (bot) commented Nov 22, 2022

Codecov Report

Merging #4771 (fbfe277) into master (ca2193d) will not change coverage.
The diff coverage is n/a.

@@           Coverage Diff           @@
##           master    #4771   +/-   ##
=======================================
  Coverage   80.32%   80.32%           
=======================================
  Files         530      530           
  Lines       46527    46527           
=======================================
  Hits        37372    37372           
  Misses       9155     9155           
Flag                       Coverage Δ
test_integration_espnet1   66.37% <ø> (ø)
test_integration_espnet2   48.87% <ø> (ø)
test_python                68.62% <ø> (ø)
test_utils                 23.30% <ø> (ø)

Flags with carried forward coverage won't be shown.

@realzza (Contributor, Author) commented Nov 28, 2022

The failed check may be caused by this issue reported in the latest version of numpy.

@sw005320 sw005320 merged commit 9a97143 into espnet:master Nov 28, 2022
@sw005320 (Contributor) commented:

Thanks, @realzza!

Labels: ESPnet2, README, Recipe, SLU (Spoken language understanding)
Participants: 4