Fail to reproduce result from demo video #5

Closed
CompactSupport opened this issue Nov 1, 2022 · 3 comments


@CompactSupport

I tried to reproduce the result of the Mandarin lip reading from the following demo video:

https://youtu.be/FIau-6JA9Po?t=33

I cut a clip, "demo_cn.mp4", from this video (0:33-0:41).

The command I ran:

python main.py --config-filename configs/CMLR_V_WER8.0.ini --data-filename inputs/demo_cn.mp4

The output:

load a pre-trained model from: models/CMLR/CMLR_V_WER8.0/model.pth
face tracking speed: 4.90 fps.
hyp: 有一种种的人俗话说的大家人才能真的一年里的一个行

This differs from the transcription shown in the demo: 中青祝愿大家在新的一年里新春愉快身体健康
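To put a number on how far the hypothesis is from the demo transcription, a character error rate (CER) can be computed. The following is a minimal sketch using a plain Levenshtein distance over characters; the function names `edit_distance` and `cer` are illustrative and not part of the repository, and the project's own scoring may differ.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two sequences (here: strings of characters)."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, 1):
        cur = [i]
        for j, h in enumerate(hyp, 1):
            cur.append(min(
                prev[j] + 1,             # deletion
                cur[j - 1] + 1,          # insertion
                prev[j - 1] + (r != h),  # substitution (cost 0 on a match)
            ))
        prev = cur
    return prev[-1]


def cer(ref, hyp):
    """Character error rate: edit distance divided by reference length."""
    if not ref:
        raise ValueError("reference must not be empty")
    return edit_distance(ref, hyp) / len(ref)


if __name__ == "__main__":
    # Strings copied from this issue: demo transcription vs. reproduced output.
    reference = "中青祝愿大家在新的一年里新春愉快身体健康"
    hypothesis = "有一种种的人俗话说的大家人才能真的一年里的一个行"
    print(f"CER: {cer(reference, hypothesis):.2f}")
```

A CER far above the figure implied by the model name would suggest an input mismatch (for example frame rate or cropping) rather than ordinary recognition noise, though that is only a heuristic.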

I also extracted mouth ROIs from the clip (link).

Would you please let me know if I missed anything?

@zhunge

zhunge commented Nov 3, 2022

Try setting the video frame rate to 25 fps; that may give the correct result.
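One way to act on the frame-rate suggestion is to check the clip's actual frame rate and re-encode it to 25 fps before running inference. This is a sketch only: it assumes `ffmpeg` and `ffprobe` are installed and on the PATH, the helper names and the output file name are hypothetical, and it is not confirmed that re-encoding resolves the mismatch reported in this issue.

```python
import subprocess
from fractions import Fraction

TARGET_FPS = 25  # frame rate suggested in this thread


def parse_frame_rate(text):
    """Convert ffprobe's rational string (e.g. '30000/1001') to a float."""
    return float(Fraction(text.strip()))


def probe_fps(path):
    """Ask ffprobe for the frame rate of the first video stream."""
    out = subprocess.run(
        ["ffprobe", "-v", "error", "-select_streams", "v:0",
         "-show_entries", "stream=r_frame_rate",
         "-of", "default=noprint_wrappers=1:nokey=1", path],
        capture_output=True, text=True, check=True,
    ).stdout
    return parse_frame_rate(out.splitlines()[0])


def resample_cmd(src, dst, fps=TARGET_FPS):
    """Build an ffmpeg command that re-encodes src at a constant frame rate."""
    return ["ffmpeg", "-y", "-i", src, "-vf", f"fps={fps}", dst]


def ensure_fps(src, dst, fps=TARGET_FPS):
    """Return a path to a clip at the target fps, re-encoding only if needed."""
    if abs(probe_fps(src) - fps) < 0.01:
        return src
    subprocess.run(resample_cmd(src, dst, fps), check=True)
    return dst
```

For example, `ensure_fps("inputs/demo_cn.mp4", "inputs/demo_cn_25fps.mp4")` would return the path to pass to `--data-filename`; the second file name is made up for illustration.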

@mpc001 mpc001 closed this as completed Nov 19, 2022
@fantasyfw

Same problem: I am not able to reproduce the result either. Here is my output:

load a pre-trained model from: models/CMLR/CMLR_V_WER8.0/model.pth
face tracking speed: 5.98 fps.
hyp: 我们总结了祖国的大量餐饮新的一天也成了这样

The frame rate was set to 25 fps in "configs/CMLR_V_WER8.0.ini".

@mostafamdy

Same here. Visual-only recognition does not work well, but combined visual and audio recognition works perfectly.
