Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

How to shut it up? #19

Closed
bigboss97 opened this issue Sep 8, 2020 · 6 comments
Closed

How to shut it up? #19

bigboss97 opened this issue Sep 8, 2020 · 6 comments

Comments

@bigboss97
Copy link

I used the colab and it worked beautifully, THANKS!
I have a problem. Whenever the audio has a silence piece the original mouth movement remains unchanged.
I've tried with "mmm" to "shutup". But it had no effect. So I used "ha ha ha" to minimize the lip movement.
Is there a way to close the mouth for silence?

@prajwalkr
Copy link
Collaborator

Did you try once with the non-GAN model? It performs better on silences

@bigboss97
Copy link
Author

Did you try once with the non-GAN model? It performs better on silences

I'm not familiar with that. How can I do that in colab?

@prajwalkr
Copy link
Collaborator

Just download the corresponding checkpoint and pass it as an argument in the same inference command.

@HusainKapadia
Copy link

HusainKapadia commented Sep 8, 2020

Just a suggestion...perhaps to improve performance on silence you can add some simulated data to the training data set, where:

  1. The audio inputs would be silence or add some low DB noise to it (to avoid Mel bands being NaN)
  2. The corresponding video could be created with an image of any random person with a neutral expression (of course the length's of the audio and video should match)
  3. Train the model with this extra data along with the LSR2 dataset specified in the README. Or you could fine tune the model with the pre-trained weights.
  4. Run inference with new model.

PS: this is not something I have tried yet, but if this is a limitation of the existing model, this could be one way of improving it, I believe
Would you agree @prajwalkr ?

@prajwalkr
Copy link
Collaborator

prajwalkr commented Sep 9, 2020

Maybe it can help. Maybe using some short range temporal info in the generator might help even better. We are yet to experiment more on this.

@prajwalkr
Copy link
Collaborator

Re-open if needed.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants