Skip to content

Polish language thread #151

@ziom6270

Description

@ziom6270

In some generations, the first two or three words were a complete/incomprehensible hallucination (as if the model was warming up - steering towards the Polish language). Furthermore, it read Japanese names surprisingly well, but, for example, it read the word "jujitsu" plainly, without the typical Polonization characteristic of the Polish language. In some longer generations, the quality deteriorates after about 3 minutes, after which it returns to correct quality. I am attaching all my tests.

An excellent model; I see huge potential in it. Definitely more optimized than the 1.5B or Large version. Although these are voice cloning models, they can be highly unstable and require significant effort in post-production. Generally, the male voice sounds better and behaves more stably, although it is good to have a comparison with the female voice. I suspect that the voice model needs to be trained on a larger audio dataset with ideal transcription for these specific phenomena. But the results are genuinely satisfactory. In terms of PC performance, the constant VRAM requirement is about 3.5GB.

I am attaching all my tests for listening, along with the script and commentary.

@YaoyaoChang
Thank you very much for such quick action and providing the model for testing. I can't wait for the next sample. Perhaps a small script could be made available for retraining on my own dataset; that would be very helpful for testing the appropriate characters—those specific to the Polish language.

[01] -MAN- -WOMAN- -COMMENTARY- -LOG_DETAILS- -TEXT_SCRIPT-

[02] -MAN- -WOMAN- -COMMENTARY- -LOG_DETAILS- -TEXT_SCRIPT-

[03]-MAN- -WOMAN- -LOG_DETAILS- -TEXT_SCRIPT-

[04] -MAN- -WOMAN- -LOG_DETAILS- -TEXT_SCRIPT-

[05] -MAN- -WOMAN- -LOG_DETAILS- -TEXT_SCRIPT-

[06] -MAN- -WOMAN- -LOG_DETAILS- -TEXT_SCRIPT-

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions