Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

"word-level" doesn't always exist #31

Closed
NodudeWasTaken opened this issue Jan 13, 2023 · 2 comments
Closed

"word-level" doesn't always exist #31

NodudeWasTaken opened this issue Jan 13, 2023 · 2 comments
Labels
bug Something isn't working

Comments

@NodudeWasTaken
Copy link

NodudeWasTaken commented Jan 13, 2023

I tried to translate from japanese to english and use whisperx.
In exactly one entry it's missing a "word-level" in it's alignment dict, causing utils.write_ass to fail.

Alignment output entry:

    {
        "id": 654,
        "seek": 518644,
        "start": 5198.74,
        "end": 5198.7404,
        "text": " Shizu",
        "tokens": [
            1160,
            590,
            84
        ],
        "temperature": 1,
        "avg_logprob": -4.464975124452172,
        "compression_ratio": 1.0123456790123457,
        "no_speech_prob": 0.07407991588115692
    },
@m-bain m-bain added the bug Something isn't working label Jan 16, 2023
@tomlinsonm
Copy link

Probably redundant, but it comes in when the segment has an invalid timestamp - either greater than max duration or smaller than the allowed precision.

@m-bain
Copy link
Owner

m-bain commented Jan 24, 2023

fixed in recent commit I think d395c21

@m-bain m-bain closed this as completed Jan 24, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

3 participants