Speech MOS scores drastically higher than before #1

@nschimme

Nice project! I switched to your pure Python solution and it works great for audio. There is an issue with speech, however: my benchmark (linked below) compares this library against the original C++ ViSQOL, and speech often scores significantly higher with this library, with deltas of 1-2 MOS. My benchmark uses the lattice_tcditugenmeetpackhref_ls2_nl60_lr12_bs2048_learn.005_ep2400_train1_7_raw.tflite model.

https://github.com/nschimme/faac/actions/runs/24147298369?pr=147

The dataset that is misbehaving is: https://github.com/nschimme/TCD-VOIP
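
To make the comparison easier to reproduce, here is a minimal sketch of how per-file deltas between the two implementations could be tabulated. The helper names (`mos_deltas`, `flag_outliers`), the file names, and the scores are hypothetical placeholders, not values from the benchmark run, and the 1.0 MOS threshold is only an assumption based on the deltas described above.

```python
from statistics import mean


def mos_deltas(reference, candidate):
    """Return {file: candidate - reference} for files present in both mappings."""
    shared = reference.keys() & candidate.keys()
    return {name: candidate[name] - reference[name] for name in shared}


def flag_outliers(deltas, threshold=1.0):
    """Return the sorted file names whose absolute delta is at least `threshold`."""
    return sorted(name for name, d in deltas.items() if abs(d) >= threshold)


# Placeholder scores for illustration only; NOT taken from the benchmark run.
cpp_scores = {"a.wav": 2.0, "b.wav": 3.5, "c.wav": 4.0}   # original C++ implementation
py_scores = {"a.wav": 3.5, "b.wav": 3.75, "c.wav": 4.0}   # pure Python implementation

deltas = mos_deltas(cpp_scores, py_scores)
print("files over threshold:", flag_outliers(deltas))
print("mean delta:", round(mean(deltas.values()), 3))
```

Listing the flagged files by name, rather than reporting only a mean, should make it easier to see whether the discrepancy is concentrated in particular degradation types of the dataset.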
