Nice project! I switched to your pure Python solution and it works great for audio. There is an issue for speech however, you can see in my benchmark here it compares this library to the original C++ VisQOL and speech often gets significantly higher scores with deltas of 1-2 MOS. My benchmark uses the lattice_tcditugenmeetpackhref_ls2_nl60_lr12_bs2048_learn.005_ep2400_train1_7_raw.tflite model.
https://github.com/nschimme/faac/actions/runs/24147298369?pr=147
The dataset that is misbehaving is: https://github.com/nschimme/TCD-VOIP
Nice project! I switched to your pure Python solution and it works great for audio. There is an issue for speech however, you can see in my benchmark here it compares this library to the original C++ VisQOL and speech often gets significantly higher scores with deltas of 1-2 MOS. My benchmark uses the
lattice_tcditugenmeetpackhref_ls2_nl60_lr12_bs2048_learn.005_ep2400_train1_7_raw.tflitemodel.https://github.com/nschimme/faac/actions/runs/24147298369?pr=147
The dataset that is misbehaving is: https://github.com/nschimme/TCD-VOIP