This library calculates the fluency factors of a given audio file It is important that the file formats be wave You can customize the speech to text tool the default speech to text is VOSK
pip install git+https://github.com/salsina/persian-fluency-detector#egg=persian_fluency_detector