transcribers language-based, each in their own directory, currently: IPA english chinese Spanish Russian Portuguese Japanese Vietnamese Yoruba Kazak todos add the rest of the transcribers, perhaps @Peter refactor the transcribers code lower priority since it works, but harder to add to