Skip to content
This repository has been archived by the owner on Jul 28, 2021. It is now read-only.

Feature: speech recognition for visual feedback on audio #94

Open
robinp opened this issue Aug 7, 2019 · 4 comments
Open

Feature: speech recognition for visual feedback on audio #94

robinp opened this issue Aug 7, 2019 · 4 comments

Comments

@robinp
Copy link

robinp commented Aug 7, 2019

Could wire up speech recognition on the audio chunks to:

  • auto-name the clips when importing
  • show the text an the audio block in the timeline

Bonus: add keyword / topic extraction.

@owickstrom
Copy link
Owner

Yes, this is something I've been wanting to add. But I haven't found any good tools to integration with yet (maybe haven't looked hard enough...)

@robinp
Copy link
Author

robinp commented Aug 15, 2019

You mean speech recognition? I had good experience with CMU PocketSphinx, even wrote some c2hs bindings. If you are interested, I can dig them up!

@owickstrom
Copy link
Owner

Sorry for dropping the ball on this one. PocketSphinx seems interesting, and it's available on macOS, Linux, and Windows, it seems. Do you have the bindings published somewhere?

@robinp
Copy link
Author

robinp commented Jan 5, 2020

Just pushed after some dusting: https://github.com/TreeTide/voicetrans/tree/master/sphinx . Partial bindings, but good enough to run the recognition (see app/Main.hs to get model file and test input).

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants