whatsmeow-transcribe

This is a small service app for transcribing (speech-to-text) WhatsApp voice messages. It is powered by whatsmeow and the openai/whisper API.

1. Clone the repository.
2. Run `go build` inside this directory.
3. Run `./whatsmeow-transcribe --api-key sk-proj-YOUR-API-KEY-HERE` to start the program.
4. On the first run, scan the QR code. On later runs, the program will remember you (unless `whatsmeow.db` is deleted).

Any voice message sent to your account will be transcribed. The speech-to-text result is automatically posted to the conversation for everyone to see.
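To illustrate the transcription step, here is a minimal sketch of how a voice message could be uploaded to a Whisper-style HTTP endpoint. It is not taken from main.go. The function names `newTranscriptionRequest` and `parseTranscription` are hypothetical, and the multipart fields (`file`, `model` set to `whisper-1`) and the JSON reply with a `text` field are assumed to follow the public OpenAI transcription API.

```go
package main

import (
	"bytes"
	"encoding/json"
	"mime/multipart"
	"net/http"
)

// newTranscriptionRequest builds a multipart POST request that uploads one
// audio file. Hypothetical helper; the field names follow the OpenAI
// transcription API, which is an assumption about what this program sends.
func newTranscriptionRequest(apiURL, apiKey, filename string, audio []byte) (*http.Request, error) {
	var body bytes.Buffer
	w := multipart.NewWriter(&body)

	// The audio goes into the "file" form field.
	part, err := w.CreateFormFile("file", filename)
	if err != nil {
		return nil, err
	}
	if _, err := part.Write(audio); err != nil {
		return nil, err
	}
	// "whisper-1" is the model name used by the hosted OpenAI API.
	if err := w.WriteField("model", "whisper-1"); err != nil {
		return nil, err
	}
	// Close writes the terminating multipart boundary.
	if err := w.Close(); err != nil {
		return nil, err
	}

	req, err := http.NewRequest(http.MethodPost, apiURL, &body)
	if err != nil {
		return nil, err
	}
	req.Header.Set("Content-Type", w.FormDataContentType())
	// A local server may not need a key, so the header is optional here.
	if apiKey != "" {
		req.Header.Set("Authorization", "Bearer "+apiKey)
	}
	return req, nil
}

// parseTranscription extracts the "text" field from a JSON response body.
func parseTranscription(body []byte) (string, error) {
	var result struct {
		Text string `json:"text"`
	}
	if err := json.Unmarshal(body, &result); err != nil {
		return "", err
	}
	return result.Text, nil
}
```

In use, the request would be sent with `http.DefaultClient.Do(req)`, the response body read in full, and the bytes passed to `parseTranscription`. The returned text is what would then be posted back to the conversation.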

You can also use the API_KEY environment variable to supply the API key.
If you are running a local speech-to-text instance, you can point --api-url at your server.
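The configuration options above could be resolved roughly as follows. This is a sketch, not the code in main.go. The `config` type, the `parseConfig` function, and the default URL are assumptions, as is the precedence rule that the `--api-key` flag wins over the `API_KEY` environment variable.

```go
package main

import "flag"

// config holds the settings the README mentions. The type and field names
// are hypothetical.
type config struct {
	APIKey string
	APIURL string
}

// defaultAPIURL is an assumption: the hosted OpenAI transcription endpoint.
const defaultAPIURL = "https://api.openai.com/v1/audio/transcriptions"

// parseConfig reads --api-key and --api-url from args. When no key is given
// on the command line, it falls back to the API_KEY environment variable,
// which is looked up through getenv so the function is easy to test.
func parseConfig(args []string, getenv func(string) string) (config, error) {
	fs := flag.NewFlagSet("whatsmeow-transcribe", flag.ContinueOnError)
	var cfg config
	fs.StringVar(&cfg.APIKey, "api-key", "", "API key for the transcription service")
	fs.StringVar(&cfg.APIURL, "api-url", defaultAPIURL, "URL of the transcription endpoint")
	if err := fs.Parse(args); err != nil {
		return config{}, err
	}
	if cfg.APIKey == "" {
		cfg.APIKey = getenv("API_KEY")
	}
	return cfg, nil
}
```

A real program would call this as `parseConfig(os.Args[1:], os.Getenv)`. Go's `flag` package accepts both `-api-key` and `--api-key`, so the invocation shown earlier works unchanged.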

This is a proof of concept. No support is provided.

hoehermann/whatsmeow-transcribe
