Docker AI

This project collects several AI workflows that run offline in Docker containers.

from video to audio

ffmpeg
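
A minimal sketch of this step, assuming ffmpeg is installed and on the PATH. The helper name `video_to_audio` and the file names are hypothetical, and the project itself may use different options.

```shell
# Hypothetical helper: extract the audio track of a video into a WAV file.
# $1 = input video, $2 = output WAV file
video_to_audio() {
  # -vn drops the video stream; pcm_s16le writes uncompressed 16-bit PCM audio.
  ffmpeg -i "$1" -vn -acodec pcm_s16le "$2"
}

# Usage: video_to_audio input.mp4 output.wav
```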

from video to images

ffmpeg
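
One possible way to do this, again assuming ffmpeg is available. The helper name `video_to_images`, the one-frame-per-second rate, and the `frame_%04d.png` naming are illustrative choices, not taken from the project.

```shell
# Hypothetical helper: save one frame per second of a video as numbered PNGs.
# $1 = input video, $2 = output directory
video_to_images() {
  mkdir -p "$2"
  # -vf fps=1 samples one frame per second; %04d expands to 0001, 0002, ...
  ffmpeg -i "$1" -vf fps=1 "$2/frame_%04d.png"
}

# Usage: video_to_images input.mp4 frames
```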

from video to text (transcription/extraction)

?

from audio to text (speech)

?

from text to text (speaking)

espeak
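
A small sketch of reading a text file aloud, assuming espeak is installed on the host. The helper name `speak_text` is hypothetical, and which voice names are available depends on the espeak build.

```shell
# Hypothetical helper: read a text file aloud through the default audio device.
# $1 = text file, $2 = optional voice name (defaults to "en")
speak_text() {
  # -v selects the voice; -f reads the text to speak from a file.
  espeak -v "${2:-en}" -f "$1"
}

# Usage: speak_text file.txt
```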

from text to text (translating)

argostranslate
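
A hedged sketch of a translation call, assuming the `argos-translate` command-line tool that ships with argostranslate is installed and that a language package for the chosen pair has already been installed with `argospm`. The helper name `translate_text` is hypothetical.

```shell
# Hypothetical helper: translate a string between two languages offline.
# $1 = source language code, $2 = target language code, $3 = text
translate_text() {
  argos-translate --from-lang "$1" --to-lang "$2" "$3"
}

# Usage: translate_text en pt "Hello World!"
```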

from txt to audio (text-to-speech)

docker run --rm -v .:/files -w /files -e input=file.txt -e output=file.wav tmvdl/ai:txt2wav
# In Portuguese
docker run --rm -v .:/files -w /files -e voice=pt-br -e input=file.txt -e output=file.wav tmvdl/ai:txt2wav

from pdf file to text

docker run --rm -v .:/files -w /files -e input=file.pdf -e output=file.txt tmvdl/ai:pdf2txt
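
To convert several files at once, the command above can be wrapped in a loop. This is only a sketch: the helper name `pdfs_to_txt` is hypothetical, and it uses `"$PWD"` instead of `.` as the volume source on the assumption that both mount the current directory.

```shell
# Hypothetical helper: run the pdf2txt image once per PDF in the current directory.
pdfs_to_txt() {
  for pdf in *.pdf; do
    # Skip the unexpanded pattern when no PDF is present.
    [ -e "$pdf" ] || continue
    # Write the text next to the source file, e.g. report.pdf -> report.txt.
    docker run --rm -v "$PWD":/files -w /files \
      -e input="$pdf" -e output="${pdf%.pdf}.txt" tmvdl/ai:pdf2txt
  done
}

# Usage: cd into the directory with the PDFs, then run: pdfs_to_txt
```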

from image to text (extract)

docker run --rm -v .:/files -w /files -e input=file.png -e output=file.txt tmvdl/ai:png2txt

license

MIT

