tarsislimadev/docker-ai

Docker AI

This project collects offline AI workflows. Several of them are packaged as Docker images and run against files in the current directory.

from video to audio

ffmpeg
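
The README names ffmpeg for this step but gives no command. A minimal sketch, assuming ffmpeg is installed on the host and that uncompressed 16-bit PCM WAV is an acceptable output format; the function name `video_to_audio` and the file names are illustrative and not part of this project:

```shell
# Hypothetical helper (not part of this repository): extract the audio track of a video.
video_to_audio() {
  input="$1"
  output="$2"
  # -vn drops the video stream; pcm_s16le writes 16-bit PCM audio, which suits a .wav file
  ffmpeg -i "$input" -vn -acodec pcm_s16le "$output"
}
```

For example, `video_to_audio video.mp4 audio.wav` would write the audio of `video.mp4` to `audio.wav`.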

from video to images

ffmpeg
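
The README names ffmpeg here without a command. One possible approach, assuming ffmpeg is installed on the host and that one frame per second is a useful sampling rate; the function name `video_to_images`, the output directory, and the frame rate are assumptions made for illustration:

```shell
# Hypothetical helper (not part of this repository): save frames of a video as PNG files.
video_to_images() {
  input="$1"
  outdir="$2"
  mkdir -p "$outdir"
  # fps=1 samples one frame per second; %04d numbers the files frame_0001.png, frame_0002.png, ...
  ffmpeg -i "$input" -vf fps=1 "$outdir/frame_%04d.png"
}
```

For example, `video_to_images video.mp4 frames` would fill the `frames` directory with numbered PNG files.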

from video to text (transcription/extraction)

?

from audio to text (speech)

?

from text to text (speaking)

espeak
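
The README names espeak here without a command. A minimal sketch, assuming espeak is installed on the host and that the result should be written to a WAV file rather than played; the function name `text_to_speech` and the default voice `en` are assumptions made for illustration:

```shell
# Hypothetical helper (not part of this repository): speak a text file into a WAV file.
text_to_speech() {
  input="$1"
  output="$2"
  voice="${3:-en}"   # assumption: fall back to the English voice when none is given
  # -v selects the voice, -f reads the text from a file, -w writes WAV output instead of playing it
  espeak -v "$voice" -f "$input" -w "$output"
}
```

For example, `text_to_speech file.txt file.wav` would use the default voice, and a third argument selects another voice.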

from text to text (translating)

argostranslate

from txt to audio (text-to-speech)

docker run --rm -v .:/files -w /files -e input=file.txt -e output=file.wav tmvdl/ai:txt2wav
# In Portuguese
docker run --rm -v .:/files -w /files -e voice=pt-br -e input=file.txt -e output=file.wav tmvdl/ai:txt2wav
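
To convert several files at once, the same command can be wrapped in a loop. A sketch, assuming the image reads the `input` and `output` variables exactly as in the commands above; the function name `txt2wav_all` is hypothetical:

```shell
# Hypothetical helper (not part of this repository): run the txt2wav image
# once for every .txt file in the current directory.
txt2wav_all() {
  for f in *.txt; do
    [ -e "$f" ] || continue   # skip the literal pattern when no .txt file exists
    # ${f%.txt}.wav swaps the .txt suffix for .wav, so notes.txt becomes notes.wav
    docker run --rm -v .:/files -w /files \
      -e input="$f" -e output="${f%.txt}.wav" tmvdl/ai:txt2wav
  done
}
```

Running `txt2wav_all` in a directory of text files would start one container per file, each writing a `.wav` file next to its source.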

from pdf file to text

docker run --rm -v .:/files -w /files -e input=file.pdf -e output=file.txt tmvdl/ai:pdf2txt

from image to text (extract)

docker run --rm -v .:/files -w /files -e input=file.png -e output=file.txt tmvdl/ai:png2txt

license

MIT
