Extracting text from pdf files and translate the text into designated language by calling google api
Python 3.4+
pip3 install googletrans
pip3 install pdfminer.six
python3 translate.py -a # translate all pdf files under current directory
python3 translate.py -n test.pdf # translate test.pdf under current directory
python3 translate.py -h # print all detailed helping messages
googletrans: https://github.com/ssut/py-googletrans
pdfminer: https://github.com/pdfminer/pdfminer.six
google translate api: https://cloud.google.com/translate/
Add support for exporting translation into docx files
Pack up and reform into a single executable file
Fix format issues (like words with letters "fi" will be weird after text extration from a pdf file, pictures, equations etc.)
Fix JSON empty issues in googletrans (temporarily fix by eliminating characters whose ascii value bigger than 127)
Fix translated text out-of-order problem