Document-Translater

This repository contains code to translate English digital documents (PDF) into Hindi.

Requirements

1. Python 3.7
2. pip 19.0.3
3. OS: Ubuntu 18.** or Ubuntu 16.**

Installing Dependencies

pip install -r requirements.txt
sudo apt-get install tesseract-ocr
sudo apt install libtesseract-dev 
sudo apt-get install libleptonica-dev
sudo apt-get install -y poppler-utils
python -m textblob.download_corpora

Refer to the Tesseract installation page if you need help with installation.
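As a quick sanity check after installing the system packages, the short script below looks for the command-line tools they provide. It is only a sketch: it assumes that `tesseract-ocr` provides the `tesseract` executable and `poppler-utils` provides `pdftoppm`, and the helper name `missing_binaries` is made up for this example and is not part of the repository.

```python
import shutil

# Executables expected from the apt packages listed above
# (assumption: tesseract-ocr -> tesseract, poppler-utils -> pdftoppm).
REQUIRED_BINARIES = ("tesseract", "pdftoppm")


def missing_binaries(names=REQUIRED_BINARIES):
    """Return the executable names that cannot be found on PATH."""
    return [name for name in names if shutil.which(name) is None]


if __name__ == "__main__":
    missing = missing_binaries()
    if missing:
        print("Missing executables:", ", ".join(missing))
    else:
        print("All required executables were found on PATH.")
```

If anything is reported as missing, re-run the matching `apt-get install` command before starting the server.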

Model Download:

Download the model from here and copy it into the ./model folder. If your model file has a different name, update the model path in src/constants.py.

Start Server (starts a server on port 5001)
```
python src/app.py
```
Go to http://localhost:5001/home
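To confirm from a script that the server is reachable before opening the page, something like the following can be used. This is a minimal sketch using only the standard library; the function names `home_url` and `server_is_up` are hypothetical and do not exist in this repository, and the check only tells you that something answers on the port, not that translation works.

```python
import urllib.error
import urllib.request


def home_url(host="localhost", port=5001):
    """Build the URL of the home page served by src/app.py."""
    return f"http://{host}:{port}/home"


def server_is_up(url, timeout=2.0):
    """Return True if an HTTP server answers at url, False otherwise."""
    try:
        with urllib.request.urlopen(url, timeout=timeout):
            return True
    except urllib.error.HTTPError:
        # The server answered, just with an error status code.
        return True
    except (urllib.error.URLError, OSError):
        # Connection refused, timeout, DNS failure, and similar.
        return False


if __name__ == "__main__":
    url = home_url()
    state = "reachable" if server_is_up(url) else "not reachable"
    print(f"{url} is {state}")
```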

Example Run

a. English - Hindi

Input Upload (English) | Hindi Output

Note: For PDFs with many pages, the API may take some time to return results. On successful processing, a text file containing the translated Hindi text is generated.

Acknowledgements

text-translator

Future Work

Update the web UI so that users can translate image-format documents and convert a single selected page of a PDF.
Support for other languages.

srijan14/Document-Machine-Translation

Folders and files

Latest commit

History

Repository files navigation

Document-Translater

Requirements

Acknowlegements

Future Work

About

Topics

Resources

License

Stars

Watchers

Forks

Languages