CharCnnRnn Text Embedding

A data processing pipeline (main script data_prep.py) that takes screenshots and their description text files and generates CharCnnRnn text embedding tensors using two pre-trained models: ConvAutoencoder for image feature extraction and CharCnnRnn for the text embedding. Both models are included and can be trained on a custom dataset.
The generated text embedding tensor files can be used as input to StackGAN.

Train Custom Dataset

(1) Train the conv_autoencoder model.
Run python train.py .../dataset .../output 0.001 40
Inside /dataset there should be two folders, test and train, each containing 64x64 or 256x256 grayscale .png images.
(2) Train the char_embedding model.
Run python train.py .../images_data.json .../output 0.001 20 fixed_gru cvpr img_64x64_path
The input is the path to a JSON file. The folder that holds this file must also contain the images folder, enc_64x64_images or enc_256x256_images; see the example below.
```
[{
 "text": "Hello World! ...\n",
 "encod_64x64_path": "/enc_64x64_images/enc_64x64_1609088299704.pt",
 "encod_256x256_path": "/enc_256x256_images/enc_256x256_1609088299704.pt"
 }, ...]
```
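Before starting the char_embedding training, it can help to confirm that the JSON file and the encoded image files line up. The following is a minimal sketch, not part of the repository: the function name check_manifest is hypothetical, and it assumes that the leading "/" in each path means "relative to the folder that contains the JSON file" and that every entry needs a "text" string plus at least one of the two path keys shown in the example.

```python
import json
from pathlib import Path

# Path keys taken from the example entry above.
PATH_KEYS = ("encod_64x64_path", "encod_256x256_path")


def check_manifest(manifest_path):
    """Return a list of human-readable problems (empty if none were found)."""
    manifest_path = Path(manifest_path)
    root = manifest_path.parent  # folder that should also hold enc_*_images
    entries = json.loads(manifest_path.read_text(encoding="utf-8"))
    if not isinstance(entries, list):
        return ["top-level JSON value is not a list"]

    problems = []
    for i, entry in enumerate(entries):
        if not isinstance(entry, dict):
            problems.append(f"entry {i}: not a JSON object")
            continue
        if not isinstance(entry.get("text"), str):
            problems.append(f"entry {i}: missing or non-string 'text'")
        present = [key for key in PATH_KEYS if key in entry]
        if not present:
            problems.append(f"entry {i}: no encoded-image path key")
        for key in present:
            # Assumption: "/enc_64x64_images/x.pt" is relative to the JSON folder.
            target = root / str(entry[key]).lstrip("/")
            if not target.is_file():
                problems.append(f"entry {i}: {key} points to missing file {target}")
    return problems

# Example use (the path is a placeholder):
#     for problem in check_manifest("images_data.json"):
#         print(problem)
```

An empty result only means the file structure is consistent under these assumptions; it does not check that the .pt files contain valid tensors.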
Finally, data_prep.py uses the previously trained models to generate the embedding files. Example command:
run python data_prep.py .../GAN_dataset .../projects .../conv_autoencoder_1608844546780.pt .../char_embedding_1609010245909.pt

The training data preparation code for steps (1) and (2) is part of the data_prep.py script.
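The dataset layout expected by the conv_autoencoder training (test and train folders holding 64x64 or 256x256 gray .png images) can be sanity-checked before a long training run. The sketch below is an illustration under stated assumptions and is not taken from the repository: check_dataset and png_info are hypothetical names, and "gray" is interpreted as PNG color type 0 (grayscale without alpha), which may be stricter than what train.py accepts. It reads only the PNG header, so it needs nothing beyond the standard library.

```python
import struct
from pathlib import Path

PNG_SIGNATURE = b"\x89PNG\r\n\x1a\n"
ALLOWED_SIZES = {(64, 64), (256, 256)}
GRAYSCALE = 0  # PNG color type 0: grayscale, no alpha


def png_info(path):
    """Return (width, height, color_type) read from the PNG IHDR chunk."""
    with open(path, "rb") as f:
        header = f.read(26)
    if len(header) < 26 or header[:8] != PNG_SIGNATURE or header[12:16] != b"IHDR":
        raise ValueError(f"{path} is not a valid PNG file")
    # IHDR data: width (4 bytes), height (4), bit depth (1), color type (1), ...
    width, height, _bit_depth, color_type = struct.unpack(">IIBB", header[16:26])
    return width, height, color_type


def check_dataset(dataset_dir):
    """Return a list of problems found in the dataset folder (empty if none)."""
    problems = []
    for split in ("train", "test"):
        folder = Path(dataset_dir) / split
        if not folder.is_dir():
            problems.append(f"missing folder: {folder}")
            continue
        images = sorted(folder.glob("*.png"))
        if not images:
            problems.append(f"no .png images in {folder}")
        for image in images:
            try:
                width, height, color_type = png_info(image)
            except ValueError as err:
                problems.append(str(err))
                continue
            if (width, height) not in ALLOWED_SIZES:
                problems.append(f"{image}: unexpected size {width}x{height}")
            if color_type != GRAYSCALE:
                problems.append(f"{image}: PNG color type {color_type} is not grayscale")
    return problems
```

Calling check_dataset on the dataset folder and printing the returned list gives a quick report; images saved as RGB with identical channels would be flagged here even if they look gray.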

ramidzamzam/charCnnRnn_embedding
