textobjdetection

Code for the Human-related Object Detection based on Natural Language Parsing of Image Query Expressions article

Project status

Dependencies

To execute this, you must have Python 3.6.*, PyTorch, OpenCV, Numpy and Matplotlib installed, to accomplish this, we recommend installing the Anaconda Python distribution and use conda to install the dependencies, as it follows:

conda install pytorch torchvision cuda80 -c soumith
conda install opencv -c conda-forge
conda install matplotlib numpy
conda install aria2 -c bioconda
pip install visual-genome

Dataset download

You must download the Visual Genome dataset, as well the train/val/test split used for our experiments. For this, we provide the download_dataset.sh bash script, it will take care of the downloads required.

Pretrained models

Pretrained SSD + LSTM weights are provided as proof of our experimients. They are available at:

LSTM Model: https://s3-sa-east-1.amazonaws.com/textobjdetection/lstm_model.pt
SSD Model: https://s3-sa-east-1.amazonaws.com/textobjdetection/ssd_lang.pt

After downloading the models, they must be uncompressed under the weights folder.

Demo

A simple demo is provided as a Jupyter Notebook, here you can load images and predict bounding boxes given a object query phrase.

Acknowledgements

The SSD multibox detector is based on amdegroot's PyTorch implementation: https://github.com/amdegroot/ssd.pytorch

Contributions

Any contribution Pull Request will reviewed as part of Open Source initiative. We follow PEP8 and PEP257 guidelines

Name		Name	Last commit message	Last commit date
Latest commit History 263 Commits
demo		demo
ssd		ssd
.gitattributes		.gitattributes
.gitignore		.gitignore
Demo.ipynb		Demo.ipynb
LICENSE		LICENSE
README.md		README.md
download_data.sh		download_data.sh
eval_visual.py		eval_visual.py
lstm.py		lstm.py
lstm_model.py		lstm_model.py
train_visual.py		train_visual.py
train_voc.py		train_voc.py
visual_genome_loader.py		visual_genome_loader.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

textobjdetection

Project status

Dependencies

Dataset download

Pretrained models

Demo

Acknowledgements

Contributions

About

Releases

Packages

Languages

License

andfoy/textobjdetection

Folders and files

Latest commit

History

Repository files navigation

textobjdetection

Project status

Dependencies

Dataset download

Pretrained models

Demo

Acknowledgements

Contributions

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages