Project City Recognition

This is Yotta’s project #2, by Olivier Collier, Cyril Lemaire and Julien Sadoun. We designed an application that recognizes the city in which a given picture was taken. Our algorithm analyses architecture, colors and patterns to make a prediction.

Our solution is based on transfer learning and builds on DenseNet201. It is a proof of concept that covers only five cities: Amsterdam, London, Paris, Strasbourg and Venice.
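To make the transfer-learning idea concrete, here is a minimal sketch of how a DenseNet201-based classifier for five cities could be assembled with Keras. It is not taken from src/domain/model.py: the function name `build_model`, the input shape, the single dense head and the optimizer are all assumptions chosen for illustration.

```python
# Sketch only: the input shape, classification head and optimizer below are
# assumptions, not values taken from src/domain/model.py.
CITIES = ["Amsterdam", "London", "Paris", "Strasbourg", "Venice"]


def build_model(num_classes=len(CITIES), input_shape=(224, 224, 3), weights="imagenet"):
    """Return a DenseNet201-based classifier with a frozen convolutional base."""
    # Imported lazily so this module can be loaded without TensorFlow installed.
    import tensorflow as tf

    # Pre-trained feature extractor, without its original ImageNet classifier.
    base = tf.keras.applications.DenseNet201(
        include_top=False, weights=weights, input_shape=input_shape, pooling="avg"
    )
    base.trainable = False  # transfer learning: keep the pre-trained weights fixed

    # New classification head: one softmax output per city.
    model = tf.keras.Sequential(
        [base, tf.keras.layers.Dense(num_classes, activation="softmax")]
    )
    model.compile(
        optimizer="adam", loss="categorical_crossentropy", metrics=["accuracy"]
    )
    return model
```

Freezing the base means only the small head is trained, which is usually what makes a modest, scraped dataset workable. Calling `build_model()` requires TensorFlow and downloads the ImageNet weights on first use.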

Prerequisites

Our project requires the following tools:

Python 3.8 (see its download page),
wget (see its download page),
Poetry (see its download page).
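The list above can be checked quickly before going further. The following sketch uses only the standard library; the function name `check_prerequisites` and its parameters are hypothetical helpers, not part of this repository.

```python
import shutil
import sys

REQUIRED_PYTHON = (3, 8)
REQUIRED_TOOLS = ("wget", "poetry")


def check_prerequisites(version_info=sys.version_info, which=shutil.which):
    """Return a list of human-readable problems; an empty list means all good."""
    problems = []
    # Compare only the major and minor version numbers.
    if tuple(version_info[:2]) != REQUIRED_PYTHON:
        found = ".".join(str(part) for part in version_info[:2])
        problems.append(f"Python 3.8 expected, found {found}")
    # Each command-line tool must be discoverable on the PATH.
    for tool in REQUIRED_TOOLS:
        if which(tool) is None:
            problems.append(f"{tool} not found on PATH")
    return problems
```

Printing the result of `check_prerequisites()` lists anything that is missing. Note that it inspects the interpreter running the script, which may differ from the one Poetry ends up using.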

Getting started

Clone this repository

git clone <this project>
cd <this project>

Set up virtual environment

Set up a Poetry environment and activate it by running the following commands:

poetry install
poetry shell

If you have several versions of Python on your computer, you may need to run the following command before running poetry install:

poetry env use /full/path/to/python3.8

Download trained model and/or image dataset

To download the trained model, run the following commands from the repository root:

wget 'https://www.dropbox.com/sh/f2ldmcezg8jhzid/AACnJsiQufdq7z5fZ4ZNg98ha?dl=0' --content-disposition
unzip models.zip
rm models.zip

To download the necessary image dataset for training and testing, run the following commands from the repository root:

wget 'https://www.dropbox.com/sh/1ks98px6egwjp31/AAAZh1LuvQzs5-9Cu9u2dQHka?dl=0' --content-disposition
unzip data.zip
rm data.zip
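After unzipping, it can be useful to verify what actually landed in data/train and data/test. The sketch below counts image files per subfolder. It assumes one subfolder per city inside each split, which this README does not confirm; `count_images` and `IMAGE_SUFFIXES` are hypothetical names.

```python
from pathlib import Path

IMAGE_SUFFIXES = {".jpg", ".jpeg", ".png"}


def count_images(split_dir):
    """Map each class subfolder of split_dir to its number of image files."""
    counts = {}
    for class_dir in sorted(Path(split_dir).iterdir()):
        if class_dir.is_dir():
            # Count files recursively, matching extensions case-insensitively.
            counts[class_dir.name] = sum(
                1
                for f in class_dir.rglob("*")
                if f.is_file() and f.suffix.lower() in IMAGE_SUFFIXES
            )
    return counts
```

For example, `count_images("data/train")` would return a dictionary such as one entry per city folder, which makes class imbalance or an incomplete download easy to spot.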

Train the model

To train a model using the previously downloaded dataset, run the following command from the repository root:

poetry run python src/application/train.py

Make predictions from the model

To test our final model (not the one trained above, but the one downloaded), run the following command from the repository root:

poetry run python src/application/predict.py
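A classifier of this kind typically outputs one probability per city, and the prediction is the city with the highest score. The sketch below shows that last step in plain Python. The class order in `CLASS_NAMES` is an assumption (alphabetical), and `predict_city` is a hypothetical helper rather than a function from src/application/predict.py.

```python
# Assumed class order; the real order depends on how the training data was loaded.
CLASS_NAMES = ["Amsterdam", "London", "Paris", "Strasbourg", "Venice"]


def predict_city(probabilities, class_names=CLASS_NAMES):
    """Return (city, probability) for the highest-scoring class."""
    if len(probabilities) != len(class_names):
        raise ValueError("expected one probability per class")
    # Index of the largest probability, i.e. an argmax over the model output.
    best = max(range(len(class_names)), key=lambda i: probabilities[i])
    return class_names[best], probabilities[best]
```

With a model output of `[0.05, 0.10, 0.70, 0.10, 0.05]`, this returns `("Paris", 0.70)`.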

Test our web-app!

To access our web-app and interactively use your own pictures, run the following command from the repository root:

streamlit run src/application/application.py

Image scraping

To create your own image dataset, you can use our script to scrape images from Google. First, fill in the src/config/scrap_config_template.py file with the folder in which you want to download images and the queries to use in Google Images. Then run the following command from the repository root:
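As an illustration of the two pieces of information the configuration needs (a download folder and a set of queries), here is a purely hypothetical example. Every name in it (`DOWNLOAD_DIR`, `QUERIES`, `target_folder`) and every value is invented, so check the template file itself for the variables it actually expects.

```python
# Hypothetical example: every name below is invented for illustration and may
# not match the variables expected by src/config/scrap_config_template.py.
from pathlib import Path

DOWNLOAD_DIR = Path("data/scraped")  # folder that receives the downloaded images

# One entry per city: the search queries to send to Google Images.
QUERIES = {
    "Paris": ["Paris street architecture", "Paris rooftops"],
    "Venice": ["Venice canal houses"],
}


def target_folder(city, download_dir=DOWNLOAD_DIR):
    """Return the folder in which images for `city` would be stored."""
    return download_dir / city.lower()
```

Keeping one folder per city is a common layout because many image loaders infer the class label from the folder name.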

poetry run python src/infrastructure/data_scraping.py

If your folder contains duplicate images, delete them by running:

poetry run python src/infrastructure/remove_duplicates.py
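One common way to detect duplicates is to hash each file's contents and group files that share a digest. The sketch below illustrates that idea with the standard library. It is not necessarily how remove_duplicates.py works, and `find_duplicates` is a hypothetical name.

```python
import hashlib
from collections import defaultdict
from pathlib import Path


def find_duplicates(folder):
    """Return files whose bytes match an earlier file (in sorted path order)."""
    by_digest = defaultdict(list)
    for path in sorted(Path(folder).rglob("*")):
        if path.is_file():
            # Identical contents always produce the same SHA-256 digest.
            digest = hashlib.sha256(path.read_bytes()).hexdigest()
            by_digest[digest].append(path)
    # Keep the first file of each group and report the rest as duplicates.
    return [dup for paths in by_digest.values() for dup in paths[1:]]
```

The returned paths can then be reviewed or removed with `path.unlink()`. This approach only catches byte-identical files; resized or re-encoded copies of the same picture would need a perceptual comparison instead.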

Documentation

To consult the project documentation, run the following command from the repository root (open is the macOS command; on Linux, use xdg-open instead):

open docs/build/html/index.html
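For a platform-independent alternative, the standard-library webbrowser module can open the same file. The helper names `docs_uri` and `open_docs` below are hypothetical and not part of this repository.

```python
import webbrowser
from pathlib import Path


def docs_uri(repo_root="."):
    """Return a file:// URI for the pre-built HTML documentation index."""
    index = Path(repo_root) / "docs" / "build" / "html" / "index.html"
    return index.resolve().as_uri()


def open_docs(repo_root="."):
    """Open the documentation in the default browser; True if one was launched."""
    return webbrowser.open(docs_uri(repo_root))
```

Calling `open_docs()` from the repository root should behave like the command above on macOS, Linux and Windows, provided a default browser is configured.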

Repository architecture

├── README.md
├── data
│   ├── test
│   └── train
├── docs
│   ├── Makefile
│   ├── build
│   │   └── html
│   │       ├── genindex.html
│   │       ├── index.html
│   │       ├── modules.html
│   │       ├── objects.inv
│   │       ├── py-modindex.html
│   │       ├── search.html
│   │       ├── searchindex.js
│   │       ├── src.application.html
│   │       ├── src.config.html
│   │       ├── src.domain.html
│   │       ├── src.html
│   │       └── src.infrastructure.html
│   ├── commands.rst
│   ├── conf.py
│   ├── getting-started.rst
│   ├── index.rst
│   └── make.bat
├── models
│   └── PCR_model.hdf5
├── pyproject.toml
└── src
    ├── __init__.py
    ├── application
    │   ├── __init__.py
    │   ├── application.py
    │   ├── predict.py
    │   └── train.py
    ├── config
    │   ├── __init__.py
    │   ├── config.py
    │   └── scrap_config_template.py
    ├── domain
    │   ├── __init__.py
    │   └── model.py
    └── infrastructure
        ├── __init__.py
        ├── data_generator.py
        ├── data_scraping.py
        └── remove_duplicates.py
