Simplified GeoGuessr: Visual Place Recognition from Google Street View Images

A deep learning project that classifies the city shown in a street view image, using multiple model architectures. It compares three approaches to visual place recognition: a GSV model adapted from an existing open-source implementation (GSV-Cities), a vision-language model (VLM) built on CLIP's vision encoder, and our hybrid model, GeoSceneNet, which fuses computer-vision-based scene descriptors with CNN image features.

Models

GSV

Based on the GSV-Cities framework:

Backbone: ResNet50
Aggregation: ConvAP (Convolutional Aggregation Pooling)
Fine-tuned with a classification head for city prediction

VLM

Uses OpenAI's CLIP model:

Base Model: openai/clip-vit-base-patch32
Architecture: CLIP vision encoder with a linear classification head
Leverages pre-trained vision-language representations

GeoSceneNet

Our own custom model:

Model: Fusion of CV scene descriptors and CNN image features (ResNet18)
Classification head predicts the city from the fused features

Installation

Clone the repository:

git clone https://github.com/onoderamia/CS549.git
cd CS549

Prepare the GSV-cities repository:

git submodule init
git submodule update

Then comment out line 7 in gsv/gsv-cities/main.py:

# from dataloaders.GSVCitiesDataloader import GSVCitiesDataModule
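
If you prefer not to edit the file by hand, the same change can be scripted. The helper below is a sketch, and comment_out_import is a hypothetical name; it matches on the import text rather than on the line number, so it still works if the line shifts in a later version of the submodule.

```python
from pathlib import Path

TARGET_IMPORT = "from dataloaders.GSVCitiesDataloader import GSVCitiesDataModule"


def comment_out_import(path, target=TARGET_IMPORT):
    """Prefix the first uncommented line equal to `target` with '# '.

    Returns True if the file was changed, False if no matching line was found.
    """
    file = Path(path)
    lines = file.read_text().splitlines(keepends=True)
    for i, line in enumerate(lines):
        if line.strip() == target:
            lines[i] = "# " + line
            file.write_text("".join(lines))
            return True
    return False
```

From the repository root, calling comment_out_import("gsv/gsv-cities/main.py") applies the edit; a second call returns False because the line is already commented out.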

Install dependencies:

pip install -r requirements.txt

Scrape your own data (or download our data from here):

cd scraper
echo "API_KEY=[YOUR_GOOGLE_API_KEY]" > .env
python scraper.py
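
The internals of scraper.py are not shown here. As a rough illustration of what such a scraper needs, the sketch below reads API_KEY from the .env file and builds a request URL for the Google Street View Static API. The function names and the example coordinates are assumptions, and no network request is made.

```python
from pathlib import Path
from urllib.parse import urlencode

STREET_VIEW_ENDPOINT = "https://maps.googleapis.com/maps/api/streetview"


def read_api_key(env_path=".env"):
    """Return the value of API_KEY from a simple KEY=VALUE file."""
    for line in Path(env_path).read_text().splitlines():
        name, _, value = line.partition("=")
        if name.strip() == "API_KEY":
            return value.strip()
    raise KeyError(f"API_KEY not found in {env_path}")


def street_view_url(lat, lng, api_key, heading=0, size="640x640"):
    """Build a Street View Static API image URL for one location and camera heading."""
    params = {
        "size": size,
        "location": f"{lat},{lng}",
        "heading": heading,
        "key": api_key,
    }
    return f"{STREET_VIEW_ENDPOINT}?{urlencode(params)}"


# Placeholder key and coordinates; a real run would pass read_api_key() instead.
print(street_view_url(41.8781, -87.6298, "YOUR_GOOGLE_API_KEY", heading=90))
```

Requesting several headings per location (for example 0, 90, 180, 270) is one way to collect multiple views of the same place, though whether the project does this is not stated.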

Usage

Training

Train the GSV model:

cd gsv
python train.py                      # Train from scratch
python train.py ../models/gsv.pth    # Resume from checkpoint

Train the VLM model:

cd vlm
python train.py                      # Train from scratch
python train.py ../models/vlm.pth    # Resume from checkpoint
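
Both commands above take the checkpoint as an optional positional argument. A minimal sketch of how a training script could handle that with argparse is shown below; the actual train.py scripts may parse their arguments differently.

```python
import argparse


def parse_args(argv=None):
    """Parse an optional checkpoint path; omit it to train from scratch."""
    parser = argparse.ArgumentParser(description="Train a city classifier")
    parser.add_argument(
        "checkpoint",
        nargs="?",
        default=None,
        help="optional path to a saved model to resume from",
    )
    return parser.parse_args(argv)


for argv in ([], ["../models/vlm.pth"]):
    args = parse_args(argv)
    mode = f"resuming from {args.checkpoint}" if args.checkpoint else "training from scratch"
    print(mode)
```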

Train the custom model (GeoSceneNet):

cd custom
python train.py

All training scripts save the model to the models/ directory. You can also use our pretrained models, available here.

Web Application

Run the web server:

cd webapp
python app.py

A demonstration of the web application is available here. If you would like to test it on the same GeoGuessr map, you can also try it here.
