The official PyTorch implementation of "Learning Where to See for Navigation: A Self-Supervised Vision-Action Pre-Training Approach".
Main libraries:
- PyTorch: the main ML framework
- Comet.ml: code tracking and experiment logging
- OmegaConf: for managing configuration files
First, create a virtual environment for the project:

```bash
python3 -m venv .venv
source .venv/bin/activate
```
Then, install the latest version of PyTorch from the official site. Finally, install the remaining dependencies:

```bash
pip install -r requirements.txt
```
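To sanity-check the installation, the following generic verification snippet (not part of this repository) can be run:

```python
import torch

# Print the installed PyTorch version and whether a CUDA device is visible.
print(torch.__version__)
print(torch.cuda.is_available())
```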
To set up Comet.ml, follow the official documentation.
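As a minimal sketch of what the Comet.ml setup looks like (the project and workspace names below are placeholders; the actual integration lives in the training code):

```python
from comet_ml import Experiment

# If api_key is omitted, Comet reads it from the COMET_API_KEY
# environment variable or from ~/.comet.config.
experiment = Experiment(
    project_name="vanp",         # placeholder project name
    workspace="your-workspace",  # placeholder workspace
)

# Typical logging calls used during training.
experiment.log_parameter("batch_size", 64)
experiment.log_metric("loss", 0.42, step=1)
```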
Please follow this guide to download the dataset.
To run the pretext training, edit the config first, then run:

```bash
./run.sh train
```
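As a rough illustration of how an OmegaConf config can be edited or overridden (the file name and keys below are hypothetical, not the repository's actual schema):

```python
from omegaconf import OmegaConf

# Load the YAML config; "config.yaml" and its keys are hypothetical examples.
cfg = OmegaConf.load("config.yaml")

# Override values programmatically before training...
cfg.train.batch_size = 64

# ...or merge in overrides passed on the command line (e.g., `train.lr=3e-4`).
cfg = OmegaConf.merge(cfg, OmegaConf.from_cli())

print(OmegaConf.to_yaml(cfg))
```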
Unlike ImageNet weights, which primarily focus on a single salient object in the environment regardless of its distance, the proposed VANP attends more accurately to multiple nearby objects that directly influence the robot's trajectory, activating regions corresponding to pedestrians, cars, trash cans, doors, and other relevant elements.
However, the model sometimes fails to attend to the important regions affecting the trajectory; for example, it may activate on the sky or produce many unnecessary activations.
Thanks to the authors of GNM, VICReg, and Barlow Twins for making their code public.