pip install -r requirements.txt
Install the Matterport3D simulator:
git submodule update --init --recursive
sudo apt-get install libjsoncpp-dev libepoxy-dev libglm-dev libosmesa6 libosmesa6-dev libglew-dev libopencv-dev
mkdir build && cd build
cmake -DEGL_RENDERING=ON ..
# Replace the above line with the following if it doesn't work:
# cmake -DOSMESA_RENDERING=ON ..
make -j8
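After `make` completes, it can help to confirm that the simulator's Python bindings are importable. A minimal sketch, assuming the build directory has been added to `PYTHONPATH` (the `MatterSim` module is the binding produced by the Matterport3D simulator build):

```python
# Sanity check: try importing the MatterSim bindings produced by the build.
# Assumes the build directory is on PYTHONPATH, e.g.:
#   export PYTHONPATH=$PYTHONPATH:$(pwd)/build
try:
    import MatterSim  # Python bindings built by the Matterport3D simulator
    status = "ok"
except ImportError:
    status = "missing"
print(f"MatterSim bindings: {status}")
```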
bash ./tasks/R2R/data/download.sh
bash run/agent_clip_vit16.bash 0 # 0 is the GPU id
bash run/speaker_clip_vit16.bash 0
bash run/foam_envdrop_clip_vit16.bash 0
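Each script above takes the GPU id as its first argument. A hypothetical sketch (not the actual script contents) of how such a script might consume that argument, using the standard `CUDA_VISIBLE_DEVICES` mechanism:

```shell
# Hypothetical sketch: consume the GPU id passed on the command line.
GPU_ID=${1:-0}                      # default to GPU 0 if no argument is given
export CUDA_VISIBLE_DEVICES=$GPU_ID # restrict training to the chosen GPU
echo "Training on GPU $CUDA_VISIBLE_DEVICES"
```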
@inproceedings{dou2022foam,
title={FOAM: A Follower-aware Speaker Model for Vision-and-Language Navigation},
author={Dou, Zi-Yi and Peng, Nanyun},
booktitle={Conference of the North American Chapter of the Association for Computational Linguistics (NAACL)},
year={2022},
}
The code is based on EnvDrop and CLIP-ViL-VLN. We thank Hao Tan for his help with preprocessing.