
[NeurIPS 2025] GOAL: Distilling LLM Prior to Flow Model for Generalizable Agent’s Imagination in Object Goal Navigation

This repository contains the PyTorch implementation of our paper: Distilling LLM Prior to Flow Model for Generalizable Agent's Imagination in Object Goal Navigation

Data and Model Weights Preparation

  1. Download the MP3D and HM3D scene datasets according to the instructions here. Place the downloaded datasets under ./data/scene_datasets (the expected overall layout is sketched after this list).

  2. To generate the semantic map dataset, please refer to the scripts offered by PONI. Note that for HM3D, minor modifications to the code are required due to differences in file formatting compared to MP3D. After map construction, extract the files and place them in ./data/semantic_maps. (NOTE: we initially tried to upload precomputed maps, but compressing and uploading such large datasets proved prone to corruption.)

  3. Following PONI, the validation episode datasets are split into multiple parts for parallel processing. You may either download the episode datasets according to the instructions and split them yourself, or download the pre-split datasets directly from here. Place the resulting files under ./data/datasets/objectnav.

  4. Following common practice in SGM, T-Diff, etc., we leverage the area potential function from PONI as a frontier-based exploration strategy when the prediction confidence of GOAL is low (e.g., at the very beginning of navigation, with limited observations). You can download it from the official PONI repo or directly from here. Place the file as ./pretrained_models/area_potential.pth.

  5. We provide pretrained models trained on MP3D and HM3D with ChatGPT Prior:

    Dataset | Model
    ------- | ------------
    MP3D    | mp3d_chatgpt
    HM3D    | hm3d_chatgpt
  6. We provide the model weights of the sparse UNet for segmentation here. Place the file as ./pretrained_models/spconv_state.pth.
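
Putting the steps above together, the data and weights should end up in a layout roughly like the following (a sketch assembled from the paths listed above; the location of the GOAL model weights themselves is an assumption, so check the experiment scripts for the exact path they load):

GOAL/
  data/
    scene_datasets/        # MP3D and HM3D scenes (step 1)
    semantic_maps/         # extracted semantic maps (step 2)
    datasets/
      objectnav/           # pre-split validation episodes (step 3)
  pretrained_models/
    area_potential.pth     # PONI area potential function (step 4)
    mp3d_chatgpt, hm3d_chatgpt   # GOAL model weights (step 5; assumed location)
    spconv_state.pth       # sparse UNet segmentation weights (step 6)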

Environment Setup

We recommend separate environments for training (generative flow) and evaluation (ObjectGoal navigation).

Training environment

  1. Create the environment:
conda create -n goal-train python=3.9
conda activate goal-train
  2. Install PyTorch. For example, we use v2.1.0 with CUDA 11.8 for training:
pip install torch==2.1.0 torchvision==0.16.0 torchaudio==2.1.0 --index-url https://download.pytorch.org/whl/cu118
  3. Install other necessary packages:
# timm for necessary components of DiT;
# sklearn for DBSCAN clustering of observed objects
pip install timm scikit-learn tensorboard
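
As a quick sanity check for the training environment (optional; this just confirms the install, assuming a CUDA-capable GPU is visible):

python -c "import torch; print(torch.__version__, torch.cuda.is_available(), torch.cuda.device_count())"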

Evaluation environment

  1. Create the environment:
conda create -n goal-eval python=3.8
conda activate goal-eval
  2. Install PyTorch. For example, we use v1.12.0 with CUDA 11.6 for evaluation:
pip install torch==1.12.0+cu116 torchvision==0.13.0+cu116 torchaudio==0.12.0 --extra-index-url https://download.pytorch.org/whl/cu116
  3. Install habitat-sim and habitat-lab; we use version v0.2.1. However, we recommend installing the habitat-sim file directly from here, as the official download scripts sometimes cause unexpected errors.

  4. Install spconv for the sparse UNet:

# adjust the CUDA version according to your setup
pip install spconv-cu116
  5. Install other necessary packages:
pip install scikit-image scikit-fmm einops timm six torch_geometric torchdiffeq
  6. Compile pyastar for local planning:
cd ./nav/astar_pycpp && make

We also provide the YAML files train_env.yaml and eval.env.yaml for reference.
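
As with the training environment, a quick import check can catch installation problems early (a minimal sketch; it only verifies that the packages import, not full simulator functionality):

python -c "import torch, habitat_sim, spconv; print('eval env OK, torch', torch.__version__)"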


Running Experiments

Experiment scripts with various configurations are available in the ./experiments_scripts directory. You should first activate the corresponding environment:

# For training 
conda activate goal-train
# For evaluation 
conda activate goal-eval


Training

By default, training utilizes the first four GPUs. You may modify the visible GPU devices by editing the corresponding experiment scripts.
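
For example, to train on GPUs 0 and 1 only, one common approach (a sketch; the released scripts may set device IDs themselves, so prefer editing them directly as described above) is to restrict visibility before launching:

# <TRAIN_SCRIPT>.sh is a placeholder for the script you pick from ./experiments_scripts
export CUDA_VISIBLE_DEVICES=0,1
sh ./experiments_scripts/<TRAIN_SCRIPT>.sh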

Evaluation

  1. Set the environment variables GOAL_ROOT and PYTHONPATH to point to the root directory of the GOAL repository:

    export GOAL_ROOT=<YOUR_PATH_TO_GOAL>
    export PYTHONPATH=<YOUR_PATH_TO_GOAL>
  2. Run the evaluation script as follows:

    sh script.sh <GPU_IDS> <THREADS_PER_GPU> <PARTS>
    • <GPU_IDS>: Comma-separated list of GPU device IDs, e.g., 0,1
    • <THREADS_PER_GPU>: Number of parallel threads per GPU
    • <PARTS>: (Optional) Specify dataset splits to evaluate; if omitted, all splits will be used (11 splits for MP3D, 20 splits for HM3D)

    Example:

    sh script.sh 0,1 6

    This command runs all parts on GPUs 0 and 1, with 6 threads per GPU. Note that 6 threads per GPU corresponds approximately to a 22GB GPU memory requirement. Please adjust the thread count according to your hardware capacity.

  3. After evaluation completes, merge results from all parts to obtain overall performance statistics:

    python $GOAL_ROOT/nav/merge_results.py --path_format "$EXPT_ROOT/<EXP_NAME>/tb_seed_100_val_part_*/stats.json"
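
Putting steps 1-3 together, a complete evaluation run on GPUs 0 and 1 might look like this (<YOUR_PATH_TO_GOAL> and <EXP_NAME> are placeholders to substitute; $EXPT_ROOT is assumed to point at your experiment output directory):

conda activate goal-eval
export GOAL_ROOT=<YOUR_PATH_TO_GOAL>
export PYTHONPATH=$GOAL_ROOT
sh script.sh 0,1 6   # all splits, 6 threads per GPU (~22GB of memory per GPU)
python $GOAL_ROOT/nav/merge_results.py --path_format "$EXPT_ROOT/<EXP_NAME>/tb_seed_100_val_part_*/stats.json"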

Acknowledgements

Our work is built upon PONI, SGM, flow_matching, and astar_pycpp.

Citation

If you find this codebase useful, please cite us:

@article{li2025distilling,
  title={Distilling LLM Prior to Flow Model for Generalizable Agent's Imagination in Object Goal Navigation},
  author={Li, Badi and Lu, Ren-jie and Zhou, Yu and Meng, Jingke and Zheng, Wei-shi},
  journal={arXiv preprint arXiv:2508.09423},
  year={2025}
}
