ExploreVLA: Dense World Modeling and Exploration for End-to-End Autonomous Driving

📁 Project Structure

Code

/fs/scratch/rb-bd-dlp-rng-dl01-cr-tfx/special/tfx-901/shz1syv/codes/ExploreVLA/

Datasets

NavSim (RGB + Depth) /fs/scratch/rb-bd-dlp-rng-dl01-cr-tfx/special/tfx-901/shz1syv/navsim
nuScenes (RGB) /fs/scratch/rb-bd-dlp-rng-dl01-cr-tfx/special/tfx-901/archive/aid_005/nuScenes
nuScenes (Depth) /fs/scratch/rb-bd-dlp-rng-dl01-cr-tfx/special/tfx-901/shz1syv/nuscenes/
PWM Dataset /fs/scratch/rb-bd-dlp-rng-dl01-cr-tfx/special/tfx-901/shz1syv/pwm_datasets
NavSim v1 Training Metric Cache /fs/scratch/rb-bd-dlp-rng-dl01-cr-tfx/special/tfx-901/shz1syv/hf/ReCogDrive_Metric_Cache
NavSim v1/v2 Test Metric Cache /fs/scratch/rb-bd-dlp-rng-dl01-cr-tfx/special/tfx-901/shz1syv/navsim/exp

Model Checkpoints

Stage 1 (Imitation Learning / SFT) .../codes/ExploreVLA/resume_checkpoint_sft
Stage 2 (Reinforcement Learning) .../codes/ExploreVLA/resume_checkpoint

🚀 Training

Single A100 (Debug / Development)

Request an A100 GPU, load environment, and start training:

bsub -Is -q inter_a100 -n 4 -M 32000 -W 12:00 -gpu "num=1:mig=7" /bin/bash
module load conda/4.11.0 cuda/12.4 gcc/10.2.0
conda activate /fs/scratch/rb-bd-dlp-rng-dl01-cr-tfx/special/tfx-901/shz1syv/.conda/envs/showo

bash scripts/finetune/navsim/run_sft_navsim_baseline_1a100.sh

The conda environment is based on Show-o.

🐞 Debugging (Remote Attach)

Update the host according to your assigned node host: rng-dl01-w0xx:

        {
            "name": "Python: Remote Attach",
            "type": "python",
            "request": "attach",
            "connect": {
                "host": "rng-dl01-w089",
                "port": 9501
            },
            "pathMappings": [
                {
                    "localRoot": "${workspaceFolder}",
                    "remoteRoot": "${workspaceFolder}"
                }
            ]
        }

Enable debug mode in config:

experiment:
    is_debug: true

🖥️ Cluster Training

cd bsub

# Stage 1: Imitation Learning
bsub < navsim.bsub  

# Stage 2: Reinforcement Learning
bsub < navsim_rl.bsub

⚙️ Configuration

Use my_navsim.yaml for Stage 1.

Stage 1: Pre-training

Task: image + action → image

experiment:
    stage: 'pretrain'
    eval_only: false
    is_debug: false

Stage 1: Supervised Fine-Tuning (SFT)

Task: image → action + image

experiment:
    stage: 'sft'
    eval_only: false
    is_debug: false

📊 Evaluation

NAVSIM v1 (PDMS)

bash scripts/finetune/navsim/run_sft_navsim_rl_eval_predefine_std.sh

⚠️ Make sure to load the Stage 2 RL checkpoint from: .../resume_checkpoint/...

NAVSIM v2 (EPDMS)

Rename original navsim folder to avoid conflicts:
```
navsim → navsim_v1
```
Optional (already downloaded): Download NavSim v2.2 repo and place only the navsim folder here
Optional (already recomputed): Recompute metric cache (v1 and v2 caches are NOT compatible)
Run evaluation:

python my_run_pdm_score_v2.py

⚠️ After evaluation, rename the folder back if needed.

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
accelerate_configs		accelerate_configs
bsub		bsub
configs		configs
data_utils		data_utils
llava		llava
models		models
navsim-v2		navsim-v2
navsim		navsim
nuplan		nuplan
nuscenes_kit		nuscenes_kit
scripts/finetune		scripts/finetune
training		training
vla4dst		vla4dst
.gitignore		.gitignore
README-cn.md		README-cn.md
README.md		README.md
my_run_pdm_score_v2.py		my_run_pdm_score_v2.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ExploreVLA: Dense World Modeling and Exploration for End-to-End Autonomous Driving

📁 Project Structure

Code

Datasets

Model Checkpoints

🚀 Training

Single A100 (Debug / Development)

🐞 Debugging (Remote Attach)

🖥️ Cluster Training

⚙️ Configuration

Stage 1: Pre-training

Stage 1: Supervised Fine-Tuning (SFT)

📊 Evaluation

NAVSIM v1 (PDMS)

NAVSIM v2 (EPDMS)

🙏 Acknowledgements

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

ExploreVLA: Dense World Modeling and Exploration for End-to-End Autonomous Driving

📁 Project Structure

Code

Datasets

Model Checkpoints

🚀 Training

Single A100 (Debug / Development)

🐞 Debugging (Remote Attach)

🖥️ Cluster Training

⚙️ Configuration

Stage 1: Pre-training

Stage 1: Supervised Fine-Tuning (SFT)

📊 Evaluation

NAVSIM v1 (PDMS)

NAVSIM v2 (EPDMS)

🙏 Acknowledgements

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages