Ultra-Fusion: A Resilient Tightly-Coupled Multi-Sensor Fusion SLAM Framework under Sensor Degradation and Spatiotemporal Perturbation for Intelligent Transportation Systems

Core contributors: Yihong Tian, Junjie Zhang, Liuyang Li, and Jie Yin*

Ultra-Fusion is a tightly-coupled multi-sensor SLAM/localization framework for intelligent transportation systems (ITS). It is designed for real deployment where sensor degradation (illumination changes, LiDAR degeneracy, wheel slippage, GNSS outage) and spatiotemporal miscalibration are common.

The system unifies WIO, VIO, LIO, and LVIO in one configurable optimization framework, with optional wheel/GNSS fusion and online calibration.

Overview

Highlights

Unified sliding-window estimator with timestamp-ordered heterogeneous factors.
Observability-aware initialization for robust bootstrap under diverse motion/sensor conditions.
Factor-wise reliability scheduling (FRS) to gate/down-weight degraded measurements.
Online LiDAR-IMU spatiotemporal calibration during operation.
Validated on wheeled, legged, and aerial platforms across multiple public benchmarks.

Go to our project website for more details!

Method Overview

Ultra-Fusion converts asynchronous sensor streams into optional factors in one optimization window, with shared state representation, marginalization, and calibration logic.

Unified pipeline: initialization, reliability scheduling, online calibration, and multi-modal fusion in one framework.

Why Ultra-Fusion

Compared with conventional fusion pipelines that are heavily tied to a fixed sensor set, Ultra-Fusion emphasizes:

Configurability: one framework for WIO/VIO/LIO/LVIO (+ wheel/GNSS).
Reliability: robust localization under corner-case degradations.
Deployability: support for long-term and high-speed operation in real ITS scenarios.
Transferability: validated beyond wheeled robots to legged and aerial platforms.

Benchmarks and Findings

Ultra-Fusion is evaluated on:

M3DGR (wheeled, real+sim, sensor degradation),
M2DGR-Plus(wheeled),
KAIST (Complex Urban Dataset)(autonomous driving),
GrandTour (legged),
MARS-LVIG (aerial).

Across these datasets, the paper reports competitive localization performance and improved availability under:

sensor degradation (visual/LiDAR/wheel/GNSS),
temporal/extrinsic perturbations,
long-duration and high-speed operation.

Note

We currently release executable binaries and demos. Full source code will be released after paper acceptance. Please refer to our prior works Ground-Fusion and Ground-Fusion++ for implementation reference.

⚙️1. Prerequisites & Installation

The public runtime is tested on Ubuntu 20.04 + ROS Noetic. The Docker image is only the ROS/runtime environment: it contains ROS Noetic, rosbag, RViz, Ceres, yaml-cpp, and the required system libraries. It does not contain the Ultra-Fusion source tree or the release .deb.

1.1 Pull the Docker Image for Environments

#Alibaba Cloud ACR:
docker pull registry.cn-hangzhou.aliyuncs.com/bit_robot_image/ultrafusion:0.1.0

#Docker Hub:
docker pull maotiandocker/ultrafusion:0.1.0

#Or build the public runtime image from the Dockerfile:
docker build -t ultrafusion:0.1.0 .

1.2 Install the Release Deb of Ultra-Fusion

Start a container. The /media mount is optional, but convenient when rosbag files are stored on the host under /media.

xhost +local:docker

docker run --rm -it --net=host --ipc=host \
  -e DISPLAY="${DISPLAY}" \
  -e QT_X11_NO_MITSHM=1 \
  -v /tmp/.X11-unix:/tmp/.X11-unix:rw \
  -v /media:/media:ro \
  registry.cn-hangzhou.aliyuncs.com/bit_robot_image/ultrafusion:0.1.0

Inside the container, download and install the release package. The same deb is available from GitHub Releases and the project mirror:

wget -O /tmp/ultrafusion.deb \
  https://github.com/sjtuyinjie/Ultra-Fusion/releases/download/v0.1.0/ultrafusion_0.1.0_amd64.deb

# Mirror:
# wget -O /tmp/ultrafusion.deb \
#   http://47.100.60.229:8088/loc_map/releases/ultrafusion/ultrafusion_0.1.0_amd64.deb

echo "c9a40d62df6100006431598d672c943f23f116e973e9c3b111d76d76c059196c  /tmp/ultrafusion.deb" | sha256sum -c -

dpkg -i /tmp/ultrafusion.deb

The deb installs:

/opt/ultrafusion/bin/uf_node
/usr/bin/uf-node and /usr/bin/uf_node
/opt/ultrafusion/config/m3dgr
/opt/ultrafusion/config/m2p
/opt/ultrafusion/config/lvig
/opt/ultrafusion/config/kaist
/opt/ultrafusion/config/groundtour
/opt/ultrafusion/rviz/lio.rviz

Open the included RViz layout with:

rviz -d /opt/ultrafusion/rviz/lio.rviz

2 Run Ultra-Fusion on Five benchmakrs

🔥2.1 Run Ultra-Fusion on M3DGR

Download M3DGR bags and give a star. Start ROS and play your bag in the usual ROS way. Use one terminal for roscore, one terminal for rosbag play, and one terminal for uf_node.

#play your bag
roscore & rosbag play /media/path/to/your.bag --clock

Run Ultra-Fusion in another shell:

#default setting
uf_node m3dgr

#standard LWIO
uf_node m3dgr_standard

#Stronger vision-coupling
uf_node m3dgr_image_enhance

M3DGR public release profiles:

Command	Config	Recommended benchmark setting
`uf_node m3dgr`	`/opt/ultrafusion/config/m3dgr/uf_m3dgr_standard.yaml`	Default M3DGR standard profile
`uf_node m3dgr_standard`	`/opt/ultrafusion/config/m3dgr/uf_m3dgr_standard.yaml`	Standard wheeled LVWIO profile: Dynamic01, Varying-illu01, Dark01, and Occlusion01
`uf_node m3dgr_image_enhance`	`/opt/ultrafusion/config/m3dgr/uf_m3dgr_image_enhance.yaml`	Stronger visual-coupling profile: Corridor01, GNSS-denial01, Longtime01, and Longtime02

Demo preview (M3DGR). After launching uf_node with the profile above and playing the corresponding M3DGR bag in RViz (/opt/ultrafusion/rviz/lio.rviz), you should see live LiDAR mapping and trajectory overlays similar to:

Corridor01 · uf_node m3dgr_image_enhance
Stable localization in an indoor corridor under vision challenge and LiDAR degeneration. GNSS-denial01 · uf_node m3dgr_image_enhance
Continuous estimation when GNSS measurements are unavailable. Longtime02 · uf_node m3dgr_image_enhance
Long-duration operation with consistent map and path alignment.

🔥2.2 Run Ultra-Fusion on more datasets

These additional public shortcuts are included for reproducibility. The table lists the dataset sequences/settings covered by the released profiles. Sequences not listed here are outside the public release profile coverage and may require separate parameter retuning.

Command	Config	Recommended sequences/settings
`uf_node m2p`	`/opt/ultrafusion/config/m2p/uf_m2p.yaml`	M2DGR-Plus bridge1-style LVWIO setting
`uf_node lvig`	`/opt/ultrafusion/config/lvig/uf_lvig.yaml`	MARS-LVIG HKairport01 LVIO setting
`uf_node kaist`	`/opt/ultrafusion/config/kaist/uf_kaist.yaml`	KAIST urban25 and urban35
`uf_node groundtour`	`/opt/ultrafusion/config/groundtour/uf_groundtour.yaml`	GrandTour SPX-2, SNOW-2, and EIG-1

You can also pass a config path directly:

uf_node /opt/ultrafusion/config/m3dgr/uf_m3dgr_standard.yaml

Demo preview (more datasets). The GIFs below match the released shortcuts in the table above. Run the corresponding uf_node command, play the recommended sequence, and compare your RViz output with:

GrandTour Arc2 · uf_node groundtour
Legged-platform mapping and trajectory recovery through a large arc-shaped route. KAIST urban25/35 · uf_node kaist
Large-scale LiDAR mapping with vehicle trajectory visualization. MARS-LVIG HKairport01 · uf_node lvig
Aerial LVIO reconstruction with dense point cloud and flight trajectory.

3 Custom profiles: modes, GNSS, extrinsics, and delays

The released commands above are aliases for YAML files under /opt/ultrafusion/config. For a custom setup, copy the closest released config directory so the main YAML and its camera-intrinsic files keep the same relative layout. Avoid writing a minimal file from scratch: the runtime still reads shared camera, mapping, and noise fields at startup, and reads wheel fields when wheel: 1.

WORK=/tmp/uf_config
mkdir -p "$WORK"
cp -a /opt/ultrafusion/config/m3dgr "$WORK"/

CFG="$WORK/m3dgr/uf_m3dgr_standard.yaml"
${EDITOR:-nano} "$CFG"

roscore &
sleep 3
uf_node "$CFG" &
rosbag play /media/path/to/your.bag --clock

uf_node reads the YAML only at startup. Restart uf_node after changing the copied config.

3.1 Fusion mode switches

Keep imu: 1 for the modes below. Visual sensing is selected by use_image. use_gf_standalone_vio is not the UF visual switch: in the current runtime it only selects the Ground-Fusion standalone backend for pure VIO (use_lidar: 0, use_image: 1, wheel: 0). Ultra-Fusion also has its own native VIO/VIWO/LVIO/LVWIO path; keep use_gf_standalone_vio: false for those UF modes.

Target mode	`use_lidar`	`use_image`	`wheel`	`use_gf_standalone_vio`	Runtime path
UF `lvwio`	`1`	`1`	`1`	`false`	Native UF LiDAR + visual + wheel
UF `lvio`	`1`	`1`	`0`	`false`	Native UF LiDAR + visual
UF `vio`	`0`	`1`	`0`	`false`	Native UF standalone VIO (`UFVIO`)
GF standalone `vio`	`0`	`1`	`0`	`true`	Ground-Fusion standalone VIO baseline
UF `viwo`	`0`	`1`	`1`	`false`	Native UF standalone visual + wheel
UF `wio`	`0`	`0`	`1`	ignored	Native UF standalone wheel + IMU
UF `lio`	`1`	`0`	`0`	ignored	LiDAR + IMU
UF `lwio`	`1`	`0`	`1`	ignored	LiDAR + wheel + IMU

Set depth: 1 only for RGB-D visual profiles that really provide the configured depth image; use depth: 0 for monocular RGB visual profiles. use_lidar_reproject only matters for LiDAR+visual profiles, so keep the copied profile's value unless you are intentionally evaluating that coupling. use_planar_wheel_factor selects the planar wheel factor model; keep the released profile's value unless you are intentionally comparing it with the legacy wheel-pose factor.

Sensor topics are configured in common:

common:
  imu_topic: /camera/imu
  lid_topic: /livox/mid360/lidar
  wheel_topic: /odom
  image0_topic: /camera/color/image_raw/compressed
  image1_topic: /camera/aligned_depth_to_color/image_raw

3.2 Camera intrinsic files

Camera intrinsics are not stored in the main UF YAML. The main YAML points to camodocal/OpenCV calibration YAML files:

cam0_calib: "color.yaml"
cam1_calib: "color.yaml"

cam0_calib is the primary visual camera file. The runtime loads it as <directory-of-main-config>/<cam0_calib>, so keep the calibration file next to the copied main YAML or preserve the released config directory layout as in the copy example above. In the current runtime this field is treated as a path relative to the main config directory; an absolute path will still be prefixed by that directory.

The released visual profiles use PINHOLE or KANNALA_BRANDT camera models:

%YAML:1.0
---
model_type: PINHOLE
camera_name: camera
image_width: 640
image_height: 480
distortion_parameters:
  k1: 0.0
  k2: 0.0
  p1: 0.0
  p2: 0.0
  k3: 0.0
projection_parameters:
  fx: 607.79772949218
  fy: 607.83526613281
  cx: 328.79772949218
  cy: 245.53321838378

For KANNALA_BRANDT, use projection_parameters fields mu, mv, u0, v0, k2, k3, k4, and k5, following the released fisheye-style calibration files. cam1_calib is only used when the runtime is configured for the two-camera path; for the current single-camera/RGB-D public profiles, keep it consistent with the released template. RGB-D depth input is controlled by depth: 1 and common.image1_topic, not by giving the depth image its own camera-intrinsic YAML.

3.3 Optional GNSS fusion

GNSS is independent of the LiDAR/visual/wheel mode switches. UF estimator paths can add raw GNSS pseudorange/Doppler factors and position-only sensor_msgs/NavSatFix factors when the bag provides the required topics. The GF standalone VIO backend receives raw GNSS only; position-only GNSS fixes are not consumed by that backend.

Use case	Main fields	Notes
Disable GNSS	`gnss_enable: 0`	No GNSS subscribers are started
Raw GNSS	`gnss_enable: 1`, `gnss_raw_enable: true`, `gnss_position_enable: false`	Requires range measurements plus ephemeris/iono topics
Position-only GNSS	`gnss_enable: 1`, `gnss_raw_enable: false`, `gnss_position_enable: true`	Uses `sensor_msgs/NavSatFix` in UF estimator paths
Raw + position GNSS	`gnss_enable: 1`, `gnss_raw_enable: true`, `gnss_position_enable: true`	Use only when both measurement types are available

Typical GNSS topic and lever-arm fields:

gnss_meas_topic: /ublox_driver/range_meas
gnss_position_topic: /ublox_driver/receiver_lla
gnss_ephem_topic: /ublox_driver/ephem
gnss_glo_ephem_topic: /ublox_driver/glo_ephem
gnss_iono_params_topic: /ublox_driver/iono_params

gnss_use_antenna_extrinsic: false
gnss_antenna_in_body: [0.0, 0.0, 0.0]

If gnss_use_antenna_extrinsic is true, gnss_antenna_in_body is the antenna position in the estimator body/IMU frame. Do not enable raw GNSS without the matching ephemeris topics; use position-only GNSS in a UF estimator profile or leave GNSS off.

3.4 Extrinsic convention

All extrinsics are under mapping. Ultra-Fusion uses T_A_B to mean "transform a point from frame B into frame A":

p_A = R_A_B * p_B + t_A_B

Rotation arrays are row-major 3x3 matrices, and translations are in meters.

YAML fields	Transform	Meaning
`extrinsic_T`, `extrinsic_R`	`T_I_L`	LiDAR frame `L` to IMU/body frame `I`
`extrinsic_TIC`, `extrinsic_RIC`	`T_I_C`	Camera frame `C` to IMU/body frame `I`
`extrinsic_TCL`, `extrinsic_RCL`	`T_C_L`	LiDAR frame `L` to camera frame `C`
`extrinsic_TOL`, `extrinsic_ROL`	`T_O_L`	LiDAR frame `L` to wheel/odometer frame `O`
`extrinsic_TIO`, `extrinsic_RIO`	`T_I_O`	Wheel/odometer frame `O` to IMU/body frame `I`

Runtime priority:

Runtime transform	How UF obtains it
`T_I_L`	Always reads `mapping.extrinsic_T/R`
`T_I_C`	Uses `mapping.extrinsic_TIC/RIC` if present; otherwise computes `T_I_L * inverse(T_C_L)` from `extrinsic_TCL/RCL`
`T_C_L`	If `T_I_C` is present, UF also derives internal `T_C_L = inverse(T_I_C) * T_I_L`
`T_I_O`	Uses explicit `mapping.extrinsic_TIO/RIO` if present; otherwise computes `T_I_L * inverse(T_O_L)` from `extrinsic_TOL/ROL`

There is no public YAML flag named estimate_wheel_extrinsic. To change the wheel extrinsic, provide extrinsic_TIO/RIO directly or provide a correct extrinsic_TOL/ROL so UF can derive T_I_O.

3.5 Calibration and delay fields

For fixed calibration, keep both visual online-calibration flags at zero:

estimate_extrinsic: 0
estimate_td: 0
td: 0.0

In the current runtime, estimate_extrinsic and estimate_td are treated as a joint online camera-IMU calibration request: any nonzero value starts the T_I_C + td calibration state machine after the visual feature and motion excitation gates pass. The state machine first commits T_I_C, then enters the visual delay (td) stage. Therefore estimate_td: 1 alone should not be read as an isolated delay-only mode. This state machine is driven in the UF LiDAR/visual processing path; pure UF standalone VIO/VIWO and GF standalone VIO keep their T_I_C and td parameter blocks fixed in the solver.

Field	Scope	Code behavior
`estimate_extrinsic`	Camera-IMU	`0` does not request online visual calibration by itself; nonzero requests the joint `T_I_C + td` state machine
`estimate_td`	Visual timing	`0` does not request online visual calibration by itself; nonzero also requests the same `T_I_C + td` state machine
`td`	Visual timing	Visual state time uses `image_timestamp + td`
`common.img_time_offset`	ROS image stamp	Added to the ROS image timestamp before visual buffering; this is separate from `td`
`wheel_initial_td`	Wheel timing	Wheel state time uses `wheel_timestamp + wheel_initial_td`
`TimeSync.initial_lidar_to_imu_dt_sec`	LiDAR-IMU timing	Initial LiDAR-to-IMU time offset
`TimeSync.enable_lidar_imu_online_dt`	LiDAR-IMU timing	Enables online LiDAR-IMU time-offset estimation

The current public configs use wheel_initial_td for wheel timing. Legacy fields such as estimate_td_wheel and td_wheel are not the public switch for wheel-delay calibration.

LiDAR-IMU online extrinsic calibration is configured separately:

lidar_imu_calib:
  enable: false
  enable_lock_result: true
  freeze_after_locked_result: true
  apply_locked_result_to_slam: false

This LiDAR-IMU calibrator estimates the rotation part of T_I_L. Locked rotation results affect SLAM only when apply_locked_result_to_slam: true; the LiDAR-IMU translation used by SLAM remains the YAML translation unless you edit the config.

When checking a new profile, inspect the startup log lines for Opti_TIC, Opti_TIO, td, wheel td, GNSS status, and LiDAR-IMU time sync. A smooth but biased trajectory is often a frame or time-offset error, not just solver tuning.

4. Qualitative Results

4.1 Robustness Under Degradation

Representative stress cases: challenging perception conditions with consistent trajectory and map quality.

4.2 Cross-Platform Results

Trajectory estimation examples on ground, legged, and UAV datasets.

For full-scene playback demos (LiDAR cloud + trajectory in RViz), see Demo preview under §2.1 (M3DGR) and §2.2 (other datasets) above.

5. License and Acknowledgements

This project is licensed under the MIT License. If you find this project useful, please cite:

@article{tian2026ultra,
  title={Ultra-Fusion: A Resilient Tightly-Coupled Multi-Sensor Fusion SLAM Framework under Sensor Degradation and Spatiotemporal Perturbation for Intelligent Transportation Systems},
  author={Tian, Yihong and Zhang, Junjie and Li, Liuyang and Zhang, Deteng and Zuo, Yunfei and Yin, Jie},
  journal={arXiv preprint arXiv:2606.21223},
  year={2026}
}

Please also consider citing our previous works related to this project:

@article{zhang2025towards,
  author={Zhang, Deteng and Zhang, Junjie and Sun, Yan and Li, Tao and Yin, Hao and Xie, Hongzhao and Yin, Jie},
  booktitle={2025 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)}, 
  title={Towards Robust Sensor-Fusion Ground SLAM: A Comprehensive Benchmark and A Resilient Framework}, 
  year={2025},
  volume={},
  number={},
  pages={8894-8901},
  doi={10.1109/IROS60139.2025.11247507}}


@inproceedings{yin2024ground,
  title={Ground-fusion: A low-cost ground slam system robust to corner cases},
  author={Yin, Jie and Li, Ang and Xi, Wei and Yu, Wenxian and Zou, Danping},
  booktitle={2024 IEEE International Conference on Robotics and Automation (ICRA)},
  pages={8603--8609},
  year={2024},
  organization={IEEE}
}
@article{yin2021m2dgr,
  title={M2dgr: A multi-sensor and multi-scenario slam dataset for ground robots},
  author={Yin, Jie and Li, Ang and Li, Tao and Yu, Wenxian and Zou, Danping},
  journal={IEEE Robotics and Automation Letters},
  volume={7},
  number={2},
  pages={2266--2273},
  year={2021},
  publisher={IEEE}
}

This work is self-funded. Thanks Tianbao Zhang for providing computation for some time. For code maitaining, collaboration or bussiness, contact maotian616@gmail.com.

Name		Name	Last commit message	Last commit date
Latest commit History 50 Commits
images		images
.dockerignore		.dockerignore
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Ultra-Fusion: A Resilient Tightly-Coupled Multi-Sensor Fusion SLAM Framework under Sensor Degradation and Spatiotemporal Perturbation for Intelligent Transportation Systems

Overview

Highlights

Method Overview

Why Ultra-Fusion

Benchmarks and Findings

⚙️1. Prerequisites & Installation

1.1 Pull the Docker Image for Environments

1.2 Install the Release Deb of Ultra-Fusion

2 Run Ultra-Fusion on Five benchmakrs

🔥2.1 Run Ultra-Fusion on M3DGR

🔥2.2 Run Ultra-Fusion on more datasets

3 Custom profiles: modes, GNSS, extrinsics, and delays

3.1 Fusion mode switches

3.2 Camera intrinsic files

3.3 Optional GNSS fusion

3.4 Extrinsic convention

3.5 Calibration and delay fields

4. Qualitative Results

4.1 Robustness Under Degradation

4.2 Cross-Platform Results

5. License and Acknowledgements

6. Star History

About

Uh oh!

Releases 1

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Ultra-Fusion: A Resilient Tightly-Coupled Multi-Sensor Fusion SLAM Framework under Sensor Degradation and Spatiotemporal Perturbation for Intelligent Transportation Systems

Overview

Highlights

Method Overview

Why Ultra-Fusion

Benchmarks and Findings

⚙️1. Prerequisites & Installation

1.1 Pull the Docker Image for Environments

1.2 Install the Release Deb of Ultra-Fusion

2 Run Ultra-Fusion on Five benchmakrs

🔥2.1 Run Ultra-Fusion on M3DGR

🔥2.2 Run Ultra-Fusion on more datasets

3 Custom profiles: modes, GNSS, extrinsics, and delays

3.1 Fusion mode switches

3.2 Camera intrinsic files

3.3 Optional GNSS fusion

3.4 Extrinsic convention

3.5 Calibration and delay fields

4. Qualitative Results

4.1 Robustness Under Degradation

4.2 Cross-Platform Results

5. License and Acknowledgements

6. Star History

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages