Ultra-Fusion: A Resilient Tightly-Coupled Multi-Sensor Fusion SLAM Framework under Sensor Degradation and Spatiotemporal Perturbation for Intelligent Transportation Systems
Core contributors: Yihong Tian, Junjie Zhang, Liuyang Li, and Jie Yin*
Ultra-Fusion is a tightly-coupled multi-sensor SLAM/localization framework for intelligent transportation systems (ITS). It is designed for real deployment where sensor degradation (illumination changes, LiDAR degeneracy, wheel slippage, GNSS outage) and spatiotemporal miscalibration are common.
The system unifies WIO, VIO, LIO, and LVIO in one configurable optimization framework, with optional wheel/GNSS fusion and online calibration.
- Unified sliding-window estimator with timestamp-ordered heterogeneous factors.
- Observability-aware initialization for robust bootstrap under diverse motion/sensor conditions.
- Factor-wise reliability scheduling (FRS) to gate/down-weight degraded measurements.
- Online LiDAR-IMU spatiotemporal calibration during operation.
- Validated on wheeled, legged, and aerial platforms across multiple public benchmarks.
Go to our project website for more details!
Ultra-Fusion converts asynchronous sensor streams into optional factors in one optimization window, with shared state representation, marginalization, and calibration logic.
Unified pipeline: initialization, reliability scheduling, online calibration, and multi-modal fusion in one framework.
Compared with conventional fusion pipelines that are heavily tied to a fixed sensor set, Ultra-Fusion emphasizes:
- Configurability: one framework for WIO/VIO/LIO/LVIO (+ wheel/GNSS).
- Reliability: robust localization under corner-case degradations.
- Deployability: support for long-term and high-speed operation in real ITS scenarios.
- Transferability: validated beyond wheeled robots to legged and aerial platforms.
Ultra-Fusion is evaluated on:
- M3DGR (wheeled, real+sim, sensor degradation),
- M2DGR-Plus(wheeled),
- KAIST (Complex Urban Dataset)(autonomous driving),
- GrandTour (legged),
- MARS-LVIG (aerial).
Across these datasets, the paper reports competitive localization performance and improved availability under:
- sensor degradation (visual/LiDAR/wheel/GNSS),
- temporal/extrinsic perturbations,
- long-duration and high-speed operation.
Note
We currently release executable binaries and demos. Full source code will be released after paper acceptance. Please refer to our prior works Ground-Fusion and Ground-Fusion++ for implementation reference.
The public runtime is tested on Ubuntu 20.04 + ROS Noetic. The Docker image is
only the ROS/runtime environment: it contains ROS Noetic, rosbag, RViz, Ceres,
yaml-cpp, and the required system libraries. It does not contain the
Ultra-Fusion source tree or the release .deb.
#Alibaba Cloud ACR:
docker pull registry.cn-hangzhou.aliyuncs.com/bit_robot_image/ultrafusion:0.1.0
#Docker Hub:
docker pull maotiandocker/ultrafusion:0.1.0
#Or build the public runtime image from the Dockerfile:
docker build -t ultrafusion:0.1.0 .Start a container. The /media mount is optional, but convenient when rosbag
files are stored on the host under /media.
xhost +local:docker
docker run --rm -it --net=host --ipc=host \
-e DISPLAY="${DISPLAY}" \
-e QT_X11_NO_MITSHM=1 \
-v /tmp/.X11-unix:/tmp/.X11-unix:rw \
-v /media:/media:ro \
registry.cn-hangzhou.aliyuncs.com/bit_robot_image/ultrafusion:0.1.0Inside the container, download and install the release package. The same deb is available from GitHub Releases and the project mirror:
wget -O /tmp/ultrafusion.deb \
https://github.com/sjtuyinjie/Ultra-Fusion/releases/download/v0.1.0/ultrafusion_0.1.0_amd64.deb
# Mirror:
# wget -O /tmp/ultrafusion.deb \
# http://47.100.60.229:8088/loc_map/releases/ultrafusion/ultrafusion_0.1.0_amd64.deb
echo "c9a40d62df6100006431598d672c943f23f116e973e9c3b111d76d76c059196c /tmp/ultrafusion.deb" | sha256sum -c -
dpkg -i /tmp/ultrafusion.debThe deb installs:
/opt/ultrafusion/bin/uf_node/usr/bin/uf-nodeand/usr/bin/uf_node/opt/ultrafusion/config/m3dgr/opt/ultrafusion/config/m2p/opt/ultrafusion/config/lvig/opt/ultrafusion/config/kaist/opt/ultrafusion/config/groundtour/opt/ultrafusion/rviz/lio.rviz
Open the included RViz layout with:
rviz -d /opt/ultrafusion/rviz/lio.rvizDownload M3DGR bags and give a star.
Start ROS and play your bag in the usual ROS way. Use one terminal for
roscore, one terminal for rosbag play, and one terminal for uf_node.
#play your bag
roscore & rosbag play /media/path/to/your.bag --clockRun Ultra-Fusion in another shell:
#default setting
uf_node m3dgr
#standard LWIO
uf_node m3dgr_standard
#Stronger vision-coupling
uf_node m3dgr_image_enhanceM3DGR public release profiles:
| Command | Config | Recommended benchmark setting |
|---|---|---|
uf_node m3dgr |
/opt/ultrafusion/config/m3dgr/uf_m3dgr_standard.yaml |
Default M3DGR standard profile |
uf_node m3dgr_standard |
/opt/ultrafusion/config/m3dgr/uf_m3dgr_standard.yaml |
Standard wheeled LVWIO profile: Dynamic01, Varying-illu01, Dark01, and Occlusion01 |
uf_node m3dgr_image_enhance |
/opt/ultrafusion/config/m3dgr/uf_m3dgr_image_enhance.yaml |
Stronger visual-coupling profile: Corridor01, GNSS-denial01, Longtime01, and Longtime02 |
Demo preview (M3DGR). After launching uf_node with the profile above and
playing the corresponding M3DGR bag in RViz (/opt/ultrafusion/rviz/lio.rviz),
you should see live LiDAR mapping and trajectory overlays similar to:
These additional public shortcuts are included for reproducibility. The table lists the dataset sequences/settings covered by the released profiles. Sequences not listed here are outside the public release profile coverage and may require separate parameter retuning.
| Command | Config | Recommended sequences/settings |
|---|---|---|
uf_node m2p |
/opt/ultrafusion/config/m2p/uf_m2p.yaml |
M2DGR-Plus bridge1-style LVWIO setting |
uf_node lvig |
/opt/ultrafusion/config/lvig/uf_lvig.yaml |
MARS-LVIG HKairport01 LVIO setting |
uf_node kaist |
/opt/ultrafusion/config/kaist/uf_kaist.yaml |
KAIST urban25 and urban35 |
uf_node groundtour |
/opt/ultrafusion/config/groundtour/uf_groundtour.yaml |
GrandTour SPX-2, SNOW-2, and EIG-1 |
You can also pass a config path directly:
uf_node /opt/ultrafusion/config/m3dgr/uf_m3dgr_standard.yamlDemo preview (more datasets). The GIFs below match the released shortcuts
in the table above. Run the corresponding uf_node command, play the
recommended sequence, and compare your RViz output with:
The released commands above are aliases for YAML files under
/opt/ultrafusion/config. For a custom setup, copy the closest released config
directory so the main YAML and its camera-intrinsic files keep the same relative
layout. Avoid writing a minimal file from scratch: the runtime still reads
shared camera, mapping, and noise fields at startup, and reads wheel fields when
wheel: 1.
WORK=/tmp/uf_config
mkdir -p "$WORK"
cp -a /opt/ultrafusion/config/m3dgr "$WORK"/
CFG="$WORK/m3dgr/uf_m3dgr_standard.yaml"
${EDITOR:-nano} "$CFG"
roscore &
sleep 3
uf_node "$CFG" &
rosbag play /media/path/to/your.bag --clockuf_node reads the YAML only at startup. Restart uf_node after changing the
copied config.
Keep imu: 1 for the modes below. Visual sensing is selected by use_image.
use_gf_standalone_vio is not the UF visual switch: in the current runtime it
only selects the Ground-Fusion standalone backend for pure VIO
(use_lidar: 0, use_image: 1, wheel: 0). Ultra-Fusion also has its own
native VIO/VIWO/LVIO/LVWIO path; keep use_gf_standalone_vio: false for those
UF modes.
| Target mode | use_lidar |
use_image |
wheel |
use_gf_standalone_vio |
Runtime path |
|---|---|---|---|---|---|
UF lvwio |
1 |
1 |
1 |
false |
Native UF LiDAR + visual + wheel |
UF lvio |
1 |
1 |
0 |
false |
Native UF LiDAR + visual |
UF vio |
0 |
1 |
0 |
false |
Native UF standalone VIO (UFVIO) |
GF standalone vio |
0 |
1 |
0 |
true |
Ground-Fusion standalone VIO baseline |
UF viwo |
0 |
1 |
1 |
false |
Native UF standalone visual + wheel |
UF wio |
0 |
0 |
1 |
ignored | Native UF standalone wheel + IMU |
UF lio |
1 |
0 |
0 |
ignored | LiDAR + IMU |
UF lwio |
1 |
0 |
1 |
ignored | LiDAR + wheel + IMU |
Set depth: 1 only for RGB-D visual profiles that really provide the configured
depth image; use depth: 0 for monocular RGB visual profiles. use_lidar_reproject
only matters for LiDAR+visual profiles, so keep the copied profile's value
unless you are intentionally evaluating that coupling. use_planar_wheel_factor
selects the planar wheel factor model; keep the released profile's value unless
you are intentionally comparing it with the legacy wheel-pose factor.
Sensor topics are configured in common:
common:
imu_topic: /camera/imu
lid_topic: /livox/mid360/lidar
wheel_topic: /odom
image0_topic: /camera/color/image_raw/compressed
image1_topic: /camera/aligned_depth_to_color/image_rawCamera intrinsics are not stored in the main UF YAML. The main YAML points to camodocal/OpenCV calibration YAML files:
cam0_calib: "color.yaml"
cam1_calib: "color.yaml"cam0_calib is the primary visual camera file. The runtime loads it as
<directory-of-main-config>/<cam0_calib>, so keep the calibration file next to
the copied main YAML or preserve the released config directory layout as in the
copy example above. In the current runtime this field is treated as a path
relative to the main config directory; an absolute path will still be prefixed
by that directory.
The released visual profiles use PINHOLE or KANNALA_BRANDT camera models:
%YAML:1.0
---
model_type: PINHOLE
camera_name: camera
image_width: 640
image_height: 480
distortion_parameters:
k1: 0.0
k2: 0.0
p1: 0.0
p2: 0.0
k3: 0.0
projection_parameters:
fx: 607.79772949218
fy: 607.83526613281
cx: 328.79772949218
cy: 245.53321838378For KANNALA_BRANDT, use projection_parameters fields
mu, mv, u0, v0, k2, k3, k4, and k5, following the released
fisheye-style calibration files. cam1_calib is only used when the runtime is
configured for the two-camera path; for the current single-camera/RGB-D public
profiles, keep it consistent with the released template. RGB-D depth input is
controlled by depth: 1 and common.image1_topic, not by giving the depth image
its own camera-intrinsic YAML.
GNSS is independent of the LiDAR/visual/wheel mode switches. UF estimator paths
can add raw GNSS pseudorange/Doppler factors and position-only
sensor_msgs/NavSatFix factors when the bag provides the required topics. The
GF standalone VIO backend receives raw GNSS only; position-only GNSS fixes are
not consumed by that backend.
| Use case | Main fields | Notes |
|---|---|---|
| Disable GNSS | gnss_enable: 0 |
No GNSS subscribers are started |
| Raw GNSS | gnss_enable: 1, gnss_raw_enable: true, gnss_position_enable: false |
Requires range measurements plus ephemeris/iono topics |
| Position-only GNSS | gnss_enable: 1, gnss_raw_enable: false, gnss_position_enable: true |
Uses sensor_msgs/NavSatFix in UF estimator paths |
| Raw + position GNSS | gnss_enable: 1, gnss_raw_enable: true, gnss_position_enable: true |
Use only when both measurement types are available |
Typical GNSS topic and lever-arm fields:
gnss_meas_topic: /ublox_driver/range_meas
gnss_position_topic: /ublox_driver/receiver_lla
gnss_ephem_topic: /ublox_driver/ephem
gnss_glo_ephem_topic: /ublox_driver/glo_ephem
gnss_iono_params_topic: /ublox_driver/iono_params
gnss_use_antenna_extrinsic: false
gnss_antenna_in_body: [0.0, 0.0, 0.0]If gnss_use_antenna_extrinsic is true, gnss_antenna_in_body is the antenna
position in the estimator body/IMU frame. Do not enable raw GNSS without the
matching ephemeris topics; use position-only GNSS in a UF estimator profile or
leave GNSS off.
All extrinsics are under mapping. Ultra-Fusion uses T_A_B to mean
"transform a point from frame B into frame A":
p_A = R_A_B * p_B + t_A_B
Rotation arrays are row-major 3x3 matrices, and translations are in meters.
| YAML fields | Transform | Meaning |
|---|---|---|
extrinsic_T, extrinsic_R |
T_I_L |
LiDAR frame L to IMU/body frame I |
extrinsic_TIC, extrinsic_RIC |
T_I_C |
Camera frame C to IMU/body frame I |
extrinsic_TCL, extrinsic_RCL |
T_C_L |
LiDAR frame L to camera frame C |
extrinsic_TOL, extrinsic_ROL |
T_O_L |
LiDAR frame L to wheel/odometer frame O |
extrinsic_TIO, extrinsic_RIO |
T_I_O |
Wheel/odometer frame O to IMU/body frame I |
Runtime priority:
| Runtime transform | How UF obtains it |
|---|---|
T_I_L |
Always reads mapping.extrinsic_T/R |
T_I_C |
Uses mapping.extrinsic_TIC/RIC if present; otherwise computes T_I_L * inverse(T_C_L) from extrinsic_TCL/RCL |
T_C_L |
If T_I_C is present, UF also derives internal T_C_L = inverse(T_I_C) * T_I_L |
T_I_O |
Uses explicit mapping.extrinsic_TIO/RIO if present; otherwise computes T_I_L * inverse(T_O_L) from extrinsic_TOL/ROL |
There is no public YAML flag named estimate_wheel_extrinsic. To change the
wheel extrinsic, provide extrinsic_TIO/RIO directly or provide a correct
extrinsic_TOL/ROL so UF can derive T_I_O.
For fixed calibration, keep both visual online-calibration flags at zero:
estimate_extrinsic: 0
estimate_td: 0
td: 0.0In the current runtime, estimate_extrinsic and estimate_td are treated as a
joint online camera-IMU calibration request: any nonzero value starts the
T_I_C + td calibration state machine after the visual feature and motion
excitation gates pass. The state machine first commits T_I_C, then enters the
visual delay (td) stage. Therefore estimate_td: 1 alone should not be read
as an isolated delay-only mode. This state machine is driven in the UF
LiDAR/visual processing path; pure UF standalone VIO/VIWO and GF standalone VIO
keep their T_I_C and td parameter blocks fixed in the solver.
| Field | Scope | Code behavior |
|---|---|---|
estimate_extrinsic |
Camera-IMU | 0 does not request online visual calibration by itself; nonzero requests the joint T_I_C + td state machine |
estimate_td |
Visual timing | 0 does not request online visual calibration by itself; nonzero also requests the same T_I_C + td state machine |
td |
Visual timing | Visual state time uses image_timestamp + td |
common.img_time_offset |
ROS image stamp | Added to the ROS image timestamp before visual buffering; this is separate from td |
wheel_initial_td |
Wheel timing | Wheel state time uses wheel_timestamp + wheel_initial_td |
TimeSync.initial_lidar_to_imu_dt_sec |
LiDAR-IMU timing | Initial LiDAR-to-IMU time offset |
TimeSync.enable_lidar_imu_online_dt |
LiDAR-IMU timing | Enables online LiDAR-IMU time-offset estimation |
The current public configs use wheel_initial_td for wheel timing. Legacy
fields such as estimate_td_wheel and td_wheel are not the public switch for
wheel-delay calibration.
LiDAR-IMU online extrinsic calibration is configured separately:
lidar_imu_calib:
enable: false
enable_lock_result: true
freeze_after_locked_result: true
apply_locked_result_to_slam: falseThis LiDAR-IMU calibrator estimates the rotation part of T_I_L. Locked
rotation results affect SLAM only when apply_locked_result_to_slam: true; the
LiDAR-IMU translation used by SLAM remains the YAML translation unless you edit
the config.
When checking a new profile, inspect the startup log lines for Opti_TIC,
Opti_TIO, td, wheel td, GNSS status, and LiDAR-IMU time sync. A smooth but
biased trajectory is often a frame or time-offset error, not just solver tuning.
Representative stress cases: challenging perception conditions with consistent trajectory and map quality.
Trajectory estimation examples on ground, legged, and UAV datasets.
For full-scene playback demos (LiDAR cloud + trajectory in RViz), see Demo preview under §2.1 (M3DGR) and §2.2 (other datasets) above.
This project is licensed under the MIT License. If you find this project useful, please cite:
@article{tian2026ultra,
title={Ultra-Fusion: A Resilient Tightly-Coupled Multi-Sensor Fusion SLAM Framework under Sensor Degradation and Spatiotemporal Perturbation for Intelligent Transportation Systems},
author={Tian, Yihong and Zhang, Junjie and Li, Liuyang and Zhang, Deteng and Zuo, Yunfei and Yin, Jie},
journal={arXiv preprint arXiv:2606.21223},
year={2026}
}Please also consider citing our previous works related to this project:
@article{zhang2025towards,
author={Zhang, Deteng and Zhang, Junjie and Sun, Yan and Li, Tao and Yin, Hao and Xie, Hongzhao and Yin, Jie},
booktitle={2025 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)},
title={Towards Robust Sensor-Fusion Ground SLAM: A Comprehensive Benchmark and A Resilient Framework},
year={2025},
volume={},
number={},
pages={8894-8901},
doi={10.1109/IROS60139.2025.11247507}}
@inproceedings{yin2024ground,
title={Ground-fusion: A low-cost ground slam system robust to corner cases},
author={Yin, Jie and Li, Ang and Xi, Wei and Yu, Wenxian and Zou, Danping},
booktitle={2024 IEEE International Conference on Robotics and Automation (ICRA)},
pages={8603--8609},
year={2024},
organization={IEEE}
}
@article{yin2021m2dgr,
title={M2dgr: A multi-sensor and multi-scenario slam dataset for ground robots},
author={Yin, Jie and Li, Ang and Li, Tao and Yu, Wenxian and Zou, Danping},
journal={IEEE Robotics and Automation Letters},
volume={7},
number={2},
pages={2266--2273},
year={2021},
publisher={IEEE}
}This work is self-funded. Thanks Tianbao Zhang for providing computation for some time. For code maitaining, collaboration or bussiness, contact maotian616@gmail.com.








