Skip to content

sjtuyinjie/Ultra-Fusion

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

50 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Ultra-Fusion: A Resilient Tightly-Coupled Multi-Sensor Fusion SLAM Framework under Sensor Degradation and Spatiotemporal Perturbation for Intelligent Transportation Systems

Website arXiv Dataset Video License Platform

Core contributors: Yihong Tian, Junjie Zhang, Liuyang Li, and Jie Yin*


Ultra-Fusion is a tightly-coupled multi-sensor SLAM/localization framework for intelligent transportation systems (ITS). It is designed for real deployment where sensor degradation (illumination changes, LiDAR degeneracy, wheel slippage, GNSS outage) and spatiotemporal miscalibration are common.

The system unifies WIO, VIO, LIO, and LVIO in one configurable optimization framework, with optional wheel/GNSS fusion and online calibration.


Overview

Highlights

  • Unified sliding-window estimator with timestamp-ordered heterogeneous factors.
  • Observability-aware initialization for robust bootstrap under diverse motion/sensor conditions.
  • Factor-wise reliability scheduling (FRS) to gate/down-weight degraded measurements.
  • Online LiDAR-IMU spatiotemporal calibration during operation.
  • Validated on wheeled, legged, and aerial platforms across multiple public benchmarks.

Go to our project website for more details!

Method Overview

Ultra-Fusion converts asynchronous sensor streams into optional factors in one optimization window, with shared state representation, marginalization, and calibration logic.

Ultra-Fusion pipeline

Unified pipeline: initialization, reliability scheduling, online calibration, and multi-modal fusion in one framework.


Why Ultra-Fusion

Compared with conventional fusion pipelines that are heavily tied to a fixed sensor set, Ultra-Fusion emphasizes:

  1. Configurability: one framework for WIO/VIO/LIO/LVIO (+ wheel/GNSS).
  2. Reliability: robust localization under corner-case degradations.
  3. Deployability: support for long-term and high-speed operation in real ITS scenarios.
  4. Transferability: validated beyond wheeled robots to legged and aerial platforms.

Benchmarks and Findings

Ultra-Fusion is evaluated on:

Across these datasets, the paper reports competitive localization performance and improved availability under:

  • sensor degradation (visual/LiDAR/wheel/GNSS),
  • temporal/extrinsic perturbations,
  • long-duration and high-speed operation.

Note

We currently release executable binaries and demos. Full source code will be released after paper acceptance. Please refer to our prior works Ground-Fusion and Ground-Fusion++ for implementation reference.


⚙️1. Prerequisites & Installation

The public runtime is tested on Ubuntu 20.04 + ROS Noetic. The Docker image is only the ROS/runtime environment: it contains ROS Noetic, rosbag, RViz, Ceres, yaml-cpp, and the required system libraries. It does not contain the Ultra-Fusion source tree or the release .deb.

1.1 Pull the Docker Image for Environments

#Alibaba Cloud ACR:
docker pull registry.cn-hangzhou.aliyuncs.com/bit_robot_image/ultrafusion:0.1.0

#Docker Hub:
docker pull maotiandocker/ultrafusion:0.1.0

#Or build the public runtime image from the Dockerfile:
docker build -t ultrafusion:0.1.0 .

1.2 Install the Release Deb of Ultra-Fusion

Start a container. The /media mount is optional, but convenient when rosbag files are stored on the host under /media.

xhost +local:docker

docker run --rm -it --net=host --ipc=host \
  -e DISPLAY="${DISPLAY}" \
  -e QT_X11_NO_MITSHM=1 \
  -v /tmp/.X11-unix:/tmp/.X11-unix:rw \
  -v /media:/media:ro \
  registry.cn-hangzhou.aliyuncs.com/bit_robot_image/ultrafusion:0.1.0

Inside the container, download and install the release package. The same deb is available from GitHub Releases and the project mirror:

wget -O /tmp/ultrafusion.deb \
  https://github.com/sjtuyinjie/Ultra-Fusion/releases/download/v0.1.0/ultrafusion_0.1.0_amd64.deb

# Mirror:
# wget -O /tmp/ultrafusion.deb \
#   http://47.100.60.229:8088/loc_map/releases/ultrafusion/ultrafusion_0.1.0_amd64.deb

echo "c9a40d62df6100006431598d672c943f23f116e973e9c3b111d76d76c059196c  /tmp/ultrafusion.deb" | sha256sum -c -

dpkg -i /tmp/ultrafusion.deb

The deb installs:

  • /opt/ultrafusion/bin/uf_node
  • /usr/bin/uf-node and /usr/bin/uf_node
  • /opt/ultrafusion/config/m3dgr
  • /opt/ultrafusion/config/m2p
  • /opt/ultrafusion/config/lvig
  • /opt/ultrafusion/config/kaist
  • /opt/ultrafusion/config/groundtour
  • /opt/ultrafusion/rviz/lio.rviz

Open the included RViz layout with:

rviz -d /opt/ultrafusion/rviz/lio.rviz

2 Run Ultra-Fusion on Five benchmakrs

🔥2.1 Run Ultra-Fusion on M3DGR

Download M3DGR bags and give a star. Start ROS and play your bag in the usual ROS way. Use one terminal for roscore, one terminal for rosbag play, and one terminal for uf_node.

#play your bag
roscore & rosbag play /media/path/to/your.bag --clock

Run Ultra-Fusion in another shell:

#default setting
uf_node m3dgr

#standard LWIO
uf_node m3dgr_standard

#Stronger vision-coupling
uf_node m3dgr_image_enhance

M3DGR public release profiles:

Command Config Recommended benchmark setting
uf_node m3dgr /opt/ultrafusion/config/m3dgr/uf_m3dgr_standard.yaml Default M3DGR standard profile
uf_node m3dgr_standard /opt/ultrafusion/config/m3dgr/uf_m3dgr_standard.yaml Standard wheeled LVWIO profile: Dynamic01, Varying-illu01, Dark01, and Occlusion01
uf_node m3dgr_image_enhance /opt/ultrafusion/config/m3dgr/uf_m3dgr_image_enhance.yaml Stronger visual-coupling profile: Corridor01, GNSS-denial01, Longtime01, and Longtime02

Demo preview (M3DGR). After launching uf_node with the profile above and playing the corresponding M3DGR bag in RViz (/opt/ultrafusion/rviz/lio.rviz), you should see live LiDAR mapping and trajectory overlays similar to:

M3DGR Corridor demo
Corridor01 · uf_node m3dgr_image_enhance
Stable localization in an indoor corridor under vision challenge and LiDAR degeneration.
M3DGR GNSS-denial demo
GNSS-denial01 · uf_node m3dgr_image_enhance
Continuous estimation when GNSS measurements are unavailable.
M3DGR Longtime02 demo
Longtime02 · uf_node m3dgr_image_enhance
Long-duration operation with consistent map and path alignment.

🔥2.2 Run Ultra-Fusion on more datasets

These additional public shortcuts are included for reproducibility. The table lists the dataset sequences/settings covered by the released profiles. Sequences not listed here are outside the public release profile coverage and may require separate parameter retuning.

Command Config Recommended sequences/settings
uf_node m2p /opt/ultrafusion/config/m2p/uf_m2p.yaml M2DGR-Plus bridge1-style LVWIO setting
uf_node lvig /opt/ultrafusion/config/lvig/uf_lvig.yaml MARS-LVIG HKairport01 LVIO setting
uf_node kaist /opt/ultrafusion/config/kaist/uf_kaist.yaml KAIST urban25 and urban35
uf_node groundtour /opt/ultrafusion/config/groundtour/uf_groundtour.yaml GrandTour SPX-2, SNOW-2, and EIG-1

You can also pass a config path directly:

uf_node /opt/ultrafusion/config/m3dgr/uf_m3dgr_standard.yaml

Demo preview (more datasets). The GIFs below match the released shortcuts in the table above. Run the corresponding uf_node command, play the recommended sequence, and compare your RViz output with:

GrandTour Arc2 demo
GrandTour Arc2 · uf_node groundtour
Legged-platform mapping and trajectory recovery through a large arc-shaped route.
KAIST urban driving demo
KAIST urban25/35 · uf_node kaist
Large-scale LiDAR mapping with vehicle trajectory visualization.
MARS-LVIG aerial demo
MARS-LVIG HKairport01 · uf_node lvig
Aerial LVIO reconstruction with dense point cloud and flight trajectory.

3 Custom profiles: modes, GNSS, extrinsics, and delays

The released commands above are aliases for YAML files under /opt/ultrafusion/config. For a custom setup, copy the closest released config directory so the main YAML and its camera-intrinsic files keep the same relative layout. Avoid writing a minimal file from scratch: the runtime still reads shared camera, mapping, and noise fields at startup, and reads wheel fields when wheel: 1.

WORK=/tmp/uf_config
mkdir -p "$WORK"
cp -a /opt/ultrafusion/config/m3dgr "$WORK"/

CFG="$WORK/m3dgr/uf_m3dgr_standard.yaml"
${EDITOR:-nano} "$CFG"

roscore &
sleep 3
uf_node "$CFG" &
rosbag play /media/path/to/your.bag --clock

uf_node reads the YAML only at startup. Restart uf_node after changing the copied config.

3.1 Fusion mode switches

Keep imu: 1 for the modes below. Visual sensing is selected by use_image. use_gf_standalone_vio is not the UF visual switch: in the current runtime it only selects the Ground-Fusion standalone backend for pure VIO (use_lidar: 0, use_image: 1, wheel: 0). Ultra-Fusion also has its own native VIO/VIWO/LVIO/LVWIO path; keep use_gf_standalone_vio: false for those UF modes.

Target mode use_lidar use_image wheel use_gf_standalone_vio Runtime path
UF lvwio 1 1 1 false Native UF LiDAR + visual + wheel
UF lvio 1 1 0 false Native UF LiDAR + visual
UF vio 0 1 0 false Native UF standalone VIO (UFVIO)
GF standalone vio 0 1 0 true Ground-Fusion standalone VIO baseline
UF viwo 0 1 1 false Native UF standalone visual + wheel
UF wio 0 0 1 ignored Native UF standalone wheel + IMU
UF lio 1 0 0 ignored LiDAR + IMU
UF lwio 1 0 1 ignored LiDAR + wheel + IMU

Set depth: 1 only for RGB-D visual profiles that really provide the configured depth image; use depth: 0 for monocular RGB visual profiles. use_lidar_reproject only matters for LiDAR+visual profiles, so keep the copied profile's value unless you are intentionally evaluating that coupling. use_planar_wheel_factor selects the planar wheel factor model; keep the released profile's value unless you are intentionally comparing it with the legacy wheel-pose factor.

Sensor topics are configured in common:

common:
  imu_topic: /camera/imu
  lid_topic: /livox/mid360/lidar
  wheel_topic: /odom
  image0_topic: /camera/color/image_raw/compressed
  image1_topic: /camera/aligned_depth_to_color/image_raw

3.2 Camera intrinsic files

Camera intrinsics are not stored in the main UF YAML. The main YAML points to camodocal/OpenCV calibration YAML files:

cam0_calib: "color.yaml"
cam1_calib: "color.yaml"

cam0_calib is the primary visual camera file. The runtime loads it as <directory-of-main-config>/<cam0_calib>, so keep the calibration file next to the copied main YAML or preserve the released config directory layout as in the copy example above. In the current runtime this field is treated as a path relative to the main config directory; an absolute path will still be prefixed by that directory.

The released visual profiles use PINHOLE or KANNALA_BRANDT camera models:

%YAML:1.0
---
model_type: PINHOLE
camera_name: camera
image_width: 640
image_height: 480
distortion_parameters:
  k1: 0.0
  k2: 0.0
  p1: 0.0
  p2: 0.0
  k3: 0.0
projection_parameters:
  fx: 607.79772949218
  fy: 607.83526613281
  cx: 328.79772949218
  cy: 245.53321838378

For KANNALA_BRANDT, use projection_parameters fields mu, mv, u0, v0, k2, k3, k4, and k5, following the released fisheye-style calibration files. cam1_calib is only used when the runtime is configured for the two-camera path; for the current single-camera/RGB-D public profiles, keep it consistent with the released template. RGB-D depth input is controlled by depth: 1 and common.image1_topic, not by giving the depth image its own camera-intrinsic YAML.

3.3 Optional GNSS fusion

GNSS is independent of the LiDAR/visual/wheel mode switches. UF estimator paths can add raw GNSS pseudorange/Doppler factors and position-only sensor_msgs/NavSatFix factors when the bag provides the required topics. The GF standalone VIO backend receives raw GNSS only; position-only GNSS fixes are not consumed by that backend.

Use case Main fields Notes
Disable GNSS gnss_enable: 0 No GNSS subscribers are started
Raw GNSS gnss_enable: 1, gnss_raw_enable: true, gnss_position_enable: false Requires range measurements plus ephemeris/iono topics
Position-only GNSS gnss_enable: 1, gnss_raw_enable: false, gnss_position_enable: true Uses sensor_msgs/NavSatFix in UF estimator paths
Raw + position GNSS gnss_enable: 1, gnss_raw_enable: true, gnss_position_enable: true Use only when both measurement types are available

Typical GNSS topic and lever-arm fields:

gnss_meas_topic: /ublox_driver/range_meas
gnss_position_topic: /ublox_driver/receiver_lla
gnss_ephem_topic: /ublox_driver/ephem
gnss_glo_ephem_topic: /ublox_driver/glo_ephem
gnss_iono_params_topic: /ublox_driver/iono_params

gnss_use_antenna_extrinsic: false
gnss_antenna_in_body: [0.0, 0.0, 0.0]

If gnss_use_antenna_extrinsic is true, gnss_antenna_in_body is the antenna position in the estimator body/IMU frame. Do not enable raw GNSS without the matching ephemeris topics; use position-only GNSS in a UF estimator profile or leave GNSS off.

3.4 Extrinsic convention

All extrinsics are under mapping. Ultra-Fusion uses T_A_B to mean "transform a point from frame B into frame A":

p_A = R_A_B * p_B + t_A_B

Rotation arrays are row-major 3x3 matrices, and translations are in meters.

YAML fields Transform Meaning
extrinsic_T, extrinsic_R T_I_L LiDAR frame L to IMU/body frame I
extrinsic_TIC, extrinsic_RIC T_I_C Camera frame C to IMU/body frame I
extrinsic_TCL, extrinsic_RCL T_C_L LiDAR frame L to camera frame C
extrinsic_TOL, extrinsic_ROL T_O_L LiDAR frame L to wheel/odometer frame O
extrinsic_TIO, extrinsic_RIO T_I_O Wheel/odometer frame O to IMU/body frame I

Runtime priority:

Runtime transform How UF obtains it
T_I_L Always reads mapping.extrinsic_T/R
T_I_C Uses mapping.extrinsic_TIC/RIC if present; otherwise computes T_I_L * inverse(T_C_L) from extrinsic_TCL/RCL
T_C_L If T_I_C is present, UF also derives internal T_C_L = inverse(T_I_C) * T_I_L
T_I_O Uses explicit mapping.extrinsic_TIO/RIO if present; otherwise computes T_I_L * inverse(T_O_L) from extrinsic_TOL/ROL

There is no public YAML flag named estimate_wheel_extrinsic. To change the wheel extrinsic, provide extrinsic_TIO/RIO directly or provide a correct extrinsic_TOL/ROL so UF can derive T_I_O.

3.5 Calibration and delay fields

For fixed calibration, keep both visual online-calibration flags at zero:

estimate_extrinsic: 0
estimate_td: 0
td: 0.0

In the current runtime, estimate_extrinsic and estimate_td are treated as a joint online camera-IMU calibration request: any nonzero value starts the T_I_C + td calibration state machine after the visual feature and motion excitation gates pass. The state machine first commits T_I_C, then enters the visual delay (td) stage. Therefore estimate_td: 1 alone should not be read as an isolated delay-only mode. This state machine is driven in the UF LiDAR/visual processing path; pure UF standalone VIO/VIWO and GF standalone VIO keep their T_I_C and td parameter blocks fixed in the solver.

Field Scope Code behavior
estimate_extrinsic Camera-IMU 0 does not request online visual calibration by itself; nonzero requests the joint T_I_C + td state machine
estimate_td Visual timing 0 does not request online visual calibration by itself; nonzero also requests the same T_I_C + td state machine
td Visual timing Visual state time uses image_timestamp + td
common.img_time_offset ROS image stamp Added to the ROS image timestamp before visual buffering; this is separate from td
wheel_initial_td Wheel timing Wheel state time uses wheel_timestamp + wheel_initial_td
TimeSync.initial_lidar_to_imu_dt_sec LiDAR-IMU timing Initial LiDAR-to-IMU time offset
TimeSync.enable_lidar_imu_online_dt LiDAR-IMU timing Enables online LiDAR-IMU time-offset estimation

The current public configs use wheel_initial_td for wheel timing. Legacy fields such as estimate_td_wheel and td_wheel are not the public switch for wheel-delay calibration.

LiDAR-IMU online extrinsic calibration is configured separately:

lidar_imu_calib:
  enable: false
  enable_lock_result: true
  freeze_after_locked_result: true
  apply_locked_result_to_slam: false

This LiDAR-IMU calibrator estimates the rotation part of T_I_L. Locked rotation results affect SLAM only when apply_locked_result_to_slam: true; the LiDAR-IMU translation used by SLAM remains the YAML translation unless you edit the config.

When checking a new profile, inspect the startup log lines for Opti_TIC, Opti_TIO, td, wheel td, GNSS status, and LiDAR-IMU time sync. A smooth but biased trajectory is often a frame or time-offset error, not just solver tuning.


4. Qualitative Results

4.1 Robustness Under Degradation

Qualitative robustness under sensor degradation

Representative stress cases: challenging perception conditions with consistent trajectory and map quality.

4.2 Cross-Platform Results

Trajectories across ground, legged, and aerial platforms

Trajectory estimation examples on ground, legged, and UAV datasets.

For full-scene playback demos (LiDAR cloud + trajectory in RViz), see Demo preview under §2.1 (M3DGR) and §2.2 (other datasets) above.


5. License and Acknowledgements

This project is licensed under the MIT License. If you find this project useful, please cite:

@article{tian2026ultra,
  title={Ultra-Fusion: A Resilient Tightly-Coupled Multi-Sensor Fusion SLAM Framework under Sensor Degradation and Spatiotemporal Perturbation for Intelligent Transportation Systems},
  author={Tian, Yihong and Zhang, Junjie and Li, Liuyang and Zhang, Deteng and Zuo, Yunfei and Yin, Jie},
  journal={arXiv preprint arXiv:2606.21223},
  year={2026}
}

Please also consider citing our previous works related to this project:

@article{zhang2025towards,
  author={Zhang, Deteng and Zhang, Junjie and Sun, Yan and Li, Tao and Yin, Hao and Xie, Hongzhao and Yin, Jie},
  booktitle={2025 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)}, 
  title={Towards Robust Sensor-Fusion Ground SLAM: A Comprehensive Benchmark and A Resilient Framework}, 
  year={2025},
  volume={},
  number={},
  pages={8894-8901},
  doi={10.1109/IROS60139.2025.11247507}}


@inproceedings{yin2024ground,
  title={Ground-fusion: A low-cost ground slam system robust to corner cases},
  author={Yin, Jie and Li, Ang and Xi, Wei and Yu, Wenxian and Zou, Danping},
  booktitle={2024 IEEE International Conference on Robotics and Automation (ICRA)},
  pages={8603--8609},
  year={2024},
  organization={IEEE}
}
@article{yin2021m2dgr,
  title={M2dgr: A multi-sensor and multi-scenario slam dataset for ground robots},
  author={Yin, Jie and Li, Ang and Li, Tao and Yu, Wenxian and Zou, Danping},
  journal={IEEE Robotics and Automation Letters},
  volume={7},
  number={2},
  pages={2266--2273},
  year={2021},
  publisher={IEEE}
}

This work is self-funded. Thanks Tianbao Zhang for providing computation for some time. For code maitaining, collaboration or bussiness, contact maotian616@gmail.com.

6. Star History

Star History Chart

About

Ultra-Fusion: A Resilient Tightly-Coupled Multi-Sensor Fusion SLAM Framework under Sensor Degradation and Spatiotemporal Perturbation

Topics

Resources

License

Stars

Watchers

Forks

Packages

 
 
 

Contributors