MambaDSF

Multi-Scale State-Space Model with Dilated Feature Fusion for Sonar Small Target Detection

Notice: This manuscript is publicly available as an arXiv preprint and has been submitted to IEEE Geoscience and Remote Sensing Letters (GRSL).

Overview

MambaDSF is a sonar small-target detection framework that integrates selective state-space modeling with dilated feature fusion. It is designed for forward-looking sonar imagery, where target echoes are often compact, low-contrast, and easily confused with reverberation or background structures.

The framework addresses three practical challenges:

Scarce target pixels: Small sonar targets occupy limited image regions and may collapse to very few feature-map cells after downsampling.
Background ambiguity: Reverberation, speckle-like noise, and object-shaped clutter can produce echo patterns that resemble true targets.
Multi-scale appearance variation: The apparent target size and echo envelope vary with imaging range, viewpoint, and dataset domain.

Architecture

MambaDSF consists of three main components synchronized with the current manuscript:

MambaEFP Backbone: Enhances MambaVision with efficient feature-pyramid propagation for global acoustic context modeling and multi-scale feature extraction.
DFMamba Encoder: Combines dilated local attention with Fusion State-Space Modeling (FusSSM) to align local target details and cross-scale semantic information.
SA-WIoU & CSC Losses: Introduces Scale-Adaptive Weighted IoU (SA-WIoU) for small-target localization and Cross-Scale Semantic Consistency (CSC) for feature alignment across detection scales.

Qualitative Comparison

Qualitative comparison on representative UATD, FLS and MD-FLS test samples across eight detection methods.

Installation

# Clone the repository
git clone https://github.com/IDontKnowAAA/MambaDSF.git
cd MambaDSF

# Install dependencies
pip install -r requirements.txt

# Install mamba-ssm (requires CUDA)
pip install mamba-ssm==1.2.0

Update the paths in configs/dataset/uatd_detection.yml.

Acknowledgements

This work was supported in part by the National Natural Science Foundation of China under Grant 62001443, and in part by the Natural Science Foundation of Shandong Province under Grant ZR2020QE294.

Contact

Hui Lin: harrylin929@gmail.com
Jiayi Li: leanolee58@gmail.com (GitHub)
Jing Wang: wangjingname@gmail.com
Shenghui Rong (Corresponding): rsh@ouc.edu.cn

License

The arXiv preprint is distributed under the arXiv.org perpetual, non-exclusive license.

Please cite the arXiv preprint if you use this work.

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
assets		assets
configs		configs
losses		losses
models		models
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

MambaDSF

Multi-Scale State-Space Model with Dilated Feature Fusion for Sonar Small Target Detection

Overview

Architecture

Qualitative Comparison

Installation

Acknowledgements

Contact

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

MambaDSF

Multi-Scale State-Space Model with Dilated Feature Fusion for Sonar Small Target Detection

Overview

Architecture

Qualitative Comparison

Installation

Acknowledgements

Contact

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages