lx6c78/Vision-Mamba-A-Comprehensive-Survey-and-Taxonomy

Vision Mamba: A Comprehensive Survey and Taxonomy

Abstract: A state space model (SSM) is a mathematical model used to describe and analyze the behavior of dynamic systems. SSMs have been applied in many fields, including control theory, signal processing, economics and machine learning. In deep learning, state space models are used to process sequence data in tasks such as time series analysis, natural language processing (NLP) and video understanding. Mapping sequence data to a state space makes long-term dependencies in the data easier to capture. In particular, modern SSMs have shown strong representational capabilities in NLP, especially in long sequence modeling, while maintaining linear time complexity. Notably, building on the latest state space models, Mamba merges time-varying parameters into SSMs and formulates a hardware-aware algorithm for efficient training and inference. Given its impressive efficiency and strong long-range dependency modeling capability, Mamba is expected to become a new AI architecture that may outperform the Transformer. Recently, a number of works have studied the potential of Mamba in various fields, such as general vision, multi-modal learning, medical image analysis and remote sensing image analysis, by extending Mamba from the natural language domain to the visual domain. To fully understand Mamba in the visual domain, we conduct a comprehensive survey and present a taxonomy study. This survey focuses on Mamba's application to a variety of visual tasks and data types, and discusses its predecessors, recent advances and far-reaching impact on a wide range of domains. Since Mamba is now on an upward trend, please let us know if you have new findings; new progress on Mamba will be included in this survey in a timely manner and updated on the website: https://github.com/lx6c78/Vision-Mamba-A-Comprehensive-Survey-and-Taxonomy.
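To make the ideas in the abstract concrete, here is a minimal sketch of a discretized linear SSM recurrence and of a variant with a time-varying (input-dependent) step size. It is an illustration only, not the Mamba implementation: the single scalar state, the function names `ssm_scan` and `selective_scan`, the parameters `w` and `bias`, and the softplus step-size rule are all assumptions made for this example. The loop is a plain sequential scan and does not reproduce the hardware-aware algorithm.

```python
import math


def ssm_scan(xs, a=-1.0, b=1.0, c=1.0, dt=0.1):
    """Fixed-parameter scalar SSM: h_t = a_bar * h_{t-1} + b_bar * x_t, y_t = c * h_t."""
    a_bar = math.exp(dt * a)        # zero-order-hold discretization of a
    b_bar = (a_bar - 1.0) / a * b   # zero-order-hold discretization of b
    h, ys = 0.0, []
    for x in xs:                    # one pass over the sequence: linear time
        h = a_bar * h + b_bar * x
        ys.append(c * h)
    return ys


def softplus(z):
    """Smooth positive map, used here to keep the step size positive."""
    return math.log1p(math.exp(z))


def selective_scan(xs, a=-1.0, b=1.0, c=1.0, w=1.0, bias=0.0):
    """Same recurrence, but the step size depends on the current input.

    w and bias are hypothetical parameters of a toy step-size rule.
    """
    h, ys = 0.0, []
    for x in xs:
        dt = softplus(w * x + bias)     # time-varying, input-dependent step
        a_bar = math.exp(dt * a)
        b_bar = (a_bar - 1.0) / a * b
        h = a_bar * h + b_bar * x
        ys.append(c * h)
    return ys


if __name__ == "__main__":
    constant_input = [1.0] * 200
    # With a = -1, b = c = 1 the state settles near -b / a = 1.
    print(ssm_scan(constant_input)[-1])
    print(selective_scan([0.5, -0.2, 1.0]))
```

Because the state is updated once per element, the cost grows linearly with sequence length, which is the property the abstract contrasts with the Transformer. In the second function the effective decay `a_bar` changes at every step, a simplified stand-in for the time-varying parameters described above.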

We will regularly update this page with the latest representative literature and its released source code. If you have any questions, please don't hesitate to contact us at any of the following emails: liuxiao@stu.cqu.edu.cn, zhangchenxu@cqu.edu.cn, leizhang@cqu.edu.cn

📢 Update Log

  • 2024.05.07: Our paper is released! [arXiv]
  • 2024.05.18: Added "Latest Visual Mamba Papers" column. We plan to update these papers in subsequent versions of our survey.

Citation

If you find this repository useful, please cite our paper:

@misc{liu2024vision,
      title={Vision Mamba: A Comprehensive Survey and Taxonomy}, 
      author={Xiao Liu and Chenxu Zhang and Lei Zhang},
      year={2024},
      eprint={2405.04404},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}

Contents

Related Survey

  • State Space Model for New-Generation Network Alternative to Transformers: A Survey. [15 April, 2024] [ArXiv, 2024]
    Xiao Wang, Shiao Wang, Yuhe Ding, Yuehang Li, Wentao Wu, Yao Rong, Weizhe Kong, Ju Huang, Shihao Li, Haoxiang Yang, Ziwen Wang, Bowei Jiang, Chenglong Li, Yaowei Wang, Yonghong Tian, Jin Tang.
    [Paper] [Github]
  • A Survey on Visual Mamba. [26 April, 2024] [ArXiv, 2024]
    Hanwei Zhang, Ying Zhu, Dan Wang, Lijun Zhang, Tianxiang Chen, Zi Ye.
    [Paper]
  • Mamba-360: Survey of State Space Models as Transformer Alternative for Long Sequence Modelling: Methods, Applications, and Challenges. [24 April, 2024] [ArXiv, 2024]
    Badri Narayana Patro, Vijay Srinivas Agneeswaran.
    [Paper] [Github]
  • A Survey on Vision Mamba: Models, Applications and Challenges. [29 April, 2024] [ArXiv, 2024]
    Rui Xu, Shu Yang, Yihui Wang, Bo Du, Hao Chen.
    [Paper] [Github]

Latest Visual Mamba Papers

We plan to update these papers in subsequent versions of our survey.

  • CLIP-Mamba: CLIP Pretrained Mamba Models with OOD and Hessian Evaluation. [30 April, 2024] [ArXiv, 2024]
    Weiquan Huang, Yifei Shen, Yifan Yang.
    [Paper] [Code]
  • SOAR: Advancements in Small Body Object Detection for Aerial Imagery Using State Space Models and Programmable Gradients. [5 May, 2024] [ArXiv, 2024]
    Tushar Verma, Jyotsna Singh, Yash Bhartari, Rishi Jarwal, Suraj Singh, Shubhkarman Singh.
    [Paper] [Code]
  • SSUMamba: Spatial-Spectral Selective State Space Model for Hyperspectral Image Denoising. [15 May, 2024] [ArXiv, 2024]
    Guanyiman Fu, Fengchao Xiong, Jianfeng Lu, Jun Zhou, Yuntao Qian.
    [Paper] [Code]
  • FER-YOLO-Mamba: Facial Expression Detection and Classification Based on Selective State Space. [9 May, 2024] [ArXiv, 2024]
    Hui Ma, Sen Lei, Turgay Celik, Heng-Chao Li.
    [Paper] [Code]
  • SMCD: High Realism Motion Style Transfer via Mamba-based Diffusion. [5 May, 2024] [ArXiv, 2024]
    Ziyun Qian, Zeyu Xiao, Zhenyi Wu, Dingkang Yang, Mingcheng Li, Shunli Wang, Shuaibing Wang, Dongliang Kou, Lihua Zhang.
    [Paper]
  • DVMSR: Distillated Vision Mamba for Efficient Super-Resolution. [11 May, 2024] [ArXiv, 2024]
    Xiaoyan Lei, Wenlong Zhang, Weifeng Cao.
    [Paper] [Code]
  • AC-MAMBASEG: An adaptive convolution and Mamba-based architecture for enhanced skin lesion segmentation. [5 May, 2024] [ArXiv, 2024]
    Viet-Thanh Nguyen, Van-Truong Pham, Thi-Thao Tran.
    [Paper] [Code]
  • Retinexmamba: Retinex-based Mamba for Low-light Image Enhancement. [6 May, 2024] [ArXiv, 2024]
    Jiesong Bai, Yuhao Yin, Qiyuan He.
    [Paper] [Code]
  • VMambaCC: A Visual State Space Model for Crowd Counting. [6 May, 2024] [ArXiv, 2024]
    Hao-Yuan Ma, Li Zhang, Shuai Shi.
    [Paper]
  • Traj-LLM: A New Exploration for Empowering Trajectory Prediction with Pre-trained Large Language Models. [8 May, 2024] [ArXiv, 2024]
    Zhengxing Lan, Hongbo Li, Lingshan Liu, Bo Fan, Yisheng Lv, Yilong Ren, Zhiyong Cui.
    [Paper]
  • Frequency-Assisted Mamba for Remote Sensing Image Super-Resolution. [8 May, 2024] [ArXiv, 2024]
    Yi Xiao, Qiangqiang Yuan, Kui Jiang, Yuzeng Chen, Qiang Zhang, Chia-Wen Lin.
    [Paper]
  • HC-Mamba: Vision MAMBA with Hybrid Convolutional Techniques for Medical Image Segmentation. [11 May, 2024] [ArXiv, 2024]
    Jiashu Xu.
    [Paper]
  • VM-DDPM: Vision Mamba Diffusion for Medical Image Synthesis. [9 May, 2024] [ArXiv, 2024]
    Zhihan Ju, Wanting Zhou.
    [Paper]
  • Rethinking Efficient and Effective Point-based Networks for Event Camera Classification and Regression: EventMamba. [9 May, 2024] [ArXiv, 2024]
    Hongwei Ren, Yue Zhou, Jiadong Zhu, Haotian Fu, Yulong Huang, Xiaopeng Lin, Yuetong Fang, Fei Ma, Hao Yu, Bojun Cheng.
    [Paper]
  • Sakuga-42M Dataset: Scaling Up Cartoon Research. [12 May, 2024] [ArXiv, 2024]
    Zhenglin Pan, Yu Zhu, Yuxuan Mu.
    [Paper] [Code]
  • GMSR: Gradient-Guided Mamba for Spectral Reconstruction from RGB Images. [13 May, 2024] [ArXiv, 2024]
    Xinying Wang, Zhixiong Huang, Sifan Zhang, Jiawen Zhu, Lin Feng.
    [Paper] [Code]
  • OverlapMamba: Novel Shift State Space Model for LiDAR-based Place Recognition. [13 May, 2024] [ArXiv, 2024]
    Qiuchi Xiang, Jintao Cheng, Jiehao Luo, Jin Wu, Rui Fan, Xieyuanli Chen, Xiaoyu Tang.
    [Paper]
  • MambaOut: Do We Really Need Mamba for Vision? [14 May, 2024] [ArXiv, 2024]
    Weihao Yu, Xinchao Wang.
    [Paper] [Code]
  • Rethinking Scanning Strategies with Vision Mamba in Semantic Segmentation of Remote Sensing Imagery: An Experimental Study. [14 May, 2024] [ArXiv, 2024]
    Qinfeng Zhu, Yuan Fang, Yuanzhi Cai, Cheng Chen, Lei Fan.
    [Paper]
  • IRSRMamba: Infrared Image Super-Resolution via Mamba-based Wavelet Transform Feature Modulation Model. [16 May, 2024] [ArXiv, 2024]
    Yongsong Huang, Tomo Miyazaki, Xiaofeng Liu, Shinichiro Omachi.
    [Paper] [Code]
  • RSDehamba: Lightweight Vision Mamba for Remote Sensing Satellite Image Dehazing. [16 May, 2024] [ArXiv, 2024]
    Huiling Zhou, Xianhao Wu, Hongming Chen, Xiang Chen, Xin He.
    [Paper]

General Vision

1 High-level/Mid-level Vision

1.1 Vision Backbone with Mamba

  • Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model. [10 February, 2024] [ArXiv, 2024]
    Lianghui Zhu, Bencheng Liao, Qian Zhang, Xinlong Wang, Wenyu Liu, Xinggang Wang.
    [Paper] [Code]
  • VMamba: Visual State Space Model. [10 April, 2024] [ArXiv, 2024]
    Yue Liu, Yunjie Tian, Yuzhong Zhao, Hongtian Yu, Lingxi Xie, Yaowei Wang, Qixiang Ye, Yunfan Liu.
    [Paper] [Code]
  • Mamba-ND: Selective State Space Modeling for Multi-Dimensional Data. [19 March, 2024] [ArXiv, 2024]
    Shufan Li, Harkanwar Singh, Aditya Grover.
    [Paper] [Code]
  • LocalMamba: Visual State Space Model with Windowed Selective Scan. [14 March, 2024] [ArXiv, 2024]
    Tao Huang, Xiaohuan Pei, Shan You, Fei Wang, Chen Qian, Chang Xu.
    [Paper] [Code]
  • EfficientVMamba: Atrous Selective Scan for Light Weight Visual Mamba. [14 March, 2024] [ArXiv, 2024]
    Xiaohuan Pei, Tao Huang, Chang Xu.
    [Paper] [Code]
  • SiMBA: Simplified Mamba-Based Architecture for Vision and Multivariate Time series. [24 April, 2024] [ArXiv, 2024]
    Badri N. Patro, Vijay S. Agneeswaran.
    [Paper] [Code]
  • PlainMamba: Improving Non-Hierarchical Mamba in Visual Recognition. [26 March, 2024] [ArXiv, 2024]
    Chenhongyi Yang, Zehui Chen, Miguel Espinosa, Linus Ericsson, Zhenyu Wang, Jiaming Liu, Elliot J. Crowley.
    [Paper] [Code]
  • On the low-shot transferability of [V]-Mamba. [15 March, 2024] [ArXiv, 2024]
    Diganta Misra, Jay Gala, Antonio Orvieto.
    [Paper]
  • DGMamba: Domain Generalization via Generalized State Space Model. [11 April, 2024] [ArXiv, 2024]
    Shaocong Long, Qianyu Zhou, Xiangtai Li, Xuequan Lu, Chenhao Ying, Yuan Luo, Lizhuang Ma, Shuicheng Yan.
    [Paper] [Code]

1.2 Video Analysis and Understanding

  • VideoMamba: State Space Model for Efficient Video Understanding. [March, 2024] [ArXiv, 2024]
    Kunchang Li, Xinhao Li, Yi Wang, Yinan He, Yali Wang, Limin Wang, Yu Qiao.
    [Paper] [Code]
  • Video Mamba Suite: State Space Model as a Versatile Alternative for Video Understanding. [14 March, 2024] [ArXiv, 2024]
    Guo Chen, Yifei Huang, Jilan Xu, Baoqi Pei, Zhe Chen, Zhiqi Li, Jiahao Wang, Kunchang Li, Tong Lu, Limin Wang.
    [Paper] [Code]
  • RhythmMamba: Fast Remote Physiological Measurement with Arbitrary Length Videos. [9 April, 2024] [ArXiv, 2024]
    Bochao Zou, Zizheng Guo, Xiaocheng Hu, Huimin Ma.
    [Paper] [Code]

1.3 Down-stream Visual Applications

  • Res-VMamba: Fine-Grained Food Category Visual Classification Using Selective State Space Models with Deep Residual Learning. [28 April, 2024] [ArXiv, 2024]
    Chi-Sheng Chen, Guan-Ying Chen, Dong Zhou, Di Jiang, Dai-Shi Chen.
    [Paper] [Code]
  • InsectMamba: Insect Pest Classification with State Space Model. [4 April, 2024] [ArXiv, 2024]
    Qianning Wang, Chenglin Wang, Zhixin Lai, Yucheng Zhou.
    [Paper]
  • MiM-ISTD: Mamba-in-Mamba for Efficient Infrared Small Target Detection. [17 March, 2024] [ArXiv, 2024]
    Tianxiang Chen, Zhentao Tan, Tao Gong, Qi Chu, Yue Wu, Bin Liu, Jieping Ye, Nenghai Yu.
    [Paper] [Code]
  • MemoryMamba: Memory-Augmented State Space Model for Defect Recognition. [6 May, 2024] [ArXiv, 2024]
    Qianning Wang, He Hu, Yucheng Zhou.
    [Paper]

2 Low-level Vision

2.1 Image Denoising

  • U-shaped Vision Mamba for Single Image Dehazing. [15 February, 2024] [ArXiv, 2024]
    Zhuoran Zheng, Chen Wu.
    [Paper] [Code]
  • FreqMamba: Viewing Mamba from a Frequency Perspective for Image Deraining. [15 April, 2024] [ArXiv, 2024]
    Zou Zhen, Yu Hu, Zhao Feng.
    [Paper]

2.2 Image Restoration

  • MambaIR: A Simple Baseline for Image Restoration with State-Space Model. [25 March, 2024] [ArXiv, 2024]
    Hang Guo, Jinmin Li, Tao Dai, Zhihao Ouyang, Xudong Ren, Shu-Tao Xia.
    [Paper] [Code]
  • Activating Wider Areas in Image Super-Resolution. [13 March, 2024] [ArXiv, 2024]
    Cheng Cheng, Hang Wang, Hongbin Sun.
    [Paper]
  • CU-Mamba: Selective State Space Models with Channel Learning for Image Restoration. [17 April, 2024] [ArXiv, 2024]
    Rui Deng, Tianpei Gu.
    [Paper]
  • VmambaIR: Visual State Space Model for Image Restoration. [17 March, 2024] [ArXiv, 2024]
    Yuan Shi, Bin Xia, Xiaoyu Jin, Xing Wang, Tianyu Zhao, Xin Xia, Xuefeng Xiao, Wenming Yang.
    [Paper] [Code]
  • Retinexmamba: Retinex-based Mamba for Low-light Image Enhancement. [6 May, 2024] [ArXiv, 2024]
    Jiesong Bai, Yuhao Yin, Qiyuan He.
    [Paper] [Code]

3 3-D Visual Recognition

3.1 Point Cloud Analysis

  • PointMamba: A Simple State Space Model for Point Cloud Analysis. [2 April, 2024] [ArXiv, 2024]
    Dingkang Liang, Xin Zhou, Xinyu Wang, Xingkui Zhu, Wei Xu, Zhikang Zou, Xiaoqing Ye, Xiang Bai.
    [Paper] [Code]
  • Point Cloud Mamba: Point Cloud Learning via State Space Model. [29 March, 2024] [ArXiv, 2024]
    Tao Zhang, Xiangtai Li, Haobo Yuan, Shunping Ji, Shuicheng Yan.
    [Paper] [Code]
  • Point Mamba: A Novel Point Cloud Backbone Based on State Space Model with Octree-Based Ordering Strategy. [17 March, 2024] [ArXiv, 2024]
    Jiuming Liu, Ruiji Yu, Yian Wang, Yu Zheng, Tianchen Deng, Weicai Ye, Hesheng Wang.
    [Paper] [Code]
  • 3DMambaComplete: Exploring Structured State Space Model for Point Cloud Completion. [10 April, 2024] [ArXiv, 2024]
    Yixuan Li, Weidong Yang, Ben Fei.
    [Paper]

3.2 Hyperspectral Imaging Analysis

  • Mamba-FETrack: Frame-Event Tracking via State Space Model. [28 April, 2024] [ArXiv, 2024]
    Ju Huang, Shiao Wang, Shuai Wang, Zhe Wu, Xiao Wang, Bo Jiang.
    [Paper] [Code]

4 Visual Data Generation

  • ZigMa: A DiT-style Zigzag Mamba Diffusion Model. [1 April, 2024] [ArXiv, 2024]
    Vincent Tao Hu, Stefan Andreas Baumann, Ming Gui, Olga Grebenkova, Pingchuan Ma, Johannes Fischer, Björn Ommer.
    [Paper] [Homepage] [Code]
  • Motion Mamba: Efficient and Long Sequence Motion Generation with Hierarchical and Bidirectional Selective SSM. [19 March, 2024] [ArXiv, 2024]
    Zeyu Zhang, Akide Liu, Ian Reid, Richard Hartley, Bohan Zhuang, Hao Tang.
    [Paper] [Homepage] [Code]
  • Gamba: Marry Gaussian Splatting with Mamba for single view 3D reconstruction. [29 March, 2024] [ArXiv, 2024]
    Qiuhong Shen, Xuanyu Yi, Zike Wu, Pan Zhou, Hanwang Zhang, Shuicheng Yan, Xinchao Wang.
    [Paper]
  • Matten: Video Generation with Mamba-Attention. [5 May, 2024] [ArXiv, 2024]
    Yu Gao, Jiancheng Huang, Xiaopeng Sun, Zequn Jie, Yujie Zhong, Lin Ma.
    [Paper]
  • SMCD: High Realism Motion Style Transfer via Mamba-based Diffusion. [5 May, 2024] [ArXiv, 2024]
    Ziyun Qian, Zeyu Xiao, Zhenyi Wu, Dingkang Yang, Mingcheng Li, Shunli Wang, Shuaibing Wang, Dongliang Kou, Lihua Zhang.
    [Paper]

Multi-Modal

1 Heterologous Stream

1.1 Multi-Modal Understanding

  • MambaTalk: Efficient Holistic Gesture Synthesis with Selective State Space Models. [14 March, 2024] [ArXiv, 2024]
    Zunnan Xu, Yukang Lin, Haonan Han, Sicheng Yang, Ronghui Li, Yachao Zhang, Xiu Li.
    [Paper]
  • ReMamber: Referring Image Segmentation with Mamba Twister. [26 March, 2024] [ArXiv, 2024]
    Yuhuan Yang, Chaofan Ma, Jiangchao Yao, Zhun Zhong, Ya Zhang, Yanfeng Wang.
    [Paper]
  • SpikeMba: Multi-Modal Spiking Saliency Mamba for Temporal Video Grounding. [1 April, 2024] [ArXiv, 2024]
    Wenrui Li, Xiaopeng Hong, Xiaopeng Fan.
    [Paper]

1.2 Multi-Modal Large Language Models

  • VL-Mamba: Exploring State Space Models for Multimodal Learning. [20 March, 2024] [ArXiv, 2024]
    Yanyuan Qiao, Zheng Yu, Longteng Guo, Sihan Chen, Zijia Zhao, Mingzhen Sun, Qi Wu, Jing Liu.
    [Paper] [Homepage] [Code]
  • Cobra: Extending Mamba to Multi-Modal Large Language Model for Efficient Inference. [22 March, 2024] [ArXiv, 2024]
    Han Zhao, Min Zhang, Wei Zhao, Pengxiang Ding, Siteng Huang, Donglin Wang.
    [Paper] [Homepage] [Code]

2 Homologous Stream

  • Sigma: Siamese Mamba Network for Multi-Modal Semantic Segmentation. [5 April, 2024] [ArXiv, 2024]
    Zifu Wan, Yuhao Wang, Silong Yong, Pingping Zhang, Simon Stepputtis, Katia Sycara, Yaqi Xie.
    [Paper] [Code]
  • Fusion-Mamba for Cross-modality Object Detection. [14 April, 2024] [ArXiv, 2024]
    Wenhao Dong, Haodong Zhu, Shaohui Lin, Xiaoyan Luo, Yunhang Shen, Xuhui Liu, Juan Zhang, Guodong Guo, Baochang Zhang.
    [Paper]

Vertical Application

1 Remote Sensing Image

1.1 Remote Sensing Image Processing

  • Pan-Mamba: Effective pan-sharpening with State Space Model. [8 March, 2024] [ArXiv, 2024]
    Xuanhua He, Ke Cao, Keyu Yan, Rui Li, Chengjun Xie, Jie Zhang, Man Zhou.
    [Paper] [Code]
  • HSIDMamba: Exploring Bidirectional State-Space Models for Hyperspectral Denoising. [15 April, 2024] [ArXiv, 2024]
    Yang Liu, Jiahua Xiao, Yu Guo, Peilin Jiang, Haiwei Yang, Fei Wang.
    [Paper]

1.2 Remote Sensing Image Classification

  • RSMamba: Remote Sensing Image Classification with State Space Model. [28 March, 2024] [ArXiv, 2024]
    Keyan Chen, Bowen Chen, Chenyang Liu, Wenyuan Li, Zhengxia Zou, Zhenwei Shi.
    [Paper]
  • SpectralMamba: Efficient Mamba for Hyperspectral Image Classification. [12 April, 2024] [ArXiv, 2024]
    Jing Yao, Danfeng Hong, Chenyu Li, Jocelyn Chanussot.
    [Paper] [Code]
  • Spectral-Spatial Mamba for Hyperspectral Image Classification. [29 April, 2024] [ArXiv, 2024]
    Lingbo Huang, Yushi Chen, Xin He.
    [Paper] [Code]
  • S2Mamba: A Spatial-spectral State Space Model for Hyperspectral Image Classification. [28 April, 2024] [ArXiv, 2024]
    Guanchun Wang, Xiangrong Zhang, Zelin Peng, Tianyang Zhang, Xiuping Jia, Licheng Jiao.
    [Paper] [Code]

1.3 Remote Sensing Image Change Detection

  • ChangeMamba: Remote Sensing Change Detection with Spatio-Temporal State Space Model. [14 April, 2024] [ArXiv, 2024]
    Hongruixuan Chen, Jian Song, Chengxi Han, Junshi Xia, Naoto Yokoya.
    [Paper] [Code]
  • RSCaMa: Remote Sensing Image Change Captioning with State Space Model. [2 May, 2024] [ArXiv, 2024]
    Chenyang Liu, Keyan Chen, Bowen Chen, Haotian Zhang, Zhengxia Zou, Zhenwei Shi.
    [Paper] [Code]

1.4 Remote Sensing Image Segmentation

  • Samba: Semantic Segmentation of Remotely Sensed Images with State Space Model. [11 April, 2024] [ArXiv, 2024]
    Qinfeng Zhu, Yuanzhi Cai, Yuan Fang, Yihan Yang, Cheng Chen, Lei Fan, Anh Nguyen.
    [Paper] [Code]
  • RS3Mamba: Visual State Space Model for Remote Sensing Images Semantic Segmentation. [3 April, 2024] [ArXiv, 2024]
    Xianping Ma, Xiaokang Zhang, Man-On Pun.
    [Paper] [Code]
  • RS-Mamba for Large Remote Sensing Image Dense Prediction. [10 April, 2024] [ArXiv, 2024]
    Sijie Zhao, Hao Chen, Xueliang Zhang, Pengfeng Xiao, Lei Bai, Wanli Ouyang.
    [Paper] [Code]

1.5 Remote Sensing Image Fusion

  • FusionMamba: Efficient Image Fusion with State Space Model. [11 April, 2024] [ArXiv, 2024]
    Siran Peng, Xiangyu Zhu, Haoyu Deng, Zhen Lei, Liang-Jian Deng.
    [Paper]
  • A Novel State Space Model with Local Enhancement and State Sharing for Image Fusion. [14 April, 2024] [ArXiv, 2024]
    Zihan Cao, Xiao Wu, Liang-Jian Deng, Yu Zhong.
    [Paper]

2 Medical Image

2.1 Medical Image Segmentation

2.1.1 Preliminary explorations of U-shaped Mamba
  • U-Mamba: Enhancing Long-range Dependency for Biomedical Image Segmentation. [9 January, 2024] [ArXiv, 2024]
    Jun Ma, Feifei Li, Bo Wang.
    [Paper] [Homepage] [Code]
  • VM-UNet: Vision Mamba UNet for Medical Image Segmentation. [4 February, 2024] [ArXiv, 2024]
    Jiacheng Ruan, Suncheng Xiang.
    [Paper] [Code]
  • Mamba-UNet: UNet-Like Pure Visual Mamba for Medical Image Segmentation. [30 March, 2024] [ArXiv, 2024]
    Ziyang Wang, Jian-Qing Zheng, Yichi Zhang, Ge Cui, Lei Li.
    [Paper] [Code]
  • Swin-UMamba: Mamba-based UNet with ImageNet-based pretraining. [6 March, 2024] [ArXiv, 2024]
    Jiarun Liu, Hao Yang, Hong-Yu Zhou, Yan Xi, Lequan Yu, Yizhou Yu, Yong Liang, Guangming Shi, Shaoting Zhang, Hairong Zheng, Shanshan Wang.
    [Paper] [Code]
2.1.2 Improvements to the U-shaped Mamba
  • LightM-UNet: Mamba Assists in Lightweight UNet for Medical Image Segmentation. [11 March, 2024] [ArXiv, 2024]
    Weibin Liao, Yinghao Zhu, Xinyuan Wang, Chengwei Pan, Yasha Wang, Liantao Ma.
    [Paper] [Code]
  • VM-UNET-V2: Rethinking Vision Mamba UNet for Medical Image Segmentation. [14 March, 2024] [ArXiv, 2024]
    Mingya Zhang, Yue Yu, Limei Gu, Tingsheng Lin, Xianping Tao.
    [Paper] [Code]
  • Large Window-based Mamba UNet for Medical Image Segmentation: Beyond Convolution and Self-attention. [12 March, 2024] [ArXiv, 2024]
    Jinhong Wang, Jintai Chen, Danny Chen, Jian Wu.
    [Paper] [Code]
  • H-vmunet: High-order Vision Mamba UNet for Medical Image Segmentation. [20 March, 2024] [ArXiv, 2024]
    Renkai Wu, Yinghao Liu, Pengchen Liang, Qing Chang.
    [Paper] [Code]
  • Integrating Mamba Sequence Model and Hierarchical Upsampling Network for Accurate Semantic Segmentation of Multiple Sclerosis Legion. [26 March, 2024] [ArXiv, 2024]
    Kazi Shahriar Sanjid, Md. Tanzim Hossain, Md. Shakib Shahariar Junayed, Dr. Mohammad Monir Uddin.
    [Paper]
  • Rotate to Scan: UNet-like Mamba with Triplet SSM Module for Medical Image Segmentation. [16 April, 2024] [ArXiv, 2024]
    Hao Tang, Lianglun Cheng, Guoheng Huang, Zhengguang Tan, Junhao Lu, Kaihong Wu.
    [Paper]
  • UltraLight VM-UNet: Parallel Vision Mamba Significantly Reduces Parameters for Skin Lesion Segmentation. [24 April, 2024] [ArXiv, 2024]
    Renkai Wu, Yinghao Liu, Pengchen Liang, Qing Chang.
    [Paper] [Code]
2.1.3 U-shaped Mamba with other methodologies
  • Semi-Mamba-UNet: Pixel-Level Contrastive and Pixel-Level Cross-Supervised Visual Mamba-based UNet for Semi-Supervised Medical Image Segmentation. [29 March, 2024] [ArXiv, 2024]
    Chao Ma, Ziyang Wang.
    [Paper] [Code]
  • Weak-Mamba-UNet: Visual Mamba Makes CNN and ViT Work Better for Scribble-based Medical Image Segmentation. [16 February, 2024] [ArXiv, 2024]
    Ziyang Wang, Chao Ma.
    [Paper] [Code]
  • ProMamba: Prompt-Mamba for polyp segmentation. [26 March, 2024] [ArXiv, 2024]
    Jianhao Xie, Ruofan Liao, Ziang Zhang, Sida Yi, Yuesheng Zhu, Guibo Luo.
    [Paper]
  • P-Mamba: Marrying Perona Malik Diffusion with Mamba for Efficient Pediatric Echocardiographic Left Ventricular Segmentation. [15 March, 2024] [ArXiv, 2024]
    Zi Ye, Tianxiang Chen, Fangyijie Wang, Hanwei Zhang, Guanxi Li, Lijun Zhang.
    [Paper]
2.1.4 Multi-Dimensional Medical Data Segmentation
  • SegMamba: Long-range Sequential Modeling Mamba For 3D Medical Image Segmentation. [25 February, 2024] [ArXiv, 2024]
    Zhaohu Xing, Tian Ye, Yijun Yang, Guang Liu, Lei Zhu.
    [Paper] [Code]
  • nnMamba: 3D Biomedical Image Segmentation, Classification and Landmark Detection with State Space Model. [10 March, 2024] [ArXiv, 2024]
    Haifan Gong, Luoyao Kang, Yitao Wang, Xiang Wan, Haofeng Li.
    [Paper] [Code]
  • T-Mamba: Frequency-Enhanced Gated Long-Range Dependency for Tooth 3D CBCT Segmentation. [1 April, 2024] [ArXiv, 2024]
    Jing Hao, Lei He, Kuo Feng Hung.
    [Paper] [Code]
  • Vivim: a Video Vision Mamba for Medical Video Object Segmentation. [12 March, 2024] [ArXiv, 2024]
    Yijun Yang, Zhaohu Xing, Chunwang Huang, Lei Zhu.
    [Paper] [Code]

2.2 Pathological Diagnosis

  • MedMamba: Vision Mamba for Medical Image Classification. [2 April, 2024] [ArXiv, 2024]
    Yubiao Yue, Zhenzhang Li.
    [Paper] [Code]
  • MamMIL: Multiple Instance Learning for Whole Slide Images with State Space Models. [8 March, 2024] [ArXiv, 2024]
    Zijie Fang, Yifeng Wang, Zhi Wang, Jian Zhang, Xiangyang Ji, Yongbing Zhang.
    [Paper]
  • MambaMIL: Enhancing Long Sequence Modeling with Sequence Reordering in Computational Pathology. [11 March, 2024] [ArXiv, 2024]
    Shu Yang, Yihui Wang, Hao Chen.
    [Paper] [Code]
  • CMViM: Contrastive Masked Vim Autoencoder for 3D Multi-modal Representation Learning for AD classification. [25 March, 2024] [ArXiv, 2024]
    Guangqian Yang, Kangrui Du, Zhihan Yang, Ye Du, Yongping Zheng, Shujun Wang.
    [Paper]
  • SurvMamba: State Space Model with Multi-grained Multi-modal Interaction for Survival Prediction. [11 April, 2024] [ArXiv, 2024]
    Ying Chen, Jiajing Xie, Yuxiang Lin, Yuhang Song, Wenxian Yang, Rongshan Yu.
    [Paper]

2.3 Deformable Image Registration

  • MambaMorph: a Mamba-based Framework for Medical MR-CT Deformable Registration. [12 March, 2024] [ArXiv, 2024]
    Tao Guo, Yinuo Wang, Shihao Shu, Diansheng Chen, Zhouping Tang, Cai Meng, Xiangzhi Bai.
    [Paper] [Code]
  • VMambaMorph: a Visual Mamba-based Framework with Cross-Scan Module for Deformable 3D Image Registration. [7 April, 2024] [ArXiv, 2024]
    Ziyang Wang, Jian-Qing Zheng, Chao Ma, Tao Guo.
    [Paper] [Code]

2.4 Medical Image Reconstruction

  • FD-Vision Mamba for Endoscopic Exposure Correction. [14 February, 2024] [ArXiv, 2024]
    Zhuoran Zheng, Jun Zhang.
    [Paper] [Code]
  • MambaMIR: An Arbitrary-Masked Mamba for Joint Medical Image Reconstruction and Uncertainty Estimation. [19 March, 2024] [ArXiv, 2024]
    Jiahao Huang, Liutao Yang, Fanwen Wang, Yinzhe Wu, Yang Nan, Angelica I. Aviles-Rivero, Carola-Bibiane Schönlieb, Daoqiang Zhang, Guang Yang.
    [Paper] [Code]
  • FusionMamba: Dynamic Feature Enhancement for Multimodal Image Fusion with Mamba. [20 April, 2024] [ArXiv, 2024]
    Xinyu Xie, Yawen Cui, Chio-In Ieong, Tao Tan, Xiaozhi Zhang, Xubin Zheng, Zitong Yu.
    [Paper] [Code]
  • MambaDFuse: A Mamba-based Dual-phase Model for Multi-modality Image Fusion. [12 April, 2024] [ArXiv, 2024]
    Zhe Li, Haiwei Pan, Kejia Zhang, Yuhua Wang, Fengming Yu.
    [Paper]

2.5 Other Medical Tasks

  • MD-Dose: A Diffusion Model based on the Mamba for Radiotherapy Dose Prediction. [13 March, 2024] [ArXiv, 2024]
    Linjie Fu, Xia Li, Xiuding Cai, Yingkai Wang, Xueyao Wang, Yali Shen, Yu Yao.
    [Paper] [Code]
  • Motion-Guided Dual-Camera Tracker for Low-Cost Skill Evaluation of Gastric Endoscopy. [20 April, 2024] [ArXiv, 2024]
    Yuelin Zhang, Wanquan Yan, Kim Yan, Chun Ping Lam, Yufu Qiu, Pengyu Zheng, Raymond Shing-Yan Tang, Shing Shin Cheng.
    [Paper] [Code]

Other Domains

Coming soon.
