MambaUIE&SR: Unraveling the Ocean's Secrets with Only 2.8 GFLOPs

Bibliographic Details
Main Authors Chen, Zhihao, Ge, Yiyuan
Format Journal Article
Language English
Published 22.04.2024
Subjects
Online Access https://arxiv.org/abs/2404.13884

Abstract Underwater Image Enhancement (UIE) techniques aim to address underwater image degradation caused by light absorption and scattering. In recent years, both Convolutional Neural Network (CNN)-based and Transformer-based methods have been widely explored. In addition, combining CNN and Transformer can effectively integrate global and local information for enhancement. However, this approach is still limited by the quadratic complexity of the Transformer and cannot maximize performance. Recently, the state-space model (SSM) based architecture Mamba has been proposed, which excels at modeling long-range dependencies while maintaining linear complexity. This paper explores the potential of this SSM-based model for UIE from both efficiency and effectiveness perspectives. However, directly applying Mamba performs poorly because local fine-grained features, which are crucial for image enhancement, cannot be fully utilized. We therefore customize the MambaUIE architecture for efficient UIE. Specifically, we introduce visual state space (VSS) blocks to capture global contextual information at the macro level while mining local information at the micro level. For these two kinds of information, we also propose a Dynamic Interaction Block (DIB) and a Spatial feed-forward Network (SGFN) for intra-block feature aggregation. MambaUIE efficiently synthesizes global and local information and maintains a very small number of parameters with high accuracy. Experiments on UIEB datasets show that our method reduces GFLOPs by 67.4% (2.715G) relative to the SOTA method. To the best of our knowledge, this is the first UIE model constructed based on SSM that breaks the limitation of FLOPs on accuracy in UIE. The official repository of MambaUIE is available at https://github.com/1024AILab/MambaUIE.
Author Ge, Yiyuan
Chen, Zhihao
Author_xml – sequence: 1
  givenname: Zhihao
  surname: Chen
  fullname: Chen, Zhihao
– sequence: 2
  givenname: Yiyuan
  surname: Ge
  fullname: Ge, Yiyuan
BackLink https://doi.org/10.48550/arXiv.2404.13884 (View paper in arXiv)
ContentType Journal Article
Copyright http://creativecommons.org/licenses/by-nc-nd/4.0
Copyright_xml – notice: http://creativecommons.org/licenses/by-nc-nd/4.0
DBID AKY
GOX
DOI 10.48550/arxiv.2404.13884
DatabaseName arXiv Computer Science
arXiv.org
DatabaseTitleList
Database_xml – sequence: 1
  dbid: GOX
  name: arXiv.org
  url: http://arxiv.org/find
  sourceTypes: Open Access Repository
DeliveryMethod fulltext_linktorsrc
ExternalDocumentID 2404_13884
GroupedDBID AKY
GOX
ID FETCH-arxiv_primary_2404_138843
IEDL.DBID GOX
IngestDate Tue May 28 12:10:23 EDT 2024
IsDoiOpenAccess true
IsOpenAccess true
IsPeerReviewed false
IsScholarly false
Language English
LinkModel DirectLink
MergedId FETCHMERGED-arxiv_primary_2404_138843
OpenAccessLink https://arxiv.org/abs/2404.13884
ParticipantIDs arxiv_primary_2404_13884
PublicationCentury 2000
PublicationDate 2024-04-22
PublicationDateYYYYMMDD 2024-04-22
PublicationDate_xml – month: 04
  year: 2024
  text: 2024-04-22
  day: 22
PublicationDecade 2020
PublicationYear 2024
Score 3.8347354
SecondaryResourceType preprint
Snippet Underwater Image Enhancement (UIE) techniques aim to address the problem of underwater image degradation due to light absorption and scattering. In recent...
SourceID arxiv
SourceType Open Access Repository
SubjectTerms Computer Science - Computer Vision and Pattern Recognition
Title MambaUIE&SR: Unraveling the Ocean's Secrets with Only 2.8 GFLOPs
URI https://arxiv.org/abs/2404.13884
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link.rule.ids 228,230,786,891
linkProvider Cornell University
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=MambaUIE%26SR%3A+Unraveling+the+Ocean%27s+Secrets+with+Only+2.8+GFLOPs&rft.au=Chen%2C+Zhihao&rft.au=Ge%2C+Yiyuan&rft.date=2024-04-22&rft_id=info:doi/10.48550%2Farxiv.2404.13884&rft.externalDocID=2404_13884