A Transformer has received a lot of attention in computer vision. Because of global self-attention, the computational complexity of Transformer is quadratic with the number of tokens, leading to limitations for practical applications. Hence, the computational complexity issue can be efficiently resolved by computing the self-attention in groups of smaller fixed-size windows. In this article, we propose a novel pyramid Shuffleand-Reshuffle Transformer (PSRT) for the task of multispectral and hyperspectral image fusion (MHIF). Considering the strong correlation among different patches in remote sensing images and complementary information among patches with high similarity, we design Shuffle-and-Reshuffle (SaR) modules to consider the information interaction among global patches in an efficient manner. Besides, using pyramid structures based on window self-attention, the detail extraction is supported. Extensive experiments on four widely used benchmark datasets demonstrate the superiority of the proposed PSRT with a few parameters compared with several state-of-the-art approaches. The related code is available at https://github.com/Dengshangqi/PSRThttps://github.com/Deng-shangqi/PSRT.

PSRT: Pyramid Shuffle-and-Reshuffle Transformer for Multispectral and Hyperspectral Image Fusion

Vivone Gemine
2023

Abstract

A Transformer has received a lot of attention in computer vision. Because of global self-attention, the computational complexity of Transformer is quadratic with the number of tokens, leading to limitations for practical applications. Hence, the computational complexity issue can be efficiently resolved by computing the self-attention in groups of smaller fixed-size windows. In this article, we propose a novel pyramid Shuffleand-Reshuffle Transformer (PSRT) for the task of multispectral and hyperspectral image fusion (MHIF). Considering the strong correlation among different patches in remote sensing images and complementary information among patches with high similarity, we design Shuffle-and-Reshuffle (SaR) modules to consider the information interaction among global patches in an efficient manner. Besides, using pyramid structures based on window self-attention, the detail extraction is supported. Extensive experiments on four widely used benchmark datasets demonstrate the superiority of the proposed PSRT with a few parameters compared with several state-of-the-art approaches. The related code is available at https://github.com/Dengshangqi/PSRThttps://github.com/Deng-shangqi/PSRT.
2023
Istituto di Metodologie per l'Analisi Ambientale - IMAA
Inglese
61
Art.n.5503715-1
Art.n.5503715-15
15
https://ieeexplore.ieee.org/document/10044141
Sì, ma tipo non specificato
Image enhancement
image fusion
multispectral and hyperspectral image fusion (MHIF)
pyramid structure
remote sensing
Shuffle-and-Reshuffle (SaR) Transformer
6
info:eu-repo/semantics/article
262
Deng, Shangqi; Deng, Liangjian; Wu, Xiao; Ran, Ran; Hong, Danfeng; Vivone, Gemine
01 Contributo su Rivista::01.01 Articolo in rivista
restricted
File in questo prodotto:
File Dimensione Formato  
prod_486626-doc_201944.pdf

solo utenti autorizzati

Descrizione: PSRT: Pyramid Shuffle-and-Reshuffle Transformer for Multispectral and Hyperspectral Image Fusion
Tipologia: Versione Editoriale (PDF)
Dimensione 15.71 MB
Formato Adobe PDF
15.71 MB Adobe PDF   Visualizza/Apri   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/456663
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 182
  • ???jsp.display-item.citation.isi??? 162
social impact